all InfoSec news
DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition. (arXiv:2307.02751v1 [cs.SD])
cs.CR updates on arXiv.org arxiv.org
Speaker recognition is a biometric modality that utilizes the speaker's
speech segments to recognize the identity, determining whether the test speaker
belongs to one of the enrolled speakers. In order to improve the robustness of
the i-vector framework on cross-channel conditions and explore the nova method
for applying deep learning to speaker recognition, the Stacked Auto-encoders
are used to get the abstract extraction of the i-vector instead of applying
PLDA. After pre-processing and feature extraction, the speaker and
channel-independent speeches …
auto biometric channel conditions deep learning framework identity nova order recognition robustness speakers speech s speech test