UMotion - Informatique - 2018 - Odyssey - Odyssey 2018

Prendre des notes

Il n’y a pas de note disponible pour vous pour cette vidéo.

Connectez-vous pour en créer une nouvelle.

Disciplines

Types

Mots clés

Informatique

Noise Robustness

\r\n\r\n

Ondřej Novotný, Oldřich Plchot, Pavel Matějka, Ladislav Mošner and Ondřej Glembek

\r\n\r\n

Text-independent speaker verification (SV) is currently in the process of embracing DNN modeling in every stage of SV system. Slowly, the DNN-based approaches such as end-to-end modelling and systems based on DNN embeddings start to be competitive even in challenging and diverse channel conditions of recent NIST SREs. Domain adaptation and need for large amount of training data are still a challenge for current discriminative systems and (unlike with generative models), we see significant gains from data augmentation, simulation and other techniques designed to overcome lack of the training data. We present an analysis of an SV system based on DNN embeddings (x-vectors) and focus on robustness across diverse data domains such as standard telephone and microphone conversations, both in clean, noisy and reverberant environments. We also evaluate the system on challenging far-field data created by re-transmitting a subset of NIST SRE 2008 and 2010 microphone interviews. We compare our results with the state-of-the-art i-vector system. In general, we were able to achieve better performance with the DNN-based systems, but most importantly, we have confirmed the robustness of such systems across multiple data domains.

\r\n\r\n

Cite as: Novotný, O., Plchot, O., Matějka, P., Mošner, L., Glembek, O. (2018) On the use of X-vectors for Robust Speaker Recognition. Proc. Odyssey 2018 The Speaker and Language Recognition Workshop, 168-175, DOI: 10.21437/Speaker Odyssey.2018-24.

\r\n\r\n

\r\n

\r\n\r\n

\r\n

Ajouté par : Emmanuelle Billard (ebillard)
Ajouté le : 17 octobre 2018 02:00
Chaîne :
- Informatique
Type : Enseignement
Langue principale : Français
Discipline(s) :
- Informatique
- Stic

Informatique

2018 - Odyssey

Odyssey 2018 - On the use of X-vectors for Robust Speaker Recognition

Noise Robustness

Infos