UMotion - Informatique - 2018 - Odyssey - Odyssey 2018

Prendre des notes

Il n’y a pas de note disponible pour vous pour cette vidéo.

Connectez-vous pour en créer une nouvelle.

Disciplines

Types

Mots clés

Informatique

Noise Robustness

\r\n\r\n

Weiwei Lin, Man-Wai Mak, Longxin Li and Jen-Tzung Chien

\r\n\r\n

Domain mismatch, caused by the discrepancy between training and test data, can severely degrade the performance of speaker verification (SV) systems. What's more, both training and test data themselves could be composed of heterogeneous subsets, with each subset corresponding to one sub-domain. These multi-source mismatches can further degrade SV performance. This paper proposes incorporating maximum mean discrepancy (MMD) into the loss function of autoencoders to reduce theses mismatches. Specifically, we generalize MMD to measure the discrepancies among multiple distributions. We call this generalized MMD as domain-wise MMD. Using domain-wise MMD as an objective function, we derive a domain-invariant autoencoder (DAD) for multi-source i-vector adaptation. The DAD directly encodes the features that minimize the multi-source mismatch. By replacing the original i-vectors with these domain-invariant feature vectors for PLDA training, we reduce the EER by 11.8% in NIST 2016 SRE when compared to PLDA without adaptation.

\r\n\r\n

Cite as: Lin, W., Mak, M., Li, L., Chien, J. (2018) Reducing Domain Mismatch by Maximum Mean Discrepancy Based Autoencoders . Proc. Odyssey 2018 The Speaker and Language Recognition Workshop, 162-167, DOI: 10.21437/Speaker Odyssey.2018-23.

\r\n

Ajouté par : Emmanuelle Billard (ebillard)
Ajouté le : 17 octobre 2018 02:00
Chaîne :
- Informatique
Type : Enseignement
Langue principale : Français
Discipline(s) :
- Informatique
- Stic

Informatique

2018 - Odyssey

Odyssey 2018 - Reducing Domain Mismatch by Maximum Mean Discrepancy Based Autoencoders

Noise Robustness

Infos