Publication Details

Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition

NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej and BURGET Lukáš. Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 4330-4334. ISSN 1990-9772. Available from: https://www.isca-speech.org/archive/Interspeech_2019/pdfs/1757.pdf

Czech title

Faktorizace diskriminativně trénovaného extraktoru i-vektorů pro rozpoznávání mluvčího

Type

conference paper

Language

english

Authors

Novotný Ondřej, Ing., Ph.D. (DCGM FIT BUT)
Plchot Oldřich, Ing., Ph.D. (DCGM FIT BUT)
Glembek Ondřej, Ing., Ph.D. (DCGM FIT BUT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)

URL

Keywords

SRE

Abstract

In this work, we continue in our research on i-vector extractor for speaker verification (SV) and we optimize its architecture for fast and effective discriminative training. We were motivated by computational and memory requirements caused by the large number of parameters of the original generative ivector model. Our aim is to preserve the power of the original generative model, and at the same time focus the model towards extraction of speaker-related information. We show that it is possible to represent a standard generative i-vector extractor by a model with significantly less parameters and obtain similar performance on SV tasks. We can further refine this compact model by discriminative training and obtain i-vectors that lead to better performance on various SV benchmarks representing different acoustic domains.

Published

2019

Pages

4330-4334

Journal

Proceedings of Interspeech - on-line, vol. 2019, no. 9, ISSN 1990-9772

Proceedings

Proceedings of Interspeech

Conference

Interspeech Conference, Graz, AT

Publisher

International Speech Communication Association

Place

Graz, AT

DOI

10.21437/Interspeech.2019-1757

UT WoS

000831796404095

EID Scopus

2-s2.0-85074713812

BibTeX

@INPROCEEDINGS{FITPUB12091,
   author = "Ond\v{r}ej Novotn\'{y} and Old\v{r}ich Plchot and Ond\v{r}ej Glembek and Luk\'{a}\v{s} Burget",
   title = "Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition",
   pages = "4330--4334",
   booktitle = "Proceedings of Interspeech",
   journal = "Proceedings of Interspeech - on-line",
   volume = 2019,
   number = 9,
   year = 2019,
   location = "Graz, AT",
   publisher = "International Speech Communication Association",
   ISSN = "1990-9772",
   doi = "10.21437/Interspeech.2019-1757",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/12091"
}