Publication Details

Analysis of X-Vectors for Low-Resource Speech Recognition

KARAFIÁT Martin, VESELÝ Karel, ČERNOCKÝ Jan, PROFANT Ján, NYTRA Jiří, HLAVÁČEK Miroslav and PAVLÍČEK Tomáš. Analysis of X-Vectors for Low-Resource Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 6998-7002. ISBN 978-1-7281-7605-5.
Czech title
Analýza x-vektorů pro rozpoznávání řeči s omezenými zdroji
Type
conference paper
Language
English
Authors
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
Veselý Karel, Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
Profant Ján (Phonexia)
Nytra Jiří (Phonexia)
Hlaváček Miroslav (Phonexia)
Pavlíček Tomáš, Ing. (Phonexia)
URL
Keywords

speech recognition, adaptation, x-vectors, data augmentation, robustness

Abstract

The paper presents a study of the usability of x-vectors for the adaptation of automatic speech recognition (ASR) systems. X-vectors are neural network (NN)-based speaker embeddings recently proposed in speaker recognition (SR). They quickly replaced the commonly used i-vectors and became the new state-of-the-art technique. Here, the same approach is adopted for ASR in the hope of a similar outcome. All experiments were carried out on an ASR system built for the latest IARPA MATERIAL evaluation, which targeted the Pashto language. An absolute improvement of over 1% was observed with x-vectors over traditional i-vectors, even when the x-vector extractor was not trained on target Pashto data.

Published
2021
Pages
6998-7002
Proceedings
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference
2021 IEEE International Conference on Acoustics, Speech and Signal Processing, Toronto, CA
ISBN
978-1-7281-7605-5
Publisher
IEEE Signal Processing Society
Place
Toronto, Ontario, CA
DOI
10.1109/ICASSP39728.2021.9414725
UT WoS
000704288407055
EID Scopus
BibTeX
@INPROCEEDINGS{FITPUB12525,
   author = "Martin Karafi\'{a}t and Karel Vesel\'{y} and Jan \v{C}ernock\'{y} and J\'{a}n Profant and Ji\v{r}\'{i} Nytra and Miroslav Hlav\'{a}\v{c}ek and Tom\'{a}\v{s} Pavl\'{i}\v{c}ek",
   title = "Analysis of X-Vectors for Low-Resource Speech Recognition",
   pages = "6998--7002",
   booktitle = "ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)",
   year = 2021,
   location = "Toronto, Ontario, CA",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-7281-7605-5",
   doi = "10.1109/ICASSP39728.2021.9414725",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/12525"
}