Publication Details
BUT 2014 Babel System: Analysis of adaptation in NN based systems
Grézl František, Ing., Ph.D. (DCGM FIT BUT)
Veselý Karel, Ing., Ph.D. (DCGM FIT BUT)
Hannemann Mirko, Dipl.-Ing. (DCGM FIT BUT)
Szőke Igor, Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
speech recognition, discriminative training,
bottle-neck neural networks, deep neural networks, adaptation
of neural networks, fundamental frequency
The article is about the BUT 2014 Babel System. It describes the analysis of adaptation in neural networks based systems.
Features based on a hierarchy of neural networks with compressive layers - Stacked Bottle-Neck (SBN) features - were recently shown to provide excellent performance in LVCSR systems. This paper summarizes several techniques investigated in our work towards Babel 2014 evaluations: (1) using several versions of fundamental frequency (F0) estimates, (2) semi-supervised training on un-transcribed data and mainly (3) adapting the NN structure at different levels. They are tested on three 2014 Babel languages with full GMM- and DNN-based systems. Separately and in combination, they are shown to outperform the baselines and confirm the usefulness of bottle-neck features in current ASR systems.
@INPROCEEDINGS{FITPUB10745, author = "Martin Karafi\'{a}t and Franti\v{s}ek Gr\'{e}zl and Karel Vesel\'{y} and Mirko Hannemann and Igor Sz\H{o}ke and Jan \v{C}ernock\'{y}", title = "BUT 2014 Babel System: Analysis of adaptation in NN based systems", pages = "3002--3006", booktitle = "Proceedings of Interspeech 2014", year = 2014, location = "Singapore, SG", publisher = "International Speech Communication Association", ISBN = "978-1-63439-435-2", language = "english", url = "https://www.fit.vut.cz/research/publication/10745" }