Publication Details
BUT ASR System for BABEL Surprise Evaluation 2014
Veselý Karel, Ing., Ph.D. (DCGM FIT BUT)
Szőke Igor, Ing., Ph.D. (DCGM FIT BUT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Grézl František, Ing., Ph.D. (DCGM FIT BUT)
Hannemann Mirko, Dipl.-Ing. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
speech recognition, discriminative training, bottle-neck neural networks, deep neural networks, adaptation of neural networks, noisy speech
This paper describes Brno University of Technology (BUT) ASR system for 2014 BABEL Surprise language evaluation (Tamil).
The paper describes Brno University of Technology (BUT) ASR system for 2014 BABEL Surprise language evaluation (Tamil). While being largely based on our previous work, two original contributions were brought: (1) speaker-adapted bottle-neck neural network (BN) features were investigated as an input to DNN recognizer and semi-supervised training was found effective. (2) Adding of noise to training data outperformed a classical de-noising technique while dealing with noisy test data was found beneficial, and the performance of this approach was verified on a relatively clean training/test data setup from a different language. All results are reported on BABEL 2014 Tamil data.
@INPROCEEDINGS{FITPUB10799, author = "Martin Karafi\'{a}t and Karel Vesel\'{y} and Igor Sz\H{o}ke and Luk\'{a}\v{s} Burget and Franti\v{s}ek Gr\'{e}zl and Mirko Hannemann and Jan \v{C}ernock\'{y}", title = "BUT ASR System for BABEL Surprise Evaluation 2014", pages = "501--506", booktitle = "Proceedings of 2014 Spoken Language Technology Workshop", year = 2014, location = "South Lake Tahoe, Nevada, US", publisher = "IEEE Signal Processing Society", ISBN = "978-1-4799-7129-9", doi = "10.1109/SLT.2014.7078625", language = "english", url = "https://www.fit.vut.cz/research/publication/10799" }