Publication Details
Multilingual Region-Dependent Transforms
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Grézl František, Ing., Ph.D. (DCGM FIT BUT)
Veselý Karel, Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
Automatic speech recognition, Region-Dependent Transforms, Multilingual speech recognition, Feedforward neural networks
This paper presents our further steps in the development of a feature extraction scheme that is easily transferable to a new language with severely limited training data.
In recent years, trained feature extraction (FE) schemes based on neural networks have replaced or complemented traditional approaches in top-performing systems. This paper deals with FE in multilingual scenarios where the target language has a low amount of transcribed data. Continuing our previous work on multilingual training of Stacked Bottle-Neck Neural Network FE schemes, we concentrate on improving the discriminatively trained Region-Dependent Transforms (RDT). We show that multilingual training of RDT can be implemented by merging statistics from several languages. In our case, we used up to 11 source languages to build an FE scheme that generalizes well to a new language. This allows us to build a strong bootstrapping model for the final ASR system. The results are produced on IARPA Babel data.
@INPROCEEDINGS{FITPUB11146,
   author = "Martin Karafi\'{a}t and Luk\'{a}\v{s} Burget and Franti\v{s}ek Gr\'{e}zl and Karel Vesel\'{y} and Jan \v{C}ernock\'{y}",
   title = "Multilingual Region-Dependent Transforms",
   pages = "5430--5434",
   booktitle = "Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016",
   year = 2016,
   location = "Shanghai, CN",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-4799-9988-0",
   doi = "10.1109/ICASSP.2016.7472715",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/11146"
}