Publication Details

Convolutive Bottleneck Network Features for LVCSR

VESELÝ Karel, KARAFIÁT Martin and GRÉZL František. Convolutive Bottleneck Network Features for LVCSR. In: Proceedings of ASRU 2011. Big Island, Hawaii: IEEE Signal Processing Society, 2011, pp. 42-47. ISBN 978-1-4673-0366-8.
Czech title
Příznaky z konvoluční sítě s úzkým hrdlem pro LVCSR
Type
conference paper
Language
english
Authors
URL
Keywords

Bottleneck features, Tandem LVCSR system, linear bottleneck, Convolutional Bottleneck Network

Abstract

Workshop Article about novel features for tandem LVCSR system, which are based on Convolutive Bottleneck Network. It extends the previous work on Universal Context network by using linear bottleneck and expansion to Convolutive Bottleneck Network,

so all the parameters are trained together.
Annotation

In this paper, we focus on improvements of the bottleneck ANN in a Tandem LVCSR system. First, the influence of training set size and the ANN size is evaluated. Second, a very positive effect of linear bottleneck is shown. Finally a Convolutive Bottleneck Network is proposed as extension of the current stateof- the-art Universal Context Network. The proposed training method leads to 5.5% relative reduction of WER, compared to the Universal Context ANN baseline. The relative improvement compared to the 5-layer single-bottleneck network is 17.7%. The dataset ctstrain07 composed of more than 2000 hours of English Conversational Telephone Speech was used for the experiments. The TNet toolkit with CUDA GPGPU implementation was used for fast training.

Published
2011
Pages
42-47
Proceedings
Proceedings of ASRU 2011
Conference
IEEE 2011 Workshop on Automatic Speech Recognition and Understanding, Hilton Waikoloa Village Resort, Big Island, Hawaii, US
ISBN
978-1-4673-0366-8
Publisher
IEEE Signal Processing Society
Place
Big Island, Hawaii, US
BibTeX
@INPROCEEDINGS{FITPUB9763,
   author = "Karel Vesel\'{y} and Martin Karafi\'{a}t and Franti\v{s}ek Gr\'{e}zl",
   title = "Convolutive Bottleneck Network Features for LVCSR",
   pages = "42--47",
   booktitle = "Proceedings of ASRU 2011",
   year = 2011,
   location = "Big Island, Hawaii, US",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-4673-0366-8",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9763"
}
Back to top