Publication Details
Subword-based spoken term detection in audio course lectures
Norouzian Atta (MCGILL)
Reddy Aarthi (MCGILL)
Coy Andre (MCGILL)
Gupta Vishwa (CRIM)
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
Speech recognition, spoken term detection
This paper regards the subword-based spoken term detection in audio course lectures. It investigates spoken term dection (STD) from audio recordings.
This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices generated offline using an automatic speech recognition (ASR) system configured from a meetings domain. An efficient STD approach is presented where lattice paths which are likely to contain search terms are identified and an efficient phone based distance is used to detect the occurrence of search terms in phonetic expansions of promising lattice paths. STD and ASR results are reported for both in-vocabulary (IV) and outof- vocabulary (OOV) search terms in this lecture speech domain.
@INPROCEEDINGS{FITPUB9312, author = "Richard Rose and Atta Norouzian and Aarthi Reddy and Andre Coy and Vishwa Gupta and Martin Karafi\'{a}t", title = "Subword-based spoken term detection in audio course lectures", pages = "5282--5285", booktitle = "Proc. International Conference on Acoustics, Speech, and Signal Processing", journal = "Proc. International Conference on Acoustics, Speech, and Signal Processing", volume = 2010, number = 3, year = 2010, location = "Dallas, US", publisher = "IEEE Signal Processing Society", ISBN = "978-1-4244-4296-6", ISSN = "1520-6149", language = "english", url = "https://www.fit.vut.cz/research/publication/9312" }