Publication Details

An Empirical evaluation of zero resource acoustic unit discovery

LIU Chunxi, YANG Jinyi, SUN Ming, KESIRAJU Santosh, ROTT Alena, ONDEL Yang Lucas Antoine Francois, GHAHREMANI Pegah, DEHAK Najim, BURGET Lukáš and KHUDANPUR Sanjeev. An Empirical evaluation of zero resource acoustic unit discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 5305-5309. ISBN 978-1-5090-4117-6.
Czech title
Empirické hodnocení automatického hledání řečových jednotek bez popsaných trénovacích dat
Type
conference paper
Language
english
Authors
Liu Chunxi (JHU)
Yang Jinyi (JHU)
Sun Ming (AmazonCom)
Kesiraju Santosh (DCGM FIT BUT)
Rott Alena (USTAN)
Ondel Yang Lucas Antoine Francois, Mgr., Ph.D. (DCGM FIT BUT)
Ghahremani Pegah (JHU)
Dehak Najim (JHU)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Khudanpur Sanjeev (JHU)
URL
Keywords

Acoustic unit discovery, unsupervised linear discriminant analysis, evaluation methods, zero resource

Abstract

Acoustic unit discovery (AUD) is a process of automatically identifying a categorical acoustic unit inventory from speech and producing corresponding acoustic unit tokenizations. AUD provides an important avenue for unsupervised acoustic model training in a zero resource setting where expert-provided linguistic knowledge and transcribed speech are unavailable. Therefore, to further facilitate zero-resource AUD process, in this paper, we demonstrate acoustic feature representations can be significantly improved by (i) performing linear discriminant analysis (LDA) in an unsupervised self-trained fashion, and (ii) leveraging resources of other languages through building a multilingual bottleneck (BN) feature extractor to give effective cross-lingual generalization. Moreover, we perform comprehensive evaluations of AUD efficacy on multiple downstream speech applications, and their correlated performance suggests that AUD evaluations are feasible using different alternative language resources when only a subset of these evaluation resources can be available in typical zero resource applications.

Annotation

Acoustic unit discovery (AUD) is a process of automatically identifying a categorical acoustic unit inventory from speech and producing corresponding acoustic unit tokenizations. AUD provides an important avenue for unsupervised acoustic model training in a zero resource setting where expert-provided linguistic knowledge and transcribed speech are unavailable. Therefore, to further facilitate zero-resource AUD process, in this paper, we demonstrate acoustic feature representations can be significantly improved by (i) performing linear discriminant analysis (LDA) in an unsupervised self-trained fashion, and (ii) leveraging resources of other languages through building a multilingual bottleneck (BN) feature extractor to give effective cross-lingual generalization. Moreover, we perform comprehensive evaluations of AUD efficacy on multiple downstream speech applications, and their correlated performance suggests that AUD evaluations are feasible using different alternative language resources when only a subset of these evaluation resources can be available in typical zero resource applications.

Published
2017
Pages
5305-5309
Proceedings
Proceedings of ICASSP 2017
Conference
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), New Orleans, USA, US
ISBN
978-1-5090-4117-6
Publisher
IEEE Signal Processing Society
Place
New Orleans, US
DOI
UT WoS
000414286205093
EID Scopus
BibTeX
@INPROCEEDINGS{FITPUB11471,
   author = "Chunxi Liu and Jinyi Yang and Ming Sun and Santosh Kesiraju and Alena Rott and Francois Antoine Lucas Yang Ondel and Pegah Ghahremani and Najim Dehak and Luk\'{a}\v{s} Burget and Sanjeev Khudanpur",
   title = "An Empirical evaluation of zero resource acoustic unit discovery",
   pages = "5305--5309",
   booktitle = "Proceedings of ICASSP 2017",
   year = 2017,
   location = "New Orleans, US",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-5090-4117-6",
   doi = "10.1109/ICASSP.2017.7953169",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/11471"
}
Back to top