Result Details

All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task

MOTLÍČEK, P.; ČERNOCKÝ, J. All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task. 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. Lecture Notes in Computer Science. České Budějovice: University of West Bohemia in Pilsen, 2003. no. 09, p. 295-300. ISBN: 3-540-20024-X. ISSN: 0302-9743.

Type

conference paper

Language

English

Authors

Motlíček Petr, doc. Ing., Ph.D., FIT (FIT)
Černocký Jan, prof. Dr. Ing.

Abstract

All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task

Keywords

speech, speech processing, feature extraction ,speech recognittion, PLP, MFCC

URL

Annotation

This contribution investigates various all-pole modeling based feature extraction techniques for robust ASR. Final performances are compared to well-known Mel-frequency cepstral coefficients. At the beginning, frequency independent approach of modeling speech power spectra that increases the performance of ASR system mainly in case of large mismatch between training and testing data is studied. Then, we focus on different types of features that can be extracted from all-pole model to reduce the overall word error rate. Achieved recognition performances show that line spectral frequencies are more suitable parameters for ASR where the input speech is corrupted by different types of real noises than cepstrum based features. In experiments SpeechDat-Car databases used for front-end evaluation of advanced distributed speech recognition (DSR) systems were used.

Published

2003

Pages

295–300

Journal

Lecture Notes in Computer Science, vol. 2003, no. 09, ISSN 0302-9743

Proceedings

6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings

Conference

International Conference on Text Speech and Dialogue, TSD 2003

ISBN

3-540-20024-X

Publisher

University of West Bohemia in Pilsen

Place

České Budějovice

BibTeX

@inproceedings{BUT14178,
  author="Petr {Motlíček} and Jan {Černocký}",
  title="All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task",
  booktitle="6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings",
  year="2003",
  journal="Lecture Notes in Computer Science",
  volume="2003",
  number="09",
  pages="295--300",
  publisher="University of West Bohemia in Pilsen",
  address="České Budějovice",
  isbn="3-540-20024-X",
  issn="0302-9743",
  url="https://www.fit.vut.cz/research/publication/7234/"
}

Projects

Data driven and anthropic coding and recognition of speech, GACR, Postdoktorandské granty, GP102/02/D108, start: 2002-09-01, end: 2005-08-30, completed
Voice technologies for support of information society, GACR, Standardní projekty, GA102/02/0124, start: 2002-01-01, end: 2004-12-31, completed

Research groups

Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)

Departments

Department of Computer Graphics and Multimedia (DCGM)