Publication Details

Feature Extraction in Speech Coding and Recognition

MOTLÍČEK Petr. Feature Extraction in Speech Coding and Recognition. Portland: Oregon Graduate Institute of Science and Technology, 2002.

Czech title

Parametrizace v kódování a rozpoznávání řeči

Type

report

Language

english

Authors

Motlíček Petr, doc. Ing., Ph.D. (DCGM FIT BUT)

URL

http://www.fit.vutbr.cz/~motlicek/publi/2002/rp.pdf PDF

Keywords

speech coding, speech recognition, feature normalization, feature extraction, PLP, MFCC

Abstract

Feature Extraction in Speech Coding and Recognition

Annotation

This document is dealing with the feature extraction techniques generally used in speech recognition tasks. The most popular features used in speech recognition, such as Mel-Filter bank cepstral coefficients (MFCCs) and Perceptual Linear Prediction (PLP) coefficients, were taken as the baseline features. A significant amount of effort has been devoted to establishing speech feature extraction schemes which enable robust and high performance speech recognition in a range of operating environments. Each scheme consists of several processing stages that will be subjected to the theoretical analysis as well as their contribution to the resulting performance of the whole system. In many descriptions, feature extraction is considered as comprising three different stages: - namely static feature extraction -feature normalization -inclusion of temporal information. In our work we have concentrated on first two stages. In all our speech recognition experiments delta and acceleration components were applied as the temporal derivatives.

Published

2002

Pages

1-50

Publisher

Oregon Graduate Institute of Science and Technology

Place

Portland, US

BibTeX

@TECHREPORT{FITPUB7069,
   author = "Petr Motl\'{i}\v{c}ek",
   title = "Feature Extraction in Speech Coding and Recognition",
   pages = "1--50",
   year = 2002,
   location = "Portland, US",
   publisher = "Oregon Graduate Institute of Science and Technology",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/7069"
}