Publication Details

Time-domain based Temporal Processing with Application of Orthogonal Transformations

MOTLÍČEK Petr and ČERNOCKÝ Jan. Time-domain based Temporal Processing with Application of Orthogonal Transformations. In: Proc. EUROSPEECH 2003. Geneva: Institute for Perceptual Artificial Intelligence, 2003, pp. 821-824. ISSN 1018-4074.
Czech title
Zpracování řečového signálu v časové oblasti s použitím ortogonálních transformací
Type
conference paper
Language
English
Authors
URL
Keywords

speech processing, speech recognition, TRAP, feature extraction

Abstract

Time-domain based Temporal Processing with Application of Orthogonal Transformations

Annotation

This paper proposes a novel approach that efficiently extracts the temporal information of speech. The algorithm operates entirely in the time domain, and its preprocessing blocks are well justified by psychoacoustic studies. The results show that the proposed algorithm has different properties from the traditional approach. It is advantageous because it is easy to modify and computationally inexpensive. The experiments then focus on different representations of time trajectories. Classical methods that are efficient in conventional feature extraction proved unsuitable for approximating the temporal trajectories of speech. However, applying orthogonal transformations, such as the discrete Fourier transform or the discrete cosine transform, on top of the previously derived temporal trajectories outperforms classification in the original domain. In addition, these transformed features are very efficient at reducing the dimensionality of the data.
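
As a rough illustration of the last point, the sketch below applies an orthonormal discrete cosine transform (DCT-II) to a single temporal trajectory and keeps only the first few coefficients as a reduced feature vector. It is not the authors' implementation: the function names, the 101-frame trajectory length, the synthetic trajectory values, and the choice of 16 retained coefficients are all assumptions made for this example.

```python
import math


def dct_ii(trajectory):
    """Orthonormal DCT-II of a 1-D sequence (plain Python, O(N^2))."""
    n = len(trajectory)
    coeffs = []
    for k in range(n):
        # Project the trajectory onto the k-th cosine basis function.
        s = sum(
            x * math.cos(math.pi * (i + 0.5) * k / n)
            for i, x in enumerate(trajectory)
        )
        # Scaling that makes the transform orthonormal (energy preserving).
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        coeffs.append(scale * s)
    return coeffs


def truncated_dct_features(trajectory, num_coeffs):
    """Keep only the first num_coeffs DCT coefficients (hypothetical helper)."""
    return dct_ii(trajectory)[:num_coeffs]


# Synthetic stand-in for the time trajectory of one frequency band;
# real trajectories would come from the speech front end.
num_frames = 101  # assumed trajectory length, not taken from the paper
trajectory = [
    math.sin(2 * math.pi * t / num_frames) + 0.5 * t / num_frames
    for t in range(num_frames)
]

# Reduce the 101-point trajectory to a 16-dimensional feature vector.
features = truncated_dct_features(trajectory, 16)
print(len(trajectory), "->", len(features))
```

Because the transform is orthonormal, the full set of coefficients preserves the energy of the trajectory, so dropping the higher-order coefficients discards only the fast-varying part of the trajectory. How many coefficients to keep is a tuning choice that this sketch does not attempt to justify.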

Published
2003
Pages
821-824
Journal
European Speech Communication, vol. 2003, no. 9, ISSN 1018-4074
Proceedings
Proc. EUROSPEECH 2003
Conference
Eurospeech 2003-Switzerland - 8th European conference on speech communication and technology, Geneva, CH
Publisher
Institute for Perceptual Artificial Intelligence
Place
Geneva, CH
BibTeX
@INPROCEEDINGS{FITPUB7232,
   author = "Petr Motl\'{i}\v{c}ek and Jan \v{C}ernock\'{y}",
   title = "Time-domain based Temporal Processing with Application of Orthogonal Transformations",
   pages = "821--824",
   booktitle = "Proc. EUROSPEECH 2003",
   journal = "European Speech Communication",
   volume = 2003,
   number = 9,
   year = 2003,
   location = "Geneva, CH",
   publisher = "Institute for Perceptual Artificial Intelligence",
   ISSN = "1018-4074",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/7232"
}