Publication Details
Phoneme Recognition using Temporal Patterns
Schwarz Petr, Ing., Ph.D. (DCGM FIT BUT)
Heřmanský Hynek, prof. Ing., Dr.Eng. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
speech recognition, feature extraction, temporal patterns
We investigate and compare several techniques for automatic recognition of unconstrained context-independent phoneme strings from the TIMIT and NTIMIT databases. Among the compared techniques, the one based on TempoRAl Patterns (TRAP) achieves the best results on clean speech, with about 10% relative improvement over the baseline system. Its advantage is also observed in the presence of mismatch between training and testing conditions. Issues such as the optimal length of temporal patterns in the TRAP technique, the effectiveness of mean and variance normalization of the patterns, and the multi-band input to the TRAP estimations are also explored.
@INPROCEEDINGS{FITPUB7241,
   author = "Pavel Mat\v{e}jka and Petr Schwarz and Hynek He\v{r}mansk\'{y} and Jan \v{C}ernock\'{y}",
   title = "Phoneme Recognition using Temporal Patterns",
   pages = "465--472",
   booktitle = "Proc. 6th International Conference Text, Speech and Dialogue, TSD2003",
   year = 2003,
   location = "Ceske Budejovice, CZ",
   publisher = "Springer Verlag",
   ISBN = "3-540-20024-X",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/7241"
}