Publication Details
Text Augmentation for Language Models in High Error Recognition Scenario
data augmentation, error simulation, language modeling, automatic speech recognition
In this paper, we explore several data augmentation strategies for training language models for speech recognition. We compare augmentation based on global error statistics with augmentation based on unigram statistics of ASR errors, and with label smoothing and its sampled variant. Additionally, we investigate the stability and the predictive power of perplexity estimated on augmented data. Despite being trivial, augmentation driven by global substitution, deletion and insertion rates achieves the best rescoring results. On the other hand, even though the associated perplexity measure is stable, it gives no better prediction of the final error rate than the vanilla one. Our best augmentation scheme increases the WER improvement from second-pass rescoring from 1.1% to 1.9% absolute on the CHiME-6 challenge.
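The best-performing scheme above corrupts training text using only three global rates. A minimal sketch of such rate-driven corruption is shown below; the function name, rate values, and the use of a uniform random vocabulary word for substitutions and insertions are illustrative assumptions, not the paper's exact procedure (in practice the rates would be estimated from first-pass ASR output).

```python
import random

def augment(tokens, vocab, sub=0.05, dele=0.02, ins=0.02, rng=random):
    """Corrupt a token sequence with global substitution/deletion/insertion
    rates, mimicking ASR errors for language-model training text.

    Rates are illustrative placeholders; real values would be measured
    from the recognizer's first-pass hypotheses."""
    out = []
    for tok in tokens:
        r = rng.random()
        if r < dele:
            continue                       # deletion: drop the token
        elif r < dele + sub:
            out.append(rng.choice(vocab))  # substitution: random vocab word
        else:
            out.append(tok)                # keep the token unchanged
        if rng.random() < ins:
            out.append(rng.choice(vocab))  # insertion after the token
    return out
```

For example, `augment(["the", "cat", "sat"], vocab)` yields a noisy variant of the sentence each time it is called, so the language model sees error-laden text resembling recognizer output.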
@INPROCEEDINGS{FITPUB12606,
  author    = "Karel Bene\v{s} and Luk\'{a}\v{s} Burget",
  title     = "Text Augmentation for Language Models in High Error Recognition Scenario",
  pages     = "1872--1876",
  booktitle = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",
  volume    = 2021,
  number    = 8,
  year      = 2021,
  location  = "Brno, CZ",
  publisher = "International Speech Communication Association",
  ISSN      = "1990-9772",
  doi       = "10.21437/Interspeech.2021-627",
  language  = "english",
  url       = "https://www.fit.vut.cz/research/publication/12606"
}