Result Details

On-the-Fly Text Retrieval for end-to-end ASR Adaptation

YUSUF, B.; GOURAV, A.; GANDHE, A.; BULYKO, I. On-the-Fly Text Retrieval for end-to-end ASR Adaptation. In Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023. p. 1-5. ISBN: 978-1-7281-6327-7.

Type

conference paper

Language

English

Authors

Yusuf Bolaji, DCGM (FIT)
GOURAV, A.
Gandhe Ankur
BULYKO, I.

Abstract

End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language models have to be retrained whenever the corpus of interest changes. Furthermore, since they store the entire corpus in their parameters, rare words can be challenging to recall. In this work, we propose augmenting a transducer-based ASR model with a retrieval language model, which directly retrieves from an external text corpus plausible completions for a partial ASR hypothesis. These completions are then integrated into subsequent predictions by an adapter, which is trained once, so that the corpus of interest can be switched without incurring the computational overhead of retraining. Our experiments show that the proposed model significantly improves the performance of a transducer baseline on a pair of question-answering datasets. Further, it outperforms shallow fusion on recognition of named entities by about 7% relative; when the two are combined, the relative improvement increases to 13%.
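
The retrieval idea in the abstract can be illustrated with a small sketch. This is not the authors' model: the abstract does not say how retrieval is implemented, so the example substitutes an exact word n-gram lookup, and the names `build_index`, `retrieve_completions`, and `context_size` are hypothetical. It only shows how completions for a partial hypothesis can be looked up in a text corpus, and how the corpus can be switched by rebuilding an index rather than retraining anything.

```python
from collections import defaultdict


def build_index(corpus, context_size=2):
    """Map every context_size-word window in the corpus to the words that follow it."""
    index = defaultdict(list)
    for sentence in corpus:
        words = sentence.lower().split()
        for i in range(len(words) - context_size):
            key = tuple(words[i:i + context_size])
            index[key].append(words[i + context_size])
    return index


def retrieve_completions(index, partial_hypothesis, context_size=2):
    """Return candidate next words for the last context_size words of the hypothesis."""
    words = partial_hypothesis.lower().split()
    if len(words) < context_size:
        return []
    key = tuple(words[-context_size:])
    # Deduplicate while keeping corpus order (assumption: order is a good enough ranking here).
    return list(dict.fromkeys(index.get(key, [])))


# Two toy corpora standing in for different domains (illustrative data only).
books = ["who wrote the great gatsby", "who wrote the odyssey"]
films = ["who directed the godfather", "who wrote the screenplay"]

books_index = build_index(books)
films_index = build_index(films)

print(retrieve_completions(books_index, "who wrote the"))
print(retrieve_completions(films_index, "who wrote the"))
```

Switching from `books_index` to `films_index` changes the retrieved completions without touching the lookup code, which loosely mirrors the abstract's point that the corpus can be swapped without retraining. In the described system, an adapter trained once would integrate such completions into the transducer's predictions; that part is not modelled here.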

Keywords

retrieval, language model, domain adaptation, end-to-end ASR, RNN transducer, contextual biasing

URL

https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10095857

Published

2023

Pages

1–5

Proceedings

Proceedings of ICASSP 2023

Conference

2023 IEEE International Conference on Acoustics, Speech and Signal Processing

ISBN

978-1-7281-6327-7

Publisher

IEEE Signal Processing Society

Place

Rhodes Island

DOI

10.1109/ICASSP49357.2023.10095857

EID Scopus

2-s2.0-85177593944

BibTeX

@inproceedings{BUT185196,
  author="YUSUF, B. and GOURAV, A. and GANDHE, A. and BULYKO, I.",
  title="On-the-Fly Text Retrieval for end-to-end ASR Adaptation",
  booktitle="Proceedings of ICASSP 2023",
  year="2023",
  pages="1--5",
  publisher="IEEE Signal Processing Society",
  address="Rhodes Island",
  doi="10.1109/ICASSP49357.2023.10095857",
  isbn="978-1-7281-6327-7",
  url="https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10095857"
}

Files

pdf yusuf_icassp2023_amazon paper.pdf 1 MB

Projects

Contemporary methods of processing, analysis and visualization of multimedia and 3D data, BUT, BUT Internal Projects, FIT-S-23-8278, start: 2023-03-01, end: 2026-02-28, running

Research groups

Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)

Departments

Department of Computer Graphics and Multimedia (DCGM)