Publication Details

Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language

MACIEJEWSKI Matthew, KLEMENT Dominik, HUANG Ruizhe, WIESNER Matthew and KHUDANPUR Sanjeev. Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 2155-2160. ISSN 1990-9772. Available from: https://www.isca-archive.org/interspeech_2024/maciejewski24_interspeech.pdf

Czech title

Hodnocení řečových technologií na Santa Barbara korpusu: výzvy konverzační mluvené řeči

Type

conference paper

Language

english

Authors

Maciejewski Matthew (JHU)
Klement Dominik, Bc. (FIT BUT)
Huang Ruizhe (JHU)
Wiesner Matthew (JHU)
Khudanpur Sanjeev (JHU)

URL

Keywords

conversational speech, diarization, speech recognition

Abstract

As speech technology has matured, there has been a push to- wards systems that can process conversational speech, reflect- ing the so-called "cocktail party problem," which includes not only more challenging acoustic conditions, but also necessi- tates solutions to new problems, such as identifying who spoke when and processing multiple concurrent streams of speech. Such problems have been approached primarily via corpora comprising business meetings and dinner parties, overlooking the broad range of conversational dynamics and speaker de- mographics that fall under the category of multi-talker speech. To this end, we introduce the use of the Santa Barbara Corpus of Spoken American English for evaluation of speech technol- ogy-including preparing the corpus and annotations for auto- matic processing, demonstrating the failure of state-of-the-art systems to withstand the heterogeneity of conditions, and high- lighting the situations where standard methods struggle to per- form at all

Published

2024

Pages

2155-2160

Journal

Proceedings of Interspeech - on-line, vol. 2024, no. 9, ISSN 1990-9772

Proceedings

Proceedings of Interspeech 2024

Conference

Interspeech Conference, Kos, GR

Publisher

International Speech Communication Association

Place

Kos, GR

DOI

10.21437/Interspeech.2024-2119

EID Scopus

2-s2.0-85214796368

BibTeX

@INPROCEEDINGS{FITPUB13325,
   author = "Matthew Maciejewski and Dominik Klement and Ruizhe Huang and Matthew Wiesner and Sanjeev Khudanpur",
   title = "Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language",
   pages = "2155--2160",
   booktitle = "Proceedings of Interspeech 2024",
   journal = "Proceedings of Interspeech - on-line",
   volume = 2024,
   number = 9,
   year = 2024,
   location = "Kos, GR",
   publisher = "International Speech Communication Association",
   ISSN = "1990-9772",
   doi = "10.21437/Interspeech.2024-2119",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/13325"
}