Publication Details

Speaker recognition on mono-channel telephony recordings

SOLEWICZ Yosef, COHEN Noa, ROHDIN Johan A., MADIKERI Srikanth and ČERNOCKÝ Jan. Speaker recognition on mono-channel telephony recordings. In: Proceedings of Odyssey 2022. Beijing: International Speech Communication Association, 2022, pp. 193-199. Available from: https://www.isca-speech.org/archive/pdfs/odyssey_2022/solewicz22_odyssey.pdf
Czech title
Rozpoznávání mluvčího v jednokanálových telefonních nahrávkách
Type
conference paper
Language
english
Authors
Solewicz Yosef (MoPS)
Cohen Noa (MoPS)
Rohdin Johan A., Dr. (DCGM FIT BUT)
Madikeri Srikanth (IDIAP)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
URL
Keywords

speaker recognition, telephony recordings

Abstract

Conversations stored as mono data is a common problem in many real world speaker recognition applications. In this paper, we focus on investigative scenarios, where a number of mono telephone conversations are available for a speaker of interest. For example, a human operator may have verified that the speaker is present in these conversations. We propose several approaches for automatically creating enrollment models for the speaker of interest from such data. We then use the enrollment models to search for appearances of the speaker of interest in other calls. We analyze the performance of the different method on two dataset that matches our scenario, one is from a simulated case and one is from a real case.

Published
2022
Pages
193-199
Proceedings
Proceedings of Odyssey 2022
Conference
Odyssey 2022: The Speaker and Language Recognition Workshop, Beijing, CN
Publisher
International Speech Communication Association
Place
Beijing, CN
DOI
BibTeX
@INPROCEEDINGS{FITPUB12844,
   author = "Yosef Solewicz and Noa Cohen and A. Johan Rohdin and Srikanth Madikeri and Jan \v{C}ernock\'{y}",
   title = "Speaker recognition on mono-channel telephony recordings",
   pages = "193--199",
   booktitle = "Proceedings of Odyssey 2022",
   year = 2022,
   location = "Beijing, CN",
   publisher = "International Speech Communication Association",
   doi = "10.21437/Odyssey.2022-27",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/12844"
}
Back to top