Publication Details
Speaker recognition on mono-channel telephony recordings
Cohen Noa (MoPS)
Rohdin Johan A., Dr. (DCGM FIT BUT)
Madikeri Srikanth (IDIAP)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
speaker recognition, telephony recordings
Conversations stored as mono data is a common problem in many real world speaker recognition applications. In this paper, we focus on investigative scenarios, where a number of mono telephone conversations are available for a speaker of interest. For example, a human operator may have verified that the speaker is present in these conversations. We propose several approaches for automatically creating enrollment models for the speaker of interest from such data. We then use the enrollment models to search for appearances of the speaker of interest in other calls. We analyze the performance of the different method on two dataset that matches our scenario, one is from a simulated case and one is from a real case.
@INPROCEEDINGS{FITPUB12844, author = "Yosef Solewicz and Noa Cohen and A. Johan Rohdin and Srikanth Madikeri and Jan \v{C}ernock\'{y}", title = "Speaker recognition on mono-channel telephony recordings", pages = "193--199", booktitle = "Proceedings of Odyssey 2022", year = 2022, location = "Beijing, CN", publisher = "International Speech Communication Association", doi = "10.21437/Odyssey.2022-27", language = "english", url = "https://www.fit.vut.cz/research/publication/12844" }