Publication Details

Speaker Verification with Application-Aware Beamforming

MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., BURGET Lukáš and ČERNOCKÝ Jan. Speaker Verification with Application-Aware Beamforming. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, pp. 411-418. ISBN 978-1-7281-0306-8.

Czech title

Rozpoznávání řečníka s aplikačně specifickým směrováním akustického paprsku

Type

conference paper

Language

english

Authors

Mošner Ladislav, Ing. (DCGM FIT BUT)
Plchot Oldřich, Ing., Ph.D. (DCGM FIT BUT)
Rohdin Johan A., Dr. (DCGM FIT BUT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2019/mosner_asru2019_0000411.pdf PDF

Keywords

Speaker verification, beamforming, xvector, generalized eigenvalue problem

Abstract

Multichannel speech processing applications usually employ beamformers as means of speech enhancement through spatial filtering. Beamformers with learnable parameters require training to minimize a loss function that is not necessarily correlated with the final objective. In this paper, we present a framework employing recent neural network based generalized eigenvalue beamformer and application-specific model that allows for optimization of beamformer w.r.t. target application. In our case, the application is speaker verification which utilizes a speaker embedding (x-vector) extractor that conveniently comes with desired loss. We show that application-specific training of the beamformer brings performance improvements over a system trained in the standard way. We perform our analysis on the recently introduced VOiCES corpus which contains multichannel data and allows us to modify the evaluation trials such that enrollment recordings remain single-channel and test utterances are multichannel.

Published

2019

Pages

411-418

Proceedings

IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU)

Conference

2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), Singapore, SG

ISBN

978-1-7281-0306-8

Publisher

IEEE Signal Processing Society

Place

Sentosa, Singapore, SG

DOI

10.1109/ASRU46091.2019.9003932

UT WoS

000539883100055

EID Scopus

2-s2.0-85081562834

BibTeX

@INPROCEEDINGS{FITPUB12152,
   author = "Ladislav Mo\v{s}ner and Old\v{r}ich Plchot and A. Johan Rohdin and Luk\'{a}\v{s} Burget and Jan \v{C}ernock\'{y}",
   title = "Speaker Verification with Application-Aware Beamforming",
   pages = "411--418",
   booktitle = "IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU)",
   year = 2019,
   location = "Sentosa, Singapore, SG",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-7281-0306-8",
   doi = "10.1109/ASRU46091.2019.9003932",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/12152"
}