Performance comparison of speaker recognition systems in presence of duration variability
Self archived versionfinal draft
MetadataShow full item record
CitationPoddar, A. Sahidullah, Md. Saha, G. (2016). Performance comparison of speaker recognition systems in presence of duration variability. 2015 Annual IEEE India Conference (INDICON), 1-6. 10.1109/INDICON.2015.7443464.
Performance of speaker recognition system is highly dependent on the amount of speech data used in training and testing. In this paper, we compare the performance of two different speaker recognition systems in presence of utterance duration variability. The first system is based on state-of-the-art total variability (also known as i-vector system), whereas the other one is classical speaker recognition system based on Gaussian mixture model with universal background model (GMM-UBM). We have conducted extensive experiments for different cases of length mismatch on two NIST corpora: NIST SRE 2008 and NIST SRE 2010. Our study reveals that the relative improvement of total variability based system gradually drops with the reduction in test utterance length. We also observe that if the speakers are enrolled with sufficient amount of training data, GMM-UBM system outperforms i-vector system for very short test utterances.