Impact of Bandwidth and Channel Variation on Presentation Attack Detection for Speaker Verification
Self archived versionfinal draft
MetadataShow full item record
CitationDelgado Hector. Todisco Massimiliano. Evans Nicholas. Sahidullah Md. Ming Liu Wei. Alegre Federico. Kinnunen Tomi. Fauve Benoit. (2017). Impact of Bandwidth and Channel Variation on Presentation Attack Detection for Speaker Verification. Proc. Int. Conf. of the Biometrics Special Interest Group (BIOSIG 2017), CFP1735S-ART, 1-6. 10.23919/BIOSIG.2017.8053510.
Vulnerabilities to presentation attacks can undermine confidence in automatic speaker verification (ASV) technology. While efforts to develop countermeasures, known as presentation attack detection (PAD) systems, are now under way, the majority of past work has been performed with high-quality speech data. Many practical ASV applications are narrowband and encompass various coding and other channel effects. PAD performance is largely untested in such scenarios. This paper reports an assessment of the impact of bandwidth and channel variation on PAD performance. Assessments using two current PAD solutions and two standard databases show that they provoke significant degradations in performance. Encouragingly, relative performance improvements of 98% can nonetheless be achieved through feature optimisation. This performance gain is achieved by optimising the spectro-temporal decomposition in the feature extraction process to compensate for narrowband speech. However, compensating for channel variation is considerably more challenging.