Impact of Bandwidth and Channel Variation on Presentation Attack Detection for Speaker Verification
Files
Self archived version
final draftDate
2017Author(s)
Unique identifier
10.23919/BIOSIG.2017.8053510Metadata
Show full item recordMore information
Self-archived item
Citation
Delgado Hector. Todisco Massimiliano. Evans Nicholas. Sahidullah Md. Ming Liu Wei. Alegre Federico. Kinnunen Tomi. Fauve Benoit. (2017). Impact of Bandwidth and Channel Variation on Presentation Attack Detection for Speaker Verification. Proc. Int. Conf. of the Biometrics Special Interest Group (BIOSIG 2017), CFP1735S-ART, 1-6. 10.23919/BIOSIG.2017.8053510.Rights
Abstract
Vulnerabilities to presentation attacks can undermine confidence in automatic speaker verification (ASV) technology. While efforts to develop countermeasures, known as presentation attack detection (PAD) systems, are now under way, the majority of past work has been performed with high-quality speech data. Many practical ASV applications are narrowband and encompass various coding and other channel effects. PAD performance is largely untested in such scenarios. This paper reports an assessment of the impact of bandwidth and channel variation on PAD performance. Assessments using two current PAD solutions and two standard databases show that they provoke significant degradations in performance. Encouragingly, relative performance improvements of 98% can nonetheless be achieved through feature optimisation. This performance gain is achieved by optimising the spectro-temporal decomposition in the feature extraction process to compensate for narrowband speech. However, compensating for channel variation is considerably more challenging.