Show simple item record

dc.contributor.authorSieranoja, Sami
dc.contributor.authorSahidullah, Md
dc.contributor.authorKinnunen, Tomi
dc.contributor.authorKomulainen, Jukka
dc.contributor.authorHadid, Abdenour
dc.date.accessioned2019-01-23T08:57:11Z
dc.date.available2019-01-23T08:57:11Z
dc.date.issued2018
dc.identifier.urihttps://erepo.uef.fi/handle/123456789/7358
dc.description.abstractAudiovisual speech synchrony detection is an important part of talking-face verification systems. Prior work has primarily focused on visual features and joint-space models, while standard mel-frequency cepstral coefficients (MFCCs) have been commonly used to present speech. We focus more closely on audio by studying the impact of context window length for delta feature computation and comparing MFCCs with simpler energy-based features in lip-sync detection. We select state-of-the-art hand-crafted lip-sync visual features, space-time auto-correlation of gradients (STACOG), and canonical correlation analysis (CCA), for joint-space modeling. To enhance joint space modeling, we adopt deep CCA (DCCA), a nonlinear extension of CCA. Our results on the XM2VTS data indicate substantially enhanced audiovisual speech synchrony detection, with an equal error rate (EER) of 3.68%. Further analysis reveals that failed lip region localization and beardedness of the subjects constitutes most of the errors. Thus, the lip motion description is the bottleneck, while the use of novel audio features or joint-modeling techniques is unlikely to boost lip-sync detection accuracy further.
dc.language.isoenglanti
dc.publisherIEEE
dc.relation.ispartof2018 IEEE 3rd International Conference on Signal and Image Processing (ICSIP)
dc.relation.urihttp://dx.doi.org/10.1109/SIPROCESS.2018.8600424
dc.rightsIn copyright 1.0
dc.subjectaudiovisual synchrony
dc.subjectpresentation attack detection
dc.subjectmultimodal processing
dc.subjectfeature extraction
dc.subjectmel-frequency cepstral coefficients (MFCCs)
dc.titleAudiovisual Synchrony Detection with Optimized Audio Features
dc.description.versionfinal draft
dc.contributor.departmentSchool of Computing, activities
uef.solecris.id59592735en
dc.type.publicationArtikkelit ja abstraktit tieteellisissä konferenssijulkaisuissa
dc.relation.doi10.1109/SIPROCESS.2018.8600424
dc.description.reviewstatuspeerReviewed
dc.format.pagerange377-381
dc.relation.isbn978-1-5386-6396-7
dc.rights.accesslevelopenAccess
dc.type.okmA4
uef.solecris.openaccessEi
dc.rights.copyright© IEEE
dc.type.displayTypearticleen
dc.type.displayTypeartikkelifi
dc.rights.urlhttps://rightsstatements.org/page/InC/1.0/


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record