dc.contributor.author: Gonzalez Hautamäki, Rosa
dc.contributor.author: Kinnunen, Tomi
dc.contributor.editor: -
dc.date.accessioned: 2020-06-11T11:29:02Z
dc.date.available: 2020-06-11T11:29:02Z
dc.date.issued: 2020
dc.identifier.uri: https://erepo.uef.fi/handle/123456789/8168
dc.description.abstract: Speaker comparison by listening is a valuable resource, for instance in human voice discrimination studies and in voice conversion (VC) system evaluations. Usually, listeners are provided with application-neutral guidelines that encourage retaining overall high speaker discrimination accuracy. Nonetheless, listeners are subject to misses (declaring a same-speaker trial as different-speaker) and false alarms (vice versa), with possibly non-symmetric outcomes. In automatic speaker verification (ASV) applications, the consequences of a miss and a false alarm are rarely equal, and the decision-making policy is adjusted towards a given application with a desired miss/false alarm trade-off. We study whether listener decisions could similarly be controlled to provoke more accept (or reject) decisions by framing the voice comparison task in different ways. Our neutral, forensic, user-convenient bank and secure bank scenarios are played by disjoint panels (through Amazon's Mechanical Turk), all judging the same speaker trials drawn from RedDots and 2018 Voice Conversion Challenge (VCC 2018) data. Our results indicate that listener decisions can be influenced by modifying the task framing. Because the task is subjective, the challenge is how to drive the panel decisions in the desired direction (to reduce the miss or false alarm rate). Our preliminary results suggest potential for novel, application-directed speaker discrimination designs.
dc.language.iso: English
dc.publisher: IEEE
dc.relation.ispartof: 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU): Proceedings
dc.relation.uri: http://dx.doi.org/10.1109/ASRU46091.2019.9003978
dc.rights: All rights reserved
dc.subject: speaker verification
dc.subject: speaker discrimination
dc.subject: listener performance
dc.subject: decision making
dc.title: Towards Controlling False Alarm - Miss Trade-Off in Perceptual Speaker Comparison via Non-Neutral Listening Task Framing
dc.description.version: final draft
dc.contributor.department: School of Computing, activities
uef.solecris.id: 68244777 (en)
dc.type.publication: Articles and abstracts in scientific conference proceedings
dc.rights.accessrights: © 2020 IEEE
dc.relation.doi: 10.1109/ASRU46091.2019.9003978
dc.description.reviewstatus: peerReviewed
dc.format.pagerange: 749-756
dc.relation.isbn: 978-1-7281-0306-8
dc.rights.accesslevel: openAccess
dc.type.okm: A4
uef.solecris.openaccess: No