Alali, Abdulazeez and Theodorakopoulos, Georgios ![]() ![]() |
Preview |
PDF
- Published Version
Available under License Creative Commons Attribution. Download (873kB) | Preview |
Abstract
Advances in deep learning have led to dramatic improvements in generative synthetic speech, eliminating robotic speech patterns to create speech that is indistinguishable from human voice. Although these advances are extremely useful in various applications, they also facilitate powerful attacks against both humans and machines. Recently, a new type of speech attack called partial fake (PF) speech has emerged. This paper studies how well humans and machines, including speaker recognition systems and existing fake-speech detection tools, can distinguish between human voice and computer-generated speech. Our study shows that both humans and machines can be easily deceived by PF speech, and the current defences against PF speech are insufficient. These findings emphasise the urgency of increasing awareness for humans and creating new automated defences against PF speech for machines.
Item Type: | Article |
---|---|
Date Type: | Published Online |
Status: | Published |
Schools: | Computer Science & Informatics |
Publisher: | MDPI |
ISSN: | 2624-800X |
Date of First Compliant Deposit: | 6 February 2025 |
Date of Acceptance: | 6 February 2025 |
Last Modified: | 10 Feb 2025 09:45 |
URI: | https://orca.cardiff.ac.uk/id/eprint/176003 |
Actions (repository staff only)
![]() |
Edit Item |