Close menu

SURE

Sunderland Repository records the research produced by the University of Sunderland including practice-based research and theses.

Who is Authentic Speaker?

Huang, Qiang (2024) Who is Authentic Speaker? In: The 29th International Conference on Automation and Computing (ICAC 2024). IEEE. (In Press)

Item Type: Book Section

Abstract

Voice conversion (VC) using deep learning technologies can now generate high quality one-to-many voices and thus has been used in some practical application fields, such as entertainment and healthcare. However, voice conversion can pose potential social issues when manipulated voices are employed for deceptive purposes. Moreover, it is a big challenge to find who are real speakers from the converted voices as the acoustic characteristics of source speakers are changed greatly. In this paper we attempt to explore the feasibility of identifying authentic speakers from converted voices. This study is conducted with the assumption that certain information from the source speakers persists, even when their voices undergo conversion into different target voices. Therefore our experiments are geared towards recognising the source speakers given the converted voices, which are generated by using FragmentVC on the randomly paired utterances from source and target speakers. To improve the robustness against converted voices, our recognition model is constructed by using hierarchical vector of locally aggregated descriptors (VLAD) in deep neural networks. The authentic speaker recognition system is mainly tested in two aspects, including the impact of quality of converted voices and the variations of VLAD. The dataset used in this work is VCTK corpus, where source and target speakers are randomly paired. The results obtained on the converted utterances show promising performances in recognising authentic speakers from converted voices.

[img] PDF
qh_icac2024.pdf
Restricted to Repository staff only

Download (1MB) | Request a copy

More Information

Depositing User: Qiang Huang

Identifiers

Item ID: 18273
URI: http://sure.sunderland.ac.uk/id/eprint/18273
Official URL: https://cacsuk.co.uk/icac/

Users with ORCIDS

ORCID for Qiang Huang: ORCID iD orcid.org/0000-0002-2943-2283

Catalogue record

Date Deposited: 24 Sep 2024 14:10
Last Modified: 24 Sep 2024 14:15

Contributors

Author: Qiang Huang ORCID iD

University Divisions

Faculty of Technology > School of Computer Science

Subjects

Computing > Human-Computer Interaction

Actions (login required)

View Item (Repository Staff Only) View Item (Repository Staff Only)