ORCA
Online Research @ Cardiff

Clear Cookie - decide language by browser settings

Comparison of acoustic voice features derived from mobile devices and studio microphone recordings

Fahed, Vitoria S., Doheny, Emer P., Busse, Monica

, Hoblyn, Jennifer and Lowery, Madeleine M. 2025. Comparison of acoustic voice features derived from mobile devices and studio microphone recordings. Journal of Voice 39 (2) , 559.e1-559.e18. 10.1016/j.jvoice.2022.10.006

[thumbnail of 1-s2.0-S0892199722003125-main.pdf]

Preview

PDF - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
Download (3MB) | Preview

Official URL: https://doi.org/10.1016/j.jvoice.2022.10.006

Abstract

Objectives/Hypothesis Improvements in mobile device technology offer new opportunities for remote monitoring of voice for home and clinical assessment. However, there is a need to establish equivalence between features derived from signals recorded from mobile devices and gold standard microphone-preamplifiers. In this study acoustic voice features from android smartphone, tablet, and microphone-preamplifier recordings were compared. Methods Data were recorded from 37 volunteers (20 female) with no history of speech disorder and six volunteers with Huntington's disease (HD) during sustained vowel (SV) phonation, reading passage (RP), and five syllable repetition (SR) tasks. The following features were estimated: fundamental frequency median and standard deviation (F0 and SD F0), harmonics-to-noise ratio (HNR), local jitter, relative average perturbation of jitter (RAP), five-point period perturbation quotient (PPQ5), difference of differences of amplitude and periods (DDA and DDP), shimmer, and amplitude perturbation quotients (APQ3, APQ5, and APQ11). Results Bland-Altman analysis revealed good agreement between microphone and mobile devices for fundamental frequency, jitter, RAP, PPQ5, and DDP during all tasks and a bias for HNR, shimmer and its variants (APQ3, APQ5, APQ11, and DDA). Significant differences were observed between devices for HNR, shimmer, and its variants for all tasks. High correlation was observed between devices for all features, except SD F0 for RP. Similar results were observed in the HD group for SV and SR task. Biological sex had a significant effect on F0 and HNR during all tests, and for jitter, RAP, PPQ5, DDP, and shimmer for RP and SR. No significant effect of age was observed. Conclusions Mobile devices provided good agreement with state of the art, high-quality microphones during structured speech tasks for features derived from frequency components of the audio recordings. Caution should be taken when estimating HNR, shimmer and its variants from recordings made with mobile devices.

Item Type:	Article
Date Type:	Publication
Status:	Published
Schools:	Schools > Medicine Research Institutes & Centres > Centre for Trials Research (CNTRR)
Publisher:	Elsevier
ISSN:	0892-1997
Date of First Compliant Deposit:	24 October 2022
Date of Acceptance:	10 October 2022
Last Modified:	19 May 2025 12:44
URI:	https://orca.cardiff.ac.uk/id/eprint/153682

Actions (repository staff only)

Edit Item

Altmetric

Dimensions

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)