Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Comparison of acoustic voice features derived from mobile devices and studio microphone recordings

Fahed, Vitoria S., Doheny, Emer P., Busse, Monica ORCID:, Hoblyn, Jennifer and Lowery, Madeleine M. 2022. Comparison of acoustic voice features derived from mobile devices and studio microphone recordings. Journal of Voice 10.1016/j.jvoice.2022.10.006

[thumbnail of 1-s2.0-S0892199722003125-main.pdf]
PDF - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (3MB) | Preview


Objectives/Hypothesis Improvements in mobile device technology offer new opportunities for remote monitoring of voice for home and clinical assessment. However, there is a need to establish equivalence between features derived from signals recorded from mobile devices and gold standard microphone-preamplifiers. In this study acoustic voice features from android smartphone, tablet, and microphone-preamplifier recordings were compared. Methods Data were recorded from 37 volunteers (20 female) with no history of speech disorder and six volunteers with Huntington's disease (HD) during sustained vowel (SV) phonation, reading passage (RP), and five syllable repetition (SR) tasks. The following features were estimated: fundamental frequency median and standard deviation (F0 and SD F0), harmonics-to-noise ratio (HNR), local jitter, relative average perturbation of jitter (RAP), five-point period perturbation quotient (PPQ5), difference of differences of amplitude and periods (DDA and DDP), shimmer, and amplitude perturbation quotients (APQ3, APQ5, and APQ11). Results Bland-Altman analysis revealed good agreement between microphone and mobile devices for fundamental frequency, jitter, RAP, PPQ5, and DDP during all tasks and a bias for HNR, shimmer and its variants (APQ3, APQ5, APQ11, and DDA). Significant differences were observed between devices for HNR, shimmer, and its variants for all tasks. High correlation was observed between devices for all features, except SD F0 for RP. Similar results were observed in the HD group for SV and SR task. Biological sex had a significant effect on F0 and HNR during all tests, and for jitter, RAP, PPQ5, DDP, and shimmer for RP and SR. No significant effect of age was observed. Conclusions Mobile devices provided good agreement with state of the art, high-quality microphones during structured speech tasks for features derived from frequency components of the audio recordings. Caution should be taken when estimating HNR, shimmer and its variants from recordings made with mobile devices.

Item Type: Article
Date Type: Published Online
Status: In Press
Schools: Medicine
Centre for Trials Research (CNTRR)
Publisher: Elsevier
ISSN: 0892-1997
Date of First Compliant Deposit: 24 October 2022
Date of Acceptance: 10 October 2022
Last Modified: 06 May 2023 13:30

Actions (repository staff only)

Edit Item Edit Item


Downloads per month over past year

View more statistics