Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Feature Analysis of the Vowel [a:] in Individuals with Chronic Obstructive Pulmonary Disease and Healthy Controls
Blekinge Institute of Technology, Faculty of Engineering, Department of Health.ORCID iD: 0000-0003-1558-2309
Blekinge Institute of Technology, Faculty of Engineering, Department of Health.ORCID iD: 0000-0002-6752-017X
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0002-4390-411X
Blekinge Institute of Technology, Faculty of Engineering, Department of Health.ORCID iD: 0000-0001-9870-8477
Show others and affiliations
2025 (English)In: Journal of Voice, ISSN 0892-1997, E-ISSN 1873-4588Article in journal (Refereed) Epub ahead of print
Abstract [en]

Background: In addition to impairing the lung function, chronic obstructive pulmonary disease (COPD) also affects phonatory characteristics. Recent research highlights the potential of voice as a digital biomarker to support clinical decision-making. While machine learning (ML) can detect disease patterns from acoustic features, clinical relevance requires understanding the relationship between the disorder and acoustic features.

Objective: This study investigates both statistical and clinical significance using Baseline Acoustic (BLA) and Mel-Frequency Cepstral Coefficient (MFCC) features with focusing on individuals with COPD and healthy controls (HC).

Method: Acoustic features derived from Swedish utterances of the vowel [a:], recorded via mobile phones from 48 age-matched participants (24 COPD, 24 HC; equal gender distribution), were analyzed. To reduce bias from varying recording counts, features were aggregated by averaging 10 randomly selected recordings per participant over 100 iterations. Vowel articulation was visualized in the vowel quadrilateral space using F1 (tongue height) and F2 (tongue advancement). Group differences were assessed using the Shapiro-Wilk test, Mann-Whitney U test (α = 0.05), Benjamini-Hochberg (BH) and Bonferroni corrections, Permutational Multivariate Analysis of Variance (PERMANOVA) test, and Cliff's Delta (δ).

Results: Of 101 features, 29 remained significant after BH correction and one after Bonferroni. Multivariate testing (p = 0.019) showed group separation. Additionally, 34 features demonstrated large effect sizes, suggesting potential as digital biomarkers.

Conclusion: Voice data recorded via mobile phones capture meaningful acoustic differences associated with COPD. These findings support the integration of voice-based assessments into eHealth platforms for noninvasive COPD screening and monitoring, which is pending further validation on larger populations.

Clinical Trial: NCT06705647

Place, publisher, year, edition, pages
Elsevier, 2025.
Keywords [en]
Chronic obstructive pulmonary disease; Effect size; Mel-frequency cepstral coefficient; Mobile phone-recorded voice data; Statistical analysis; Voice features; Vowel quadrilateral space
National Category
Respiratory Medicine and Allergy
Research subject
Applied Health Technology
Identifiers
URN: urn:nbn:se:bth-28033DOI: 10.1016/j.jvoice.2025.10.013PubMedID: 41168019Scopus ID: 2-s2.0-105024714181OAI: oai:DiVA.org:bth-28033DiVA, id: diva2:1966412
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile CommunicationsAvailable from: 2025-06-10 Created: 2025-06-10 Last updated: 2026-01-02Bibliographically approved
In thesis
1. Voice as a Digital Biomarker: Machine Learning Applications for Chronic Obstructive Pulmonary Disease Assessment
Open this publication in new window or tab >>Voice as a Digital Biomarker: Machine Learning Applications for Chronic Obstructive Pulmonary Disease Assessment
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Chronic Obstructive Pulmonary Disease (COPD) is a leading cause of morbidity and mortality worldwide, with high underdiagnosis rates due to limitations in current diagnostic methods such as spirometry. This doctoral thesis explores the potential of voice as a digital biomarker to support the assessment of COPD, guided by the principles of Applied Health Technology (AHT), which emphasizes interdisciplinary collaboration and real-world applicability.

The research includes four interconnected studies. Study I presents a systematic literature review of machine learning (ML) applications for voice-affecting disorders, identifying COPD as underrepresented in current research. Study II addresses this gap by collecting a new dataset of vowel [a:] recordings from Swedish-speaking COPD patients and healthy controls once a week in self-determined quiet settings. Voice features, including baseline acoustic (BLA) parameters and Mel-Frequency Cepstral Coefficients (MFCCs), were extracted and used to train three ML classifiers: CatBoost (CB), Random Forest (RF), and Support Vector Machine (SVM). CB demonstrated the highest test accuracy at 78%. 

Study III investigates the effects of signal segmentation on model performance and shows that certain temporal segments of voice recordings contain more informative patterns, enhancing classification outcomes by increasing accuracy to 85%. Study IV applies statistical and practical significance tests to compare voice features between COPD and healthy groups. A total of 34 features, including shimmer measures and higher-order MFCC derivatives, were found to meaningfully differentiate the groups. 

This thesis reframes the human voice as a source of clinically relevant data, demonstrating how it can be digitized, analyzed, and interpreted using ML to aid COPD assessment. The results indicate that voice-based analysis can provide an accessible, non-invasive, and scalable complement to existing diagnostic tools. By integrating technical, clinical, and ethical perspectives, the thesis contributes new knowledge and practical methodologies that align with AHT's goal of creating value-driven, user-centered healthcare solutions. The findings support future development of mobile and remote voice-based screening tools for COPD and other conditions.

Place, publisher, year, edition, pages
Karlskrona: Blekinge Tekniska Högskola, 2025. p. 160
Series
Blekinge Institute of Technology Doctoral Dissertation Series, ISSN 1653-2090 ; 2025:07
Keywords
Chronic Obstructive Pulmonary Disease, Machine Learning, Noninvasive Diagnostic, Segmentation, Voice-based Analysis
National Category
Respiratory Medicine and Allergy Medical and Health Sciences Medical Informatics
Research subject
Applied Health Technology
Identifiers
urn:nbn:se:bth-28038 (URN)978-91-7295-503-5 (ISBN)
Public defence
2025-10-15, J1630, Valhallavägen 1, Karlskrona, 13:00 (English)
Opponent
Supervisors
Available from: 2025-08-11 Created: 2025-06-10 Last updated: 2025-10-28Bibliographically approved

Open Access in DiVA

fulltext(9329 kB)26 downloads
File information
File name FULLTEXT01.pdfFile size 9329 kBChecksum SHA-512
e5ccf70c643b05cae870150356613c7134b9c318c12cbafa79d5b26f24f47ecd2147b1137692ecad308d46e002c8e109d650693dc95697cead22823a7bc51cee
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Idrisoglu, AlperDallora Moraes, Ana LuizaCheddad, AbbasAnderberg, PeterSanmartin Berglund, Johan

Search in DiVA

By author/editor
Idrisoglu, AlperDallora Moraes, Ana LuizaCheddad, AbbasAnderberg, PeterWhitling, SusannaJakobsson, AndreasSanmartin Berglund, Johan
By organisation
Department of HealthDepartment of Computer Science
In the same journal
Journal of Voice
Respiratory Medicine and Allergy

Search outside of DiVA

GoogleGoogle Scholar
Total: 26 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 228 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf