67891011129 of 70
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Use of machine learning and voice for multiclass classification of Parkinson's disease, chronic obstructive pulmonary disease, and healthy controls
Blekinge Institute of Technology, Faculty of Engineering, Department of Health.ORCID iD: 0000-0003-1558-2309
Blekinge Institute of Technology, Faculty of Engineering, Department of Health.ORCID iD: 0000-0002-9099-0348
2026 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 16, no 1, article id 15485Article in journal (Refereed) Published
Abstract [en]

Parkinson's disease (PD) and chronic obstructive pulmonary disease (COPD) are prevalent conditions with substantial impact on quality of life and health care systems. Both disorders affect voice production through different physiological mechanisms, yet neither condition has a widely adopted objective biomarker for routine clinical use. Voice analysis has emerged as a non-invasive digital biomarker candidate, but existing studies have largely focused on binary classification within a single disorder or language. This study aimed to evaluate whether an unified multiclass machine learning (ML) framework applied to sustained vowel "a" phonation can discriminate between PD, COPD, and healthy controls (HC) across linguistically distinct cohorts. Sustained vowel recordings were analyzed from Swedish speaking individuals with COPD and HC, and English-speaking individuals with PD and HC, collected under comparable mobile recording conditions. Acoustic features included baseline voice measures and Mel Frequency Cepstral Coefficients. A soft voting ML framework integrating support vector machine, random forest, CatBoost, and light gradient boosting classifiers was trained using nested cross validation with hyperparameter optimization. Data were partitioned at the participant level into a development cohort and an independent test cohort. Model performance was evaluated using accuracy, macro averaged precision, recall, F1 score, receiver operating characteristic analysis, and confusion matrices. Model interpretability was assessed using Shapley additive explanations and vowel space analysis. The final soft voting classifier achieved robust multiclass discrimination on the participant disjoint independent test set, with an overall accuracy of 0.842 and a macro averaged F1 score of 0.839. Classification performance differed across groups, with the highest performance observed for PD, intermediate performance for HC, and lower performance for COPD. Misclassifications occurred primarily between HC and COPD, while confusion between PD and COPD was minimal. Feature attribution analysis revealed class dependent relevance patterns, and vowel space analysis demonstrated subtle but consistent group level differences. These findings demonstrate the feasibility of using an explainable soft voting machine learning framework applied to sustained vowel phonation to distinguish between neurologically and respiratory driven voice impairments across linguistic contexts. The study supports voice as a promising digital biomarker modality for multiclass clinical discrimination using mobile recordings.

Place, publisher, year, edition, pages
Nature Publishing Group, 2026. Vol. 16, no 1, article id 15485
Keywords [en]
Parkinson's disease, Chronic obstructive pulmonary disease, Voice analysis, Machine learning, Digital biomarkers, Multiclass classification, Explainable artificial intelligence
National Category
Respiratory Medicine and Allergy
Identifiers
URN: urn:nbn:se:bth-29639DOI: 10.1038/s41598-026-53409-3ISI: 001771048300003PubMedID: 42156531Scopus ID: 2-s2.0-105039595753OAI: oai:DiVA.org:bth-29639DiVA, id: diva2:2064542
Available from: 2026-06-02 Created: 2026-06-02 Last updated: 2026-06-05Bibliographically approved

Open Access in DiVA

fulltext(3585 kB)10 downloads
File information
File name FULLTEXT01.pdfFile size 3585 kBChecksum SHA-512
258321c589ca5dd8bad4169a795a4d25e64325fe0bed2ca737d7d89c62adf9e92a376c481b1c033e1db06d9d4f7767afe240c759897027879a091235eb7b2d25
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMedScopus

Authority records

Idrisoglu, AlperBehrens, Anders

Search in DiVA

By author/editor
Idrisoglu, AlperBehrens, Anders
By organisation
Department of Health
In the same journal
Scientific Reports
Respiratory Medicine and Allergy

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 30 hits
67891011129 of 70
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf