Application of machine learning techniques for the characterization and early diagnosis of respiratory diseases such as COVID-19

Loading...
Thumbnail Image
Identifiers

Publication date

Authors

López Rivero, Alfonso José
Chinchilla Corbacho, Carlos
Romero Arias, Tatiana
Martín Merino, Manuel
Vaz, Paulo

Advisors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

SDG

goal-3

Metrics

Google Scholar

Research Projects

Organizational Units

Journal Issue

Abstract

This paper presents a robust methodology for the early and cost-effective diagnosis of COVID19 based on vocal features and machine learning techniques. The proposed methodology addresses all challenges inherent to the prediction of COVID-19, including those related to feature extraction and selection, the imbalance problem, and predictor training. In contrast to existing methodologies that rely solely on acoustic attributes of the voice, such as intensity or frequency, our approach represents a pioneering investigation that incorporates biomechanical aspects of vocal production. These include muscle tension, the coordination of articulatory movements, and respiration. The relationship between these characteristics and the presence of the virus is investigated rigorously using robust feature selection techniques. To this end, we have constructed an original dataset comprising patients with confirmed cases of COVID-19 infection and a control group, incorporating both acoustic and biomechanical features using Voice Clinical Software. The robustness and reproducibility of the experimental results have been enhanced through the rigorous comparison of several classifiers and feature selection algorithms, as well as the employment of resampling strategies. The application of random forests for feature selection has revealed that a limited set of biomechanical markers are significantly associated with the presence of COVID-19 infection. Moreover, a random forest classifier based on a subset of biomechanical and acoustic features demonstrates high efficacy in predicting cases of COVID-19 infection, achieving a sensitivity of S = (0.9212 ± 0.0775) while maintaining a specificity of Sp = (0.9150 ± 0.0649). Considering these findings, the proposed methodology can be regarded as a non-invasive and cost-effective alternative for the diagnosis of COVID-19 infection. Furthermore, it can be extended to the diagnosis of other respiratory diseases, provided that the vocal cords are affected.

Description

Keywords

Bibliographic reference

López Rivero, A. J., Chinchilla Corbacho, C., Romero Arias, T., Martín-Merino, M., & Vaz, P. (2024). Application of machine learning techniques for the characterization and early diagnosis of respiratory diseases such as covid-19. IEEE Access, 12, 160516-160528. https://doi.org/10.1109/ACCESS.2024.3487773

Type of document

Attribution-NonCommercial-NoDerivatives 4.0 Internacional

La licencia de este ítem se describe como Attribution-NonCommercial-NoDerivatives 4.0 Internacional