Application of machine learning techniques for the characterization and early diagnosis of respiratory diseases such as COVID-19
Loading...
Identifiers
Publication date
Authors
López Rivero, Alfonso José
Chinchilla Corbacho, Carlos
Romero Arias, Tatiana
Martín Merino, Manuel
Vaz, Paulo
Advisors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
This paper presents a robust methodology for the early and cost-effective diagnosis of COVID19 based on vocal features and machine learning techniques. The proposed methodology addresses all challenges inherent to the prediction of COVID-19, including those related to feature extraction and selection, the
imbalance problem, and predictor training. In contrast to existing methodologies that rely solely on acoustic
attributes of the voice, such as intensity or frequency, our approach represents a pioneering investigation that
incorporates biomechanical aspects of vocal production. These include muscle tension, the coordination of
articulatory movements, and respiration. The relationship between these characteristics and the presence of
the virus is investigated rigorously using robust feature selection techniques. To this end, we have constructed
an original dataset comprising patients with confirmed cases of COVID-19 infection and a control group,
incorporating both acoustic and biomechanical features using Voice Clinical Software. The robustness
and reproducibility of the experimental results have been enhanced through the rigorous comparison of
several classifiers and feature selection algorithms, as well as the employment of resampling strategies. The
application of random forests for feature selection has revealed that a limited set of biomechanical markers
are significantly associated with the presence of COVID-19 infection. Moreover, a random forest classifier
based on a subset of biomechanical and acoustic features demonstrates high efficacy in predicting cases of
COVID-19 infection, achieving a sensitivity of S = (0.9212 ± 0.0775) while maintaining a specificity of Sp =
(0.9150 ± 0.0649). Considering these findings, the proposed methodology can be regarded as a non-invasive
and cost-effective alternative for the diagnosis of COVID-19 infection. Furthermore, it can be extended to
the diagnosis of other respiratory diseases, provided that the vocal cords are affected.
Description
UNESCO Subjects
Keywords
Bibliographic reference
López Rivero, A. J., Chinchilla Corbacho, C., Romero Arias, T., Martín-Merino, M., & Vaz, P. (2024). Application of machine learning techniques for the characterization and early diagnosis of respiratory diseases such as covid-19. IEEE Access, 12, 160516-160528. https://doi.org/10.1109/ACCESS.2024.3487773




