Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease)

Beunza Nuin, Juan José; Puertas Sanz, Enrique; García Ovejero, Ester; Villalba, Gema; Condés Moreno, Emilia; Koleva, Gergana; Hurtado, Cristian; Landecho, Manuel

Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease)

dc.contributor.author	Beunza Nuin, Juan José
dc.contributor.author	Puertas Sanz, Enrique
dc.contributor.author	García Ovejero, Ester
dc.contributor.author	Villalba, Gema
dc.contributor.author	Condés Moreno, Emilia
dc.contributor.author	Koleva, Gergana
dc.contributor.author	Hurtado, Cristian
dc.contributor.author	Landecho, Manuel
dc.date.accessioned	2019-11-13T13:06:30Z
dc.date.available	2019-11-13T13:06:30Z
dc.date.issued	2019
dc.description.abstract	The aim of this study is to compare the utility of several supervised machine learning (ML) algorithms for predicting clinical events in terms of their internal validity and accuracy. The results, which were obtained using two statistical software platforms, were also compared. The data used in this research come from the open database of the Framingham Heart Study, which originated in 1948 in Framingham, Massachusetts as a prospective study of risk factors for cardiovascular disease. Through data mining processes, three data models were elaborated and a comparative methodological study between the different ML algorithms – decision tree, random forest, support vector machines, neural networks, and logistic regression – was carried out. The global selection criterium for choosing the right set of hyperparameters and the type of data manipulation was the area under a curve (AUC). The software tools used to analyze the data were R-Studio® and RapidMiner®. The Framingham study open database contains 4240 observations. The algorithm that yielded the greatest AUC when analyzing the data in R-Studio was neural network applied to a model that excluded all observations in which there was at least one missing value (AUC = 0.71); when analyzing the data in RapidMiner and applying the same model, the best algorithm was support vector machines (AUC = 0.75). ML algorithms can reinforce the diagnostic and prognostic capacity of traditional regression techniques. Differences between the applicability of those algorithms and the results obtained with them were a function of the software platforms used in the data analysis.	spa
dc.description.filiation	UEM	spa
dc.description.impact	3.526 JCR (2019) Q2, 32/109 Computer Science, Interdisciplinary Applications, 7/27 Medical Informatics	spa
dc.description.impact	1.140 SJR (2019) Q1, 115/1377 Computer Science Applications, 10/141 Health Informatics	spa
dc.description.impact	No data IDR 2019	spa
dc.description.sponsorship	2019/UEM11	spa
dc.identifier.citation	Beunza, J. J., Puertas, E., García-Ovejero, E., Villalba, G., Condes, E., Koleva, G., ... & Landecho, M. F. (2019). Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease). Journal of biomedical informatics, 97. https://doi.org/10.1016/j.jbi.2019.103257	spa
dc.identifier.doi	10.1016/j.jbi.2019.103257
dc.identifier.issn	1532-0464
dc.identifier.uri	http://hdl.handle.net/11268/8396
dc.language.iso	eng	spa
dc.peerreviewed	Si	spa
dc.rights.accessRights	restricted access	spa
dc.subject.uem	Informática médica	spa
dc.subject.unesco	Informática	spa
dc.subject.unesco	Ciencias médicas	spa
dc.title	Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease)	spa
dc.type	journal article	spa
dspace.entity.type	Publication
relation.isAuthorOfPublication	ef9d544b-877a-4552-ba1d-61060f9c17ae
relation.isAuthorOfPublication	001b7f40-b837-4929-82ca-df26041a995a
relation.isAuthorOfPublication	add36d97-9c9a-41d8-91f2-5aee4000b5f9
relation.isAuthorOfPublication.latestForDiscovery	ef9d544b-877a-4552-ba1d-61060f9c17ae

Collections

Área de TIC y Bienestar

Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease)

Files

Collections