ChatGPT performance in prosthodontics: Assessment of accuracy and repeatability in answer generation.

Freire Mancebo, Yolanda; Santamaría Laorden, Andrea; Orejas Pérez, Jaime; Gómez Sánchez, Margarita; Díaz-Flores García, Víctor; Suárez García, Ana

ChatGPT performance in prosthodontics: Assessment of accuracy and repeatability in answer generation.

Files

Suárez Gil_2024.pdf (928.95 KB)

Identifiers

URI: http://hdl.handle.net/11268/13123

DOI: 10.1016/j.prosdent.2024.01.018

Publication date

2024

Authors

Freire Mancebo, Yolanda

Santamaría Laorden, Andrea

Orejas Pérez, Jaime

Gómez Sánchez, Margarita

Díaz-Flores García, Víctor

SDG

Metrics

Abstract

Statement of problem The artificial intelligence (AI) software program ChatGPT is based on large language models (LLMs) and is widely accessible. However, in prosthodontics, little is known about its performance in generating answers. Purpose:The purpose of this study was to determine the performance of ChatGPT in generating answers about removable dental prostheses (RDPs) and tooth-supported fixed dental prostheses (FDPs). Material and methods: Thirty short questions were designed about RDPs and tooth-supported FDP, and 30 answers were generated for each of the questions using ChatGPT-4 in October 2023. The 900 generated answers were independently graded by experts using a 3-point Likert scale. The relative frequency and absolute percentage of answers were described. Accuracy was assessed using the Wald binomial method, while repeatability was evaluated using percentage agreement, Brennan and Prediger coefficient, Conger generalized Cohen kappa, Fleiss kappa, Gwet AC, and Krippendorff alpha methods. Confidence intervals were set at 95%. Statistical analysis was performed using the STATA software program. Results: The performance of ChatGPT in generating answers related to RDP and tooth-supported FDP was limited. The answers showed a reliability of 25.6%, with a confidence range between 22.9% and 28.6%. The repeatability ranged from substantial to moderate. Conclusions:The results show that currently ChatGPT has limited ability to generate answers related to RDPs and tooth-supported FDPs. Therefore, ChatGPT cannot replace a dentist, and, if professionals were to use it, they should be aware of its limitations.

UNESCO Subjects

Odontología
Inteligencia artificial

Bibliographic reference

Freire, Y., Santamaría Laorden, A., Orejas Pérez, J., Gómez Sánchez, M., Díaz-Flores García, V., & Suárez, A. (2024). ChatGPT performance in prosthodontics: Assessment of accuracy and repeatability in answer generation. The Journal of Prosthetic Dentistry, 131(4), 659.e1-659.e6. https://doi.org/10.1016/j.prosdent.2024.01.018

Editor's version

https://doi.org/10.1016/j.prosdent.2024.01.018

Type of document

journal article

Collections

Otras Áreas de Investigación de Salud

Full item page

La licencia de este ítem se describe como Attribution-NonCommercial-NoDerivatives 4.0 Internacional

ChatGPT performance in prosthodontics: Assessment of accuracy and repeatability in answer generation.

Files

Identifiers

Publication date

Authors

Advisors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

SDG

Metrics

Research Projects

Organizational Units

Journal Issue

Abstract

Description

UNESCO Subjects

Keywords

Bibliographic reference

Editor's version

Type of document

Collections