Comparative Evaluation of AI-Based Systems for Tinnitus

dc.authorscopusid 57077002600
dc.authorscopusid 57222759194
dc.contributor.author Yalınkılıç, A.
dc.contributor.author Erdem, M.Z.
dc.date.accessioned 2025-09-03T16:38:43Z
dc.date.available 2025-09-03T16:38:43Z
dc.date.issued 2025
dc.department T.C. Van Yüzüncü Yıl Üniversitesi en_US
dc.department-temp [Yalınkılıç A.] Van Yüzüncü Yıl University, Faculty of Medicine, Department of Otorhinolaryngology, Van, Turkey; [Erdem M.Z.] Van Yüzüncü Yıl University, Faculty of Medicine, Department of Otorhinolaryngology, Van, Turkey en_US
dc.description.abstract Introduction: Today, with the development of technology, the variety of information sources has increased. It is now possible to access information obtained from encyclopedias in seconds with a few clicks of a button. Rapid developments in artificial in telligence (AI) and the widespread use of large language models (LLMs) such as ChatGPT, Gemini, and Perplexity have revolutionized access to medi cal information. However, the accuracy and readability of the answers provided by these models are critical, especially in the healthcare domain. This study evaluates the performance of ChatGPT, Gemini, and Perplexity in addressing frequently asked questions abou t tinnitus, a common symptom in otolaryngology practice. Materials and Methods: Twenty frequently asked questions about tinnitus were posed to the models and their responses were evaluated by two otolaryngologists using global quality (GQS) and Likert scales for accuracy and reliability and the Gunni ng-Fog Index (GFI) for readability. Results: The findings reveal no significant difference in the reliability and quality of information between the models, but it was ob served that Gemini came out ahead in readability and ChatGPT in accuracy. However, Perplexity lagged in both metrics. These results highlight the varying strengths and weaknesses of LLMs, emphasizing the importance of model selection based on user needs. For example, ChatGPT is ideal for complex medical information, while Gemini is more accessible to wider audiences. Conclusion: This study demonstrates the potential of AI-enabled systems in healthcare; however, we suggest that future improvements should increase both accuracy and accessibility. © 2025, Yuzuncu Yil Universitesi Tip Fakultesi. All rights reserved. en_US
dc.identifier.doi 10.5505/VMJ.2025.96268
dc.identifier.endpage 117 en_US
dc.identifier.issn 1300-2694
dc.identifier.issue 3 en_US
dc.identifier.scopus 2-s2.0-105011480788
dc.identifier.scopusquality N/A
dc.identifier.startpage 113 en_US
dc.identifier.uri https://doi.org/10.5505/VMJ.2025.96268
dc.identifier.uri https://hdl.handle.net/20.500.14720/28371
dc.identifier.volume 32 en_US
dc.identifier.wosquality N/A
dc.language.iso en en_US
dc.publisher Yuzuncu Yil Universitesi Tip Fakultesi en_US
dc.relation.ispartof Van Medical Journal en_US
dc.relation.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Chatbots en_US
dc.subject ChatGPT en_US
dc.subject Large Language Models en_US
dc.subject Tinnitus en_US
dc.title Comparative Evaluation of AI-Based Systems for Tinnitus en_US
dc.type Article en_US
dspace.entity.type Publication

Files