Start Over

Evaluating Large Language Model (LLM) Performance on Established Breast Classification Systems.

Authors :: Haider, Syed Ali
Pressman, Sophia M.
Borna, Sahar
Gomez-Cabello, Cesar A.
Sehgal, Ajai
Leibovich, Bradley C.
Forte, Antonio Jorge
Source :: Diagnostics (2075-4418); Jul2024, Vol. 14 Issue 14, p1491, 16p
Publication Year :: 2024
Abstract: Medical researchers are increasingly utilizing advanced LLMs like ChatGPT-4 and Gemini to enhance diagnostic processes in the medical field. This research focuses on their ability to comprehend and apply complex medical classification systems for breast conditions, which can significantly aid plastic surgeons in making informed decisions for diagnosis and treatment, ultimately leading to improved patient outcomes. Fifty clinical scenarios were created to evaluate the classification accuracy of each LLM across five established breast-related classification systems. Scores from 0 to 2 were assigned to LLM responses to denote incorrect, partially correct, or completely correct classifications. Descriptive statistics were employed to compare the performances of ChatGPT-4 and Gemini. Gemini exhibited superior overall performance, achieving 98% accuracy compared to ChatGPT-4's 71%. While both models performed well in the Baker classification for capsular contracture and UTSW classification for gynecomastia, Gemini consistently outperformed ChatGPT-4 in other systems, such as the Fischer Grade Classification for gender-affirming mastectomy, Kajava Classification for ectopic breast tissue, and Regnault Classification for breast ptosis. With further development, integrating LLMs into plastic surgery practice will likely enhance diagnostic support and decision making. [ABSTRACT FROM AUTHOR]

Subjects :: MACHINE learning
LANGUAGE models
CHATGPT
ARTIFICIAL intelligence
PLASTIC surgeons
GYNECOMASTIA

Details

Language :: English
ISSN :: 20754418
Volume :: 14
Issue :: 14
Database :: Complementary Index
Journal :: Diagnostics (2075-4418)
Publication Type :: Academic Journal
Accession number :: 178689210
Full Text :: https://doi.org/10.3390/diagnostics14141491

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Evaluating Large Language Model (LLM) Performance on Established Breast Classification Systems.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Evaluating Large Language Model (LLM) Performance on Established Breast Classification Systems.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources