Evaluating ChatGPT’s ability to simplify scientific abstracts for clinicians and the public

Doğru Hüzmeli, Esra; Moore-Vasram, Sarah; Phadke, Chetan; Shafiee, Erfan; Amanullah, Shabbir

Evaluating ChatGPT’s ability to simplify scientific abstracts for clinicians and the public

dc.contributor.author	Doğru Hüzmeli, Esra
dc.contributor.author	Moore-Vasram, Sarah
dc.contributor.author	Phadke, Chetan
dc.contributor.author	Shafiee, Erfan
dc.contributor.author	Amanullah, Shabbir
dc.date.accessioned	2025-10-20T06:43:02Z
dc.date.available	2025-10-20T06:43:02Z
dc.date.issued	2025
dc.department	Fakülteler, Sağlık Bilimleri Fakültesi, Fizyoterapi ve Rehabilitasyon Bölümü
dc.description.abstract	This study evaluated ChatGPT’s ability to simplify scientific abstracts for both public and clinician use. Ten questions were developed to assess ChatGPT’s ability to simplify scientific abstracts and improve their readability for both the public and clinicians. These questions were applied to 43 abstracts. The abstracts were selected through a convenience sample from Google Scholar by four interdisciplinary reviewers from physiotherapy, occupational therapy, and nursing backgrounds. Each abstract was summarized by ChatGPT on two separate occasions. These summaries were then reviewed independently by two different reviewers. Flesch Reading Ease scores were calculated for each summary and original abstract. A subgroup analysis explored differences in accuracy, clarity, and consistency across various study designs. ChatGPT’s summaries scored higher on the Flesch Reading Ease test than the original abstracts in 31 out of 43 papers, showing a significant improvement in readability (p = 0.005). Systematic reviews and meta-analyses consistently received higher scores for accuracy, clarity, and consistency, while clinical trials scored lower across these parameters. Despite its strengths, ChatGPT showed limitations in “Hallucination presence” and “Technical terms usage,” scoring below 7 out of 10. Hallucination rates varied by study type, with case reports having the lowest scores. Reviewer agreement across parameters demonstrated consistency in evaluations. ChatGPT shows promise for translating knowledge in clinical settings, helping to make scientific research more accessible to non-experts. However, its tendency toward hallucinations and technical jargon requires careful review by clinicians, patients, and caregivers. Further research is needed to assess its reliability and safety for broader use in healthcare communication.
dc.identifier.doi	10.1038/s41598-025-11086-8
dc.identifier.issn	2045-2322
dc.identifier.issue	1
dc.identifier.pmid	41022954
dc.identifier.scopus	2-s2.0-105017650419
dc.identifier.scopusquality	Q1
dc.identifier.uri	https://doi.org/10.1038/s41598-025-11086-8
dc.identifier.uri	https://hdl.handle.net/11501/2465
dc.identifier.volume	15
dc.identifier.wos	WOS:001586165500028
dc.identifier.wosquality	Q1
dc.indekslendigikaynak	Scopus
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	PubMed
dc.institutionauthor	Doğru Hüzmeli, Esra
dc.institutionauthorid	0000-0002-7025-8192
dc.language.iso	en
dc.publisher	Nature Research
dc.relation.ispartof	Scientific Reports
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	ChatGPT
dc.subject	Flesch Reading Ease Score
dc.subject	Hallucination Presence
dc.subject	Healthcare Dissemination
dc.subject	Technical Terms
dc.title	Evaluating ChatGPT’s ability to simplify scientific abstracts for clinicians and the public
dc.type	Article

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1

İsim:: Tam Metin / Full Text
Boyut:: 1.95 MB
Biçim:: Adobe Portable Document Format

İndir

Lisans paketi

Listeleniyor 1 - 1 / 1

İsim:: license.txt
Boyut:: 1.17 KB
Biçim:: Item-specific license agreed to upon submission
Açıklama:

İndir

Koleksiyon

Sağlık Bilimleri Fakültesi Koleksiyonu
PubMed İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu
WoS İndeksli Yayınlar Koleksiyonu