ChatGPT vs. sleep disorder specialist responses to common sleep queries: Ratings by experts and laypeople

  • Jiyoung Kim
  • , Seo Young Lee
  • , Jee Hyun Kim
  • , Dong Hyeon Shin
  • , Eun Hye Oh
  • , Jin A. Kim
  • , Jae Wook Cho

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Background: Many individuals use the Internet, including generative artificial intelligence like ChatGPT, for sleep-related information before consulting medical professionals. This study compared responses from sleep disorder specialists and ChatGPT to common sleep queries, with experts and laypersons evaluating the responses' accuracy and clarity. Methods: We assessed responses from sleep medicine specialists and ChatGPT-4 to 140 sleep-related questions from the Korean Sleep Research Society's website. In a blinded study design, sleep disorder experts and laypersons rated the medical helpfulness, emotional supportiveness, and sentence comprehensibility of the responses on a 1-5 scale. Results: Laypersons rated ChatGPT higher for medical helpfulness (3.79 ± 0.90 vs. 3.44 ± 0.99, p < .001), emotional supportiveness (3.48 ± 0.79 vs. 3.12 ± 0.98, p < .001), and sentence comprehensibility (4.24 ± 0.79 vs. 4.14 ± 0.96, p = .028). Experts also rated ChatGPT higher for emotional supportiveness (3.33 ± 0.62 vs. 3.01 ± 0.67, p < .001) but preferred specialists' responses for sentence comprehensibility (4.15 ± 0.74 vs. 3.94 ± 0.90, p < .001). When it comes to medical helpfulness, the experts rated the specialists' answers slightly higher than the laypersons did (3.70 ± 0.84 vs. 3.63 ± 0.87, p = .109). Experts slightly preferred specialist responses overall (56.0%), while laypersons favored ChatGPT (54.3%; p < .001). ChatGPT's responses were significantly longer (186.76 ± 39.04 vs. 113.16 ± 95.77 words, p < .001). Discussion: Generative artificial intelligence like ChatGPT may help disseminate sleep-related medical information online. Laypersons appear to prefer ChatGPT's detailed, emotionally supportive responses over those from sleep disorder specialists.

Original languageEnglish
Pages (from-to)665-670
Number of pages6
JournalSleep Health
Volume10
Issue number6
DOIs
StatePublished - Dec 2024

Bibliographical note

Publisher Copyright:
© 2024 National Sleep Foundation

Keywords

  • Artificial intelligence
  • Health information seeking behavior
  • Medical informatics
  • Patient education as topic
  • Sleep disorders

Fingerprint

Dive into the research topics of 'ChatGPT vs. sleep disorder specialist responses to common sleep queries: Ratings by experts and laypeople'. Together they form a unique fingerprint.

Cite this