Clinicians should be aware that although artificial intelligence (AI) can generate ideas and references, any medical research content it produces must be thoroughly vetted and fact-checked.
Hong-Uyen Hua, MD, a recently graduated surgical retina fellow and the study's first author, reported that while AI can generate ideas and references, clinicians need to go a step further and thoroughly vet and fact-check any medical research content it produces.1 Hua, senior author Danny Mammo, MD, and colleagues are from the Cole Eye Institute, Cleveland Clinic Foundation, Cleveland.
Hua and colleagues pointed to the rapid growth in the popularity of AI chatbots and their potentially significant implications for patient education and academia. They also noted that the drawbacks of using these chatbots to generate abstracts and references have not been investigated thoroughly.
To address this gap, the research team conducted a cross-sectional comparative study evaluating and comparing the quality of ophthalmic scientific abstracts and references generated by earlier and updated versions of a popular AI chatbot.
The study used 2 versions of an AI chatbot to generate scientific abstracts and 10 references for clinical research questions across 7 ophthalmology subspecialties. Two of the authors graded the abstracts using modified DISCERN criteria and performance evaluation scores, and 2 AI output detectors also evaluated the abstracts. A so-called hallucination rate, the proportion of generated references that could not be verified, was calculated for the earlier and updated versions of the chatbot and compared.
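For illustration only, the hallucination rate reduces to a simple proportion: the number of generated citations that cannot be matched to a real publication, divided by the total number generated. The sketch below assumes a hypothetical is_verifiable check standing in for the manual verification the authors performed; it is not the study's actual workflow.

    def hallucination_rate(references, is_verifiable):
        """Proportion of chatbot-generated references that could not be verified.

        references: list of citation strings produced by the chatbot
        is_verifiable: hypothetical check returning True when a citation matches
            a real, indexed publication (the study verified references manually)
        """
        if not references:
            return 0.0
        unverified = sum(1 for ref in references if not is_verifiable(ref))
        return unverified / len(references)

    # Hypothetical example: if 3 of 10 generated citations cannot be verified,
    # the hallucination rate is 0.30 (30%), on the order of the roughly 33% and
    # 29% the study reported for the earlier and updated chatbot versions.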
The investigators found that the “mean modified AI-DISCERN scores for the chatbot-generated abstracts were 35.9 and 38.1 out of a maximal score of 50 for the earlier and updated versions, respectively (P = 0.30). Based on the 2 AI output detectors, the mean fake scores, with a score of 100% meaning generated by AI, for the earlier and updated chatbot-generated abstracts were 65.4% and 10.8%, respectively (P = 0.01) for 1 detector and 69.5% and 42.7% (P = 0.17) for the second detector. The mean hallucination rates for nonverifiable references generated by the earlier and updated versions were 33% and 29% (P = 0.74).”
These results indicate that abstract quality was comparable between the 2 versions of the chatbot. The mean hallucination rate for citations was approximately 30% and was also comparable between the versions.
Because both versions of the chatbot produced abstracts of average quality and hallucinated citations that appeared realistic, Hua and colleagues warned clinicians to be aware of the potential for factual errors or hallucinations. Any medical content produced by AI should be carefully vetted and fact-checked before it is used for health education or academic purposes.
Hua commented, “The idea for this study initially came while I was exploring generative AI chatbots and their possible applications in ophthalmology. I quickly realized that the chatbot was making up references—a term called ‘hallucinations’ in generative AI. On top of that, the chatbot was unable to distinguish nuances in the scientific literature (e.g. oral vs intravenous dosing of steroids in optic neuritis). Current AI detectors perform poorly in detecting AI-generated text, especially with the newer version of AI chatbots. The scientific community at large must be wary of the implications of using generative AI for research purposes.”
Hong-Uyen Hua, MD
E: honguyenhua@gmail.com
Hua recently completed a vitreoretinal surgery fellowship at the Cole Eye Institute, Cleveland Clinic Foundation, Cleveland. She has no financial interest in this subject matter.