Are Generative AI systems Capable of Supporting Information Needs of Patients?

AI-generated keywords: Generative AI models Healthcare system Radiology imaging data Patient information needs Accountability

AI-generated Key Points

  • Patients managing complex illnesses like cancer face a daunting information challenge
  • Close interaction with healthcare experts improves patient learning and disease outcome
  • Generative AI models are being explored to support patient information needs in radiology imaging data
  • Thematic analysis identified common themes in discussions between participants and radiologists
  • Evaluation of ChatGPT-4V and MedFlamingo showed varying levels of response quality
  • Current generative AI systems may not adequately address patients' information needs for medical scans and reports
  • High error rates were observed during evaluation, raising concerns about misinformation being conveyed to patients
  • Models struggled with relevance and frequently produced irrelevant elaborations
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Shreya Rajagopal, Subhashis Hazarika, Sookyung Kim, Yan-ming Chiou, Jae Ho Sohn, Hari Subramonyam, Shiwali Mohan

License: CC BY 4.0

Abstract: Patients managing a complex illness such as cancer face a complex information challenge where they not only must learn about their illness but also how to manage it. Close interaction with healthcare experts (radiologists, oncologists) can improve patient learning and thereby, their disease outcome. However, this approach is resource intensive and takes expert time away from other critical tasks. Given the recent advancements in Generative AI models aimed at improving the healthcare system, our work investigates whether and how generative visual question answering systems can responsibly support patient information needs in the context of radiology imaging data. We conducted a formative need-finding study in which participants discussed chest computed tomography (CT) scans and associated radiology reports of a fictitious close relative with a cardiothoracic radiologist. Using thematic analysis of the conversation between participants and medical experts, we identified commonly occurring themes across interactions, including clarifying medical terminology, locating the problems mentioned in the report in the scanned image, understanding disease prognosis, discussing the next diagnostic steps, and comparing treatment options. Based on these themes, we evaluated two state-of-the-art generative visual language models against the radiologist's responses. Our results reveal variability in the quality of responses generated by the models across various themes. We highlight the importance of patient-facing generative AI systems to accommodate a diverse range of conversational themes, catering to the real-world informational needs of patients.

Submitted to arXiv on 31 Jan. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2402.00234v1

Patients managing complex illnesses like cancer often face a daunting information challenge. They not only need to understand their condition but also learn how to effectively manage it. Close interaction with healthcare experts has been shown to improve patient learning and ultimately their disease outcome. However, this approach is resource-intensive and can take expert time away from other critical tasks. With recent advancements in Generative AI models aimed at improving the healthcare system, researchers set out to investigate whether and how generative visual question answering systems could responsibly support patient information needs in the context of radiology imaging data. Through a formative need-finding study involving discussions of chest computed tomography (CT) scans and associated radiology reports with a cardiothoracic radiologist, participants highlighted common themes such as clarifying medical terminology, locating issues mentioned in reports within scanned images, understanding disease prognosis, discussing diagnostic steps, and comparing treatment options. Thematic analysis of these interactions led to the identification of 91 content-based codes grouped into 10 broad themes that captured the essence of participant-radiologist conversations. Evaluation of two state-of-the-art generative visual language models - ChatGPT-4V and MedFlamingo - against responses provided by the radiologist revealed varying levels of response quality across different themes. While MedFlamingo tended to provide concise answers in clinical language that may be challenging for patients and caregivers to understand, ChatGPT-4V generated lengthy responses with generic descriptions without focusing on specific case details or filtering relevant information. The study's findings suggest that current generative AI systems may not adequately address patients' information needs when it comes to understanding medical scans and reports. High error rates were observed in both models during evaluation on real interaction questions, raising concerns about misinformation being conveyed to patients who may not possess medical expertise to discern inaccuracies. The models also struggled with relevance and frequently produced irrelevant elaborations rather than directly addressing questions asked. Overall, the research highlights a critical gap in evaluating generative AI systems for practical healthcare applications and underscores the challenges of ensuring accountable deployment in real-world scenarios where accuracy and relevance are paramount for supporting patients' informational needs effectively.
Created on 20 Feb. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.