metricas
covid
Annals of Hepatology EVIDENCE-BASED DIGITAL SUPPORT IN HEPATOLOGY: RETRIEVAL-AUGMENTED GENERATION&apo...
Journal Information
Vol. 30. Issue S2.
Abstracts of the 2025 Annual Meeting of the ALEH
(September 2025)
Vol. 30. Issue S2.
Abstracts of the 2025 Annual Meeting of the ALEH
(September 2025)
#53
Full text access
EVIDENCE-BASED DIGITAL SUPPORT IN HEPATOLOGY: RETRIEVAL-AUGMENTED GENERATION'S ROLE IN AUTOIMMUNE LIVER DISEASES MANAGEMENT
Visits
186
Ezequiel Ridruejo1, Ernesto Saenz2, Jimmy Daza3, Heike Bantel4, Marcos Girala5, Matthias Ebert6, Florian Van Bommel7, Andreas Geier8, Andres Gomez Aldana9, Mario Reis Alvares-da-Silvai10, Markus Peck-Radosavljevicj11, Frank Tacke12, Arndt Weinmann13, Juan Turnes14, Javier Pazo14, Andreas Teufel3
1 CEMIC, Argentina.
2 Division of Hepatology. Division of Clinical Bioinformatics. Department of Medicine II. Medical Faculty Mannheim. Heidelberg University, Germany.
3 Division of Hepatology. Department of Medicine II. Medical Faculty Mannheim. Heidelberg University, Germany.
4 Department of Gastroenterology. Hepatology. Infectious Diseases and Endocrinology. Hannover Medical School, Germany.
5 Department of Gastroenterology. Hospital de Clínicas. Universidad Nacional de Asunción, Paraguay.
6 Department of Medicine II. Medical Faculty Mannheim. Heidelberg University, Germany.
7 Department of Medicine II. Clinic of Gastroenterology. Hepatology. Infectious Diseases and Pneumology. Leipzig University Medical Center, Germany.
8 Department of Internal Medicine II. Division of Hepatology, Germany.
9 Texas Liver Institute. University of Texas Health Science Center, USA.
10 Department of Gastroenterology. Hospital de Clínicas de Porto Alegre. Universidade Federal do Rio Grande do Sul, Brasil.
11 Internal Medicine and Gastroenterology (IMuG). Clinic Klagenfurt am Woerthersee, Austria.
12 Department of Hepatology and Gastroenterology. Charité - Universitätsmedizin Berlin. Campus Virchow-Klinikum and Campus Charité Mitte, Germany.
13 Department of Internal Medicine I. University Medical Center of the Johannes Gutenberg-University Mainz, Germany.
14 Gastroenterology and Hepatology. Pontevedra University Hospital Complex, España.
Ver más
This item has received
Article information
Abstract
Full Text
Download PDF
Statistics
Special issue
This article is part of special issue:
Vol. 30. Issue S2

Abstracts of the 2025 Annual Meeting of the ALEH

More info
Introduction and Objectives

Autoimmune liver diseases (AILDs) present significant diagnostic and management challenges. Following our initial evaluation of Large Language Models (LLMs), we developed and assessed three specialized Retrieval-Augmented Generation (RAG) systems. These systems incorporated comprehensive clinical guidelines and medication safety information to enhance decision support accuracy. Our aim was to evaluate the effectiveness of Retrieval-augmented AI systems in providing evidence-based recommendations for AILD management.

Materials and Methods

We engineered three distinct RAG systems: HepaChat, RAG-ChatGPT, and RAG-Claude. Each system integrated 13 international clinical guidelines spanning AIH, PBC, and PSC management. Additionally, we incorporated a comprehensive database containing 12,465 FDA medication warnings to ensure safety protocol adherence. Ten liver specialists (six European, four American) evaluated system responses to 56 standardized clinical questions using a 1-10 Likert scale. Questions addressed disease comprehension, therapeutic approaches, and clinical decision-making across all three major AILDs.

Results

Quantitative analysis revealed HepaChat's superior performance (mean score 7.58±1.48) with 33 best-rated responses, compared to RAG-Claude (7.22±1.58, 12 best-rated) and RAG-ChatGPT (7.21±1.67, 9 best-rated). Geographic stratification unveiled variations in evaluation patterns (Americas: 7.97 vs Europe: 6.40). Disease-specific analysis demonstrated HepaChat's excellence in AIH (Europe: 7.12, Americas: 8.17) and PSC management in Europe (6.89), while achieving optimal performance in AIH and PBC in the Americas (8.17 and 8.37, respectively). All three systems showed marked improvement over conventional LLMs (2023 benchmark: 6.72±1.67).

Conclusions

This evaluation demonstrates that specialized RAG systems incorporating clinical guidelines and safety protocols can significantly enhance AILD management support. Geographic variations in assessment highlight the importance of considering regional clinical perspectives in AI system development.

Full Text

Conflict of interest: None

Download PDF
Article options
Tools