five

Supplementary Material for: Improving Accuracy and Source Transparency in Responses to Soft Tissue Sarcoma Queries using GPT-4o Enhanced with German Evidence-Based Guidelines

收藏
DataCite Commons2025-03-01 更新2025-05-07 收录
下载链接:
https://karger.figshare.com/articles/dataset/Supplementary_Material_for_Improving_Accuracy_and_Source_Transparency_in_Responses_to_Soft_Tissue_Sarcoma_Queries_using_GPT-4o_Enhanced_with_German_Evidence-Based_Guidelines/28513592
下载链接
链接失效反馈
官方服务:
资源简介:
Introduction: This study aimed to evaluate the effectiveness of GPT-4o, with and without Retrieval-Augmented Generation (RAG), in responding to soft tissue sarcoma (STS)-related queries. Methods: The study used a 20-question dataset derived from clinical scenarios related to adult STS. The responses were generated by GPT-4o with and without the RAG approach. The RAG system incorporated the English version of German evidence-based S3 guidelines through an embedding-based retrieval system. Two sarcoma experts evaluated the responses for accuracy, comprehensiveness, and safety using a Likert scale. Statistical analyses were conducted to compare the performances. Results: GPT-4o with RAG outperformed the model without RAG across all evaluated areas (p<0.05). GPT-4o without RAG had a 40% error rate, which was reduced to 10% by the RAG approach. In 90% of the questions, the pages with the relevant information that addressed the questions were correctly cited using the retrieval system. Conclusion: The RAG approach significantly enhanced the performance of GPT-4o in answering STS-related questions. However, the model still produced incorrect responses in certain complex scenarios. GPT-4o, even with RAG, should be used cautiously in clinical settings, particularly for rare diseases like sarcoma. Human expertise remains irreplaceable in medical decision-making.
提供机构:
Karger Publishers
创建时间:
2025-02-28
二维码
社区交流群
二维码
科研交流群
商业服务