A comparative evaluation of ChatGPT 3.5 and ChatGPT 4 in responses to selected genetics questions - Full study data
收藏DataCite Commons2025-04-01 更新2025-04-09 收录
下载链接:
https://datadryad.org/dataset/doi:10.5061/dryad.s4mw6m9cv
下载链接
链接失效反馈官方服务:
资源简介:
Objective: Our objective is to evaluate the efficacy of ChatGPT 4 in
accurately and effectively delivering genetic information, building on
previous findings with ChatGPT 3.5. We focus on assessing the utility,
limitations, and ethical implications of using ChatGPT in medical
settings. Materials and Methods: A structured questionnaire, including the
Brief User Survey (BUS-15) and custom questions, was developed to assess
ChatGPT 4's clinical value. An expert panel of genetic counselors and
clinical geneticists independently evaluated ChatGPT 4's responses to
these questions. We also involved comparative analysis with ChatGPT 3.5,
utilizing descriptive statistics and using R for data analysis. Results:
ChatGPT 4 demonstrated improvements over 3.5 in context recognition,
relevance, and informativeness. However, performance variability and
concerns about the naturalness of the output were noted. No significant
difference in accuracy was found between ChatGPT 3.5 and 4.0. Notably, the
efficacy of ChatGPT 4 varied significantly across different genetic
conditions, with specific differences identified between responses related
to BRCA1 and HFE. Discussion and Conclusion: This study highlights ChatGPT
4's potential in genomics, noting significant advancements over its
predecessor. Despite these improvements, challenges remain, including the
risk of outdated information and the necessity of ongoing refinement. The
variability in performance across different genetic conditions underscores
the need for expert oversight and continuous AI training. ChatGPT 4, while
showing promise, emphasizes the importance of balancing technological
innovation with ethical responsibility in healthcare information delivery.
提供机构:
Dryad
创建时间:
2024-06-04



