
Gemini 1.0 Pro, Claude 3 Sonnet, Microsoft Copilot, and ChatGPT-4 responses on the Test of Understanding Graphs in Kinematics (TUG-K), April 2024

Indexed from NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/11183802
Resource description:
The data contains 30 responses from each chatbot to the 26 items of the TUG-K survey. The chatbots tested were Google Gemini (freely available version, Gemini 1.0 Pro), Claude 3 Sonnet, Microsoft Copilot (freely available version, balanced setting), and ChatGPT-4 (subscription-based, ChatGPT Plus). For Copilot and Gemini 1.0 Pro, each prompt consisted of a screenshot of the test item and the sentence "Answer the question in the image". For Claude 3 Sonnet and ChatGPT-4, the prompt consisted of the screenshot only. The data was collected in April 2024 and continues the research data on ChatGPT-4's performance on the TUG-K (DOI: 10.5281/zenodo.10429075).
Created:
2024-06-20