five

HINT-lab/Qwen2.5-7B-Instruct-Self-Calibration

收藏
Hugging Face2025-03-06 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/HINT-lab/Qwen2.5-7B-Instruct-Self-Calibration
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了用于评估大型语言模型测试时缩放方法有效性的多个推理数据集。每个数据集以不同的配置名称(config_name)区分,例如arc_easy、commonsense_qa等。这些数据集的特征包括输入文本、答案、加权一致性和一致性评分。具体的数据集描述如下: arc_easy:[待添加描述] commonsense_qa:[待添加描述] gsm8k:[待添加描述] logiqa:[待添加描述] openbookqa:[待添加描述] reclor:[待添加描述] sciq:[待添加描述] svamp:[待添加描述] winogrande:[待添加描述] 需要补充每个数据集的具体描述、统计数据和其他相关信息。

This dataset consists of multiple reasoning datasets for evaluating the effectiveness of test-time scaling methods for large language models (LLMs). Each dataset is distinguished by a different configuration name (config_name), such as arc_easy, commonsense_qa, etc. The features of these datasets include input text, answer, weighted consistency, and consistency scores. Detailed descriptions of each dataset are as follows: arc_easy: [Description to be added] commonsense_qa: [Description to be added] gsm8k: [Description to be added] logiqa: [Description to be added] openbookqa: [Description to be added] reclor: [Description to be added] sciq: [Description to be added] svamp: [Description to be added] winogrande: [Description to be added] Detailed descriptions, statistics, and other relevant information for each dataset need to be补充ed.
提供机构:
HINT-lab
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作