llm-compe-2025-kato/step2-evaluated-dataset-Qwen3-14B-cp32
收藏Hugging Face2025-08-20 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/llm-compe-2025-kato/step2-evaluated-dataset-Qwen3-14B-cp32
下载链接
链接失效反馈官方服务:
资源简介:
完整评估数据集(评分量表+LogP)包含使用综合评分量表评估和LogP评估的链式思维解释。该数据集源自另一个数据集,共有60个样本。提供了两种评估方法的结果,并详细说明了数据集的结构,包括多个字段。还解释了评估方法及其标准,以及数据集的潜在用途。
The Complete Evaluation Dataset (Rubric + LogP) contains chain-of-thought explanations evaluated using both comprehensive rubric assessment and LogP evaluation. Derived from another dataset, it consists of 60 samples in total. Evaluation results for both methods are provided, along with a detailed structure of the dataset featuring various fields. The evaluation methods and their criteria are explained, as well as the potential uses of the dataset.
提供机构:
llm-compe-2025-kato



