TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_AT_OURS-SFT-commonsenseQA-eval_sft
收藏Hugging Face2025-11-10 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_AT_OURS-SFT-commonsenseQA-eval_sft
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含问题、答案以及与任务相关的配置信息。它还包括用于生成答案的提示,以及模型的响应和评估这些响应的元数据。此外,数据集还提供了关于模型响应的性能指标,如正确率等。数据集分为测试集,包含了大量的示例和相应的字节数。
The dataset includes questions, answers, and configuration information related to tasks. It also contains prompts used to generate answers, as well as model responses and metadata for evaluating these responses. Additionally, the dataset provides performance metrics for model responses, such as accuracy. The dataset is split into a test set, which includes a large number of examples and corresponding byte sizes.
提供机构:
TAUR-dev



