OALL/details_yellowtown__7B-v0.2_v2_alrage
收藏Hugging Face2025-02-14 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_yellowtown__7B-v0.2_v2_alrage
下载链接
链接失效反馈官方服务:
资源简介:
在yellowtown/7B-v0.2模型评估期间自动创建的数据集。该数据集包含一个配置,每个配置对应于评估的任务之一。数据集由两次运行的结果组成。results配置存储了所有运行聚合的结果。可以使用datasets库加载数据集,并提供了示例。最新结果包括LLM_as_judge和llm_as_judge_stderr等指标。
Dataset automatically created during the evaluation of the yellowtown/7B-v0.2 model. The dataset consists of one configuration corresponding to one of the evaluated tasks and includes results from two runs. The results configuration stores aggregated results of all runs. The dataset can be loaded using the datasets library, with an example provided. The latest results include metrics such as llm_as_judge and llm_as_judge_stderr.
提供机构:
OALL



