richmondsin/truthfulqa_id_mc2_results
收藏Hugging Face2024-12-03 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/richmondsin/truthfulqa_id_mc2_results
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是在评估模型google/gemma-2-2b时自动创建的。数据集由0个配置组成,每个配置对应一个评估任务。数据集由2次运行创建,每次运行可以在每个配置中找到特定的分割,分割名称使用运行的时间戳。train分割始终指向最新的结果。此外,还有一个名为results的配置存储了所有运行的聚合结果。
The dataset was automatically created during the evaluation run of the model google/gemma-2-2b. The dataset consists of configurations corresponding to evaluated tasks, with each run represented as a specific split named by the timestamp of the run. The train split always points to the latest results. There is an additional results configuration that stores aggregated results of the runs. The dataset is used to evaluate the performance of the model on specific tasks, such as truthfulqa_id_mc2, and includes metrics like accuracy and standard error.
提供机构:
richmondsin



