five

OALL/details_Lina-Z__arabic_llm_test_v2

收藏
Hugging Face2025-11-13 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_Lina-Z__arabic_llm_test_v2
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是在评估模型 [Lina-Z/arabic_llm_test](https://huggingface.co/Lina-Z/arabic_llm_test) 的过程中自动创建的,包含116个配置,每个配置对应一个评估任务。数据集由1次运行的结果组成,每次运行的结果存储在以运行时间戳命名的特定分割中。train 分割始终指向最新的结果。还有一个名为 results 的额外配置,用于存储所有运行结果的汇总。内容还包括一个Python代码示例,说明如何从一个运行中加载数据集的详细信息。它还提供了关于特定运行的最新结果的信息,包括各种任务的准确性和标准误差。

The dataset is automatically created during the evaluation run of the model [Lina-Z/arabic_llm_test](https://huggingface.co/Lina-Z/arabic_llm_test) and contains 116 configurations, each corresponding to an evaluated task. The dataset includes results from 1 run, with each runs results stored in a specific split named by the timestamp of the run. The train split always points to the latest results. There is also a results configuration that stores all the aggregated results of the run. The content includes a Python code snippet on how to load the dataset details from a run. It also provides information about the latest results from a specific run, including the accuracy and standard error for various tasks.
提供机构:
OALL
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作