claran/modular-s2orc-parquet
收藏Hugging Face2024-10-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/claran/modular-s2orc-parquet
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了农业与食品科学、艺术和生物学领域的数据,涵盖了从1970年到2022年的不同时间段。每个时间段的数据集都包含了丰富的特征,如添加时间、属性、元数据(如外部研究领域、来源、年份等)以及文本内容。数据集被分割为训练集、验证集和测试集,每个分割的字节数和示例数都有详细记录。
This dataset contains data from the fields of Agricultural and Food Sciences, Art, and Biology, spanning different time periods from 1970 to 2022. Each time periods dataset includes rich features such as added time, attributes, metadata (e.g., external fields of study, provenance, year), and text content. The dataset is divided into training, validation, and test sets, with detailed records of the number of bytes and examples for each split.
提供机构:
claran



