1lock/finest_dataset_2025_02_06_00
收藏Hugging Face2025-02-06 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/1lock/finest_dataset_2025_02_06_00
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了文本信息、唯一标识符和元数据信息。文本信息是数据集的主要组成部分,每个文本都有一个唯一标识符。元数据提供了关于文本的额外信息,如日期、文件路径、语言类型、语言置信度分数、词的数量和文本的URL。数据集分为训练集,共有约1.5万个示例,数据集的总大小为57.5MB。
The dataset consists of text entries, each with a unique identifier and metadata information. The text entries are the primary component of the dataset. Metadata provides additional information about the text, such as date, file path, language type, language confidence score, word count, and the texts URL. The dataset is split into a training set with approximately 15,000 examples, and the total size of the dataset is 57.5MB.
提供机构:
1lock



