DKYoon/CulturaX_ko_500k
收藏Hugging Face2024-11-16 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/DKYoon/CulturaX_ko_500k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含500,000个训练样本,每个样本包含四个字段:文本(text)、时间戳(timestamp)、URL(url)和来源(source)。数据集的总大小为2,633,185,940.93字节,下载大小为1,563,638,559字节。数据以train分割的形式提供,路径为data/train-*。
The dataset contains 500,000 training samples, each with four fields: text, timestamp, URL, and source. The total size of the dataset is 2,633,185,940.93 bytes, with a download size of 1,563,638,559 bytes. The data is provided in a train split, with the path being data/train-*.
提供机构:
DKYoon



