arumaekawa/wikipedia-ja
收藏Hugging Face2025-01-03 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/arumaekawa/wikipedia-ja
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两种配置:default和original。default配置仅包含文本数据,而original配置则包含文本数据的ID、URL、标题和内容。数据集全部为训练集,共有1389467条示例。
The dataset includes two configurations: default and original. The default configuration contains only text data, while the original configuration includes the ID, URL, title, and content of the text data. The entire dataset is for training, with a total of 1,389,467 examples.
提供机构:
arumaekawa



