umarzein/wikipedia-headings-tree-7k
收藏Hugging Face2023-06-08 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/umarzein/wikipedia-headings-tree-7k
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-3.0
size_categories:
- 1K<n<10K
---
for context, deserialize the json into Node classes from: https://gist.github.com/UmarZein/4c46bc42323d0f61bd3494dec48f3fa4
the difference between this dataset and https://huggingface.co/datasets/umarzein/wikipedia-headings-20k is that this one is
more compact i.e.: the column `rootwards` is distinct so it saves space but you have to parse the json first
提供机构:
umarzein
原始信息汇总
数据集概述
许可信息
- 许可证: cc-by-3.0
数据集大小
- 大小范围: 1K<n<10K
数据集特点
- 相较于https://huggingface.co/datasets/umarzein/wikipedia-headings-20k,本数据集更为紧凑,主要体现在
rootwards列的独特性,有助于节省空间。但使用前需先解析JSON数据。



