
vipinpalhugging/dbpedia-label-en-testing

Source: Hugging Face. Updated 2026-01-21. Indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/vipinpalhugging/dbpedia-label-en-testing
Resource description:
---
license: cc-by-sa-3.0
task_categories:
- text-generation
- feature-extraction
language:
- en
tags:
- rdf
- knowledge-graph
- semantic-web
- triples
size_categories:
- 10K<n<100K
---

# DBpedia English Labels

## Dataset Description

Entity labels from DBpedia (English)

**Original Source:** https://downloads.dbpedia.org/repo/dbpedia/generic/labels/2022.12.01/labels_lang=en.ttl.bz2

### Dataset Summary

This dataset contains RDF triples from DBpedia English Labels, converted to HuggingFace dataset format for easy use in machine learning pipelines.

- **Format:** Originally Turtle, converted to HuggingFace Dataset
- **Size:** 1.0 GB (extracted)
- **Entities:** ~9.5M
- **Triples:** ~9.5M
- **Original License:** CC BY-SA 3.0

### Recommended Use

Entity naming, label learning, medium-scale training

## RDF Format

This dataset uses a standard lossless format for representing RDF triples. Each triple is a row with 6 fields:

- `subject`: Subject URI or blank node
- `predicate`: Predicate URI
- `object`: Object value (URI, literal, or blank node)
- `object_type`: Type of object (`uri`, `literal`, or `blank_node`)
- `object_datatype`: XSD datatype URI (for typed literals)
- `object_language`: Language tag (for language-tagged literals)

### Loading the Dataset

```python
from datasets import load_dataset

# Load by full repository ID (namespace/name) as listed for this dataset.
dataset = load_dataset("vipinpalhugging/dbpedia-label-en-testing")

# Print each triple as "subject predicate object".
for row in dataset["train"]:
    print(f"{row['subject']} {row['predicate']} {row['object']}")
```

## Citation

If you use this dataset, please cite the original source:

**Dataset:** DBpedia English Labels
**URL:** https://downloads.dbpedia.org/repo/dbpedia/generic/labels/2022.12.01/labels_lang=en.ttl.bz2
**License:** CC BY-SA 3.0

## Conversion Details

- **Converted using:** [RDF to HuggingFace Incremental Converter](https://github.com/CleverThis/cleverernie)
- **Conversion date:** 2026-01-21
- **Format version:** 1.0

---

This dataset is part of the CleverThis knowledge graph collection.
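The six-field row layout described under "RDF Format" maps back onto N-Triples syntax fairly directly. The following is a minimal sketch of that mapping, not the converter's actual code. The helper names `escape_literal` and `row_to_ntriples` and the sample row are hypothetical. The sketch also assumes that unused `object_datatype` and `object_language` fields are stored as `None` or an empty string, and that blank nodes may or may not carry their `_:` prefix.

```python
def escape_literal(value: str) -> str:
    """Escape the characters N-Triples does not allow raw inside a quoted literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def format_node(value: str) -> str:
    """Format a subject or non-literal object as a blank node or an IRI."""
    # Assumption: blank nodes are recognisable by a leading "_:" prefix.
    return value if value.startswith("_:") else f"<{value}>"


def row_to_ntriples(row: dict) -> str:
    """Serialize one six-field row as a single N-Triples statement."""
    subj = format_node(row["subject"])
    pred = f"<{row['predicate']}>"

    kind = row["object_type"]
    if kind == "uri":
        obj = f"<{row['object']}>"
    elif kind == "blank_node":
        # Assumption: add the "_:" prefix if the stored value lacks it.
        raw = row["object"]
        obj = raw if raw.startswith("_:") else f"_:{raw}"
    elif kind == "literal":
        obj = f'"{escape_literal(row["object"])}"'
        # A literal carries either a language tag or a datatype, never both.
        if row.get("object_language"):
            obj += f"@{row['object_language']}"
        elif row.get("object_datatype"):
            obj += f"^^<{row['object_datatype']}>"
    else:
        raise ValueError(f"unknown object_type: {kind!r}")

    return f"{subj} {pred} {obj} ."


# Hypothetical sample row, shaped like an English rdfs:label triple.
example_row = {
    "subject": "http://dbpedia.org/resource/Berlin",
    "predicate": "http://www.w3.org/2000/01/rdf-schema#label",
    "object": "Berlin",
    "object_type": "literal",
    "object_datatype": None,
    "object_language": "en",
}
print(row_to_ntriples(example_row))
```

Because the function takes a plain `dict`, it can be applied to rows yielded by iterating over a loaded split, or to hand-built rows when testing.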
Provider:
vipinpalhugging