vipinpalhugging/dbpedia-label-en-testing-v1
Source: Hugging Face. Updated: 2026-01-21. Indexed: 2026-03-29.
Download link:
https://hf-mirror.com/datasets/vipinpalhugging/dbpedia-label-en-testing-v1
Resource description:
---
license: cc-by-sa-3.0
task_categories:
- text-generation
- feature-extraction
language:
- en
tags:
- rdf
- knowledge-graph
- semantic-web
- triples
size_categories:
- 10K<n<100K
---
# DBpedia English Labels
## Dataset Description
Entity labels from DBpedia (English)
**Original Source:** https://downloads.dbpedia.org/repo/dbpedia/generic/labels/2022.12.01/labels_lang=en.ttl.bz2
### Dataset Summary
This dataset contains RDF triples from DBpedia English Labels converted to
HuggingFace dataset format for easy use in machine learning pipelines.
- **Format:** Originally Turtle, converted to HuggingFace Dataset
- **Size:** 1.0 GB (extracted)
- **Entities:** ~9.5M
- **Triples:** ~9.5M
- **Original License:** CC BY-SA 3.0
### Recommended Use
Entity naming, label learning, medium-scale training
## RDF Format
This dataset uses a standard lossless format for representing RDF triples.
Each triple is a row with 6 fields:
- `subject`: Subject URI or blank node
- `predicate`: Predicate URI
- `object`: Object value (URI, literal, or blank node)
- `object_type`: Type of object (`uri`, `literal`, or `blank_node`)
- `object_datatype`: XSD datatype URI (for typed literals)
- `object_language`: Language tag (for language-tagged literals)
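
To illustrate how the six fields fit together, the following minimal sketch turns one row back into an N-Triples line. The helper names (`escape_literal`, `format_object`, `row_to_ntriples`) and the example row are hypothetical. The sketch also assumes that unused fields are `None` or empty strings and that blank node subjects carry the `_:` prefix; check both against the actual data before relying on it.

```python
def escape_literal(value: str) -> str:
    """Escape the characters that N-Triples requires inside a quoted literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def format_object(row: dict) -> str:
    """Render the object term according to its `object_type`."""
    obj, kind = row["object"], row["object_type"]
    if kind == "uri":
        return f"<{obj}>"
    if kind == "blank_node":
        # Assumption: the stored ID may or may not include the "_:" prefix.
        return obj if obj.startswith("_:") else f"_:{obj}"
    literal = f'"{escape_literal(obj)}"'
    if row.get("object_language"):
        return f"{literal}@{row['object_language']}"
    if row.get("object_datatype"):
        return f"{literal}^^<{row['object_datatype']}>"
    return literal


def row_to_ntriples(row: dict) -> str:
    """Serialize one row as a single N-Triples statement."""
    subject = row["subject"]
    # Assumption: blank node subjects are stored with the "_:" prefix.
    subj = subject if subject.startswith("_:") else f"<{subject}>"
    return f"{subj} <{row['predicate']}> {format_object(row)} ."


# Hypothetical row shaped like the six fields described above.
example = {
    "subject": "http://dbpedia.org/resource/Berlin",
    "predicate": "http://www.w3.org/2000/01/rdf-schema#label",
    "object": "Berlin",
    "object_type": "literal",
    "object_datatype": None,
    "object_language": "en",
}

print(row_to_ntriples(example))
# <http://dbpedia.org/resource/Berlin> <http://www.w3.org/2000/01/rdf-schema#label> "Berlin"@en .
```

Checking the language tag before the datatype reflects that a language-tagged literal does not carry a separate datatype suffix in N-Triples.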
### Loading the Dataset
```python
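from datasets import load_dataset

# Repository ID taken from this dataset's page; the "train" split is assumed.
dataset = load_dataset("vipinpalhugging/dbpedia-label-en-testing-v1")

# Print each triple as "subject predicate object".
for row in dataset["train"]:
    print(f"{row['subject']} {row['predicate']} {row['object']}")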
```
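
For the entity-naming use case, one possible approach is to collapse the rows into a lookup table from entity URI to label. The sketch below works on any iterable of row dictionaries, so it can be tried on the in-memory sample before being pointed at a loaded split. The function name `build_label_index` and the sample rows are hypothetical, and the sketch assumes that labels are stored under the `rdfs:label` predicate with an `en` language tag.

```python
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


def build_label_index(rows, language="en"):
    """Map each subject URI to its first label in the requested language."""
    labels = {}
    for row in rows:
        # Skip anything that is not an rdfs:label statement.
        if row["predicate"] != RDFS_LABEL:
            continue
        # Keep only literals tagged with the requested language.
        if row["object_type"] != "literal":
            continue
        if row.get("object_language") != language:
            continue
        # setdefault keeps the first label seen for a subject.
        labels.setdefault(row["subject"], row["object"])
    return labels


# Hypothetical rows for illustration only; not taken from the dataset.
sample_rows = [
    {"subject": "http://dbpedia.org/resource/Berlin", "predicate": RDFS_LABEL,
     "object": "Berlin", "object_type": "literal",
     "object_datatype": None, "object_language": "en"},
    {"subject": "http://dbpedia.org/resource/Paris", "predicate": RDFS_LABEL,
     "object": "Paris", "object_type": "literal",
     "object_datatype": None, "object_language": "en"},
    {"subject": "http://dbpedia.org/resource/Paris", "predicate": RDFS_LABEL,
     "object": "Paris (Stadt)", "object_type": "literal",
     "object_datatype": None, "object_language": "de"},
]

index = build_label_index(sample_rows)
print(index["http://dbpedia.org/resource/Paris"])  # Paris
```

With millions of rows the resulting dictionary can be large, so filtering to the entities of interest before building the index may be preferable.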
## Citation
If you use this dataset, please cite the original source:
**Dataset:** DBpedia English Labels
**URL:** https://downloads.dbpedia.org/repo/dbpedia/generic/labels/2022.12.01/labels_lang=en.ttl.bz2
**License:** CC BY-SA 3.0
## Conversion Details
- **Converted using:** [RDF to HuggingFace Incremental Converter](https://github.com/CleverThis/cleverernie)
- **Conversion date:** 2026-01-21
- **Format version:** 1.0
---
This dataset is part of the CleverThis knowledge graph collection.
Provider:
vipinpalhugging



