EuropeanParliament/Eurovoc_2025
收藏Hugging Face2025-12-01 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/EuropeanParliament/Eurovoc_2025
下载链接
链接失效反馈官方服务:
资源简介:
Eurovoc 2025数据集包含3,945,170个文档,这些文档都附有EuroVoc标签。EuroVoc是一个大型多学科层次性词库,包含超过7000个类别,涵盖了欧盟机构的活动。该数据集是在2025年7月2日创建的,通过一个名为European Parliament Registry Document Scraper的Python脚本抓取欧洲议会注册办公室的公共文档,并将它们保存为结构化的JSONL格式。
The Eurovoc 2025 dataset contains 3,945,170 documents with associated EuroVoc labels. EuroVoc is a large multidisciplinary hierarchical thesaurus of more than 7000 classes covering the activities of EU institutions. The dataset was created on July 2nd, 2025, using a Python script called European Parliament Registry Document Scraper to scrape public documents from the European Parliament Registry Office and save them in a structured JSONL format.
提供机构:
EuropeanParliament



