JvPetas/aneel-legislacao
收藏Hugging Face2026-04-28 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/JvPetas/aneel-legislacao
下载链接
链接失效反馈官方服务:
资源简介:
ANEEL Legislação数据集是一个包含由巴西国家电力局(ANEEL)发布的立法和监管文件的语料库,涵盖2016年、2021年和2022年的文档。数据集包括27,060个文档,约3.57亿个字符,文档类型包括全文文本、投票、技术说明、附件、决定等。每个文档包含多个字段,如文档ID、类型、标题、摘要、主题、状态、发布日期、作者、年份、全文文本、是否包含表格、页数、提取质量评分和提取字符数。数据集以结构化JSON格式提供,适用于文本分类、文本生成和摘要等NLP任务。
The ANEEL Legislação dataset is a corpus of legislative and regulatory documents published by the Brazilian National Electric Energy Agency (ANEEL), covering the years 2016, 2021, and 2022. The dataset includes 27,060 documents with approximately 357 million characters, and document types include full text, votes, technical notes, annexes, decisions, and others. Each document contains multiple fields such as document ID, type, title, summary, subject, status, publication date, author, year, full text, presence of tables, number of pages, extraction quality score, and extracted character count. The dataset is provided in structured JSON format and is suitable for NLP tasks such as text classification, text generation, and summarization.
提供机构:
JvPetas



