SINAI/ALIA-legal-administrative
收藏Hugging Face2026-02-19 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/SINAI/ALIA-legal-administrative
下载链接
链接失效反馈官方服务:
资源简介:
ALIA法律行政语料库是一个开放获取的数据资源,它汇编和组织了大量来自西班牙法律和行政领域的官方文件。它的目的是为研究人员、学者、法律专业人士和公共管理实践者提供一个统一、结构化和可访问的文献基础,他们有兴趣分析和利用西班牙的规范性、立法性和行政性文本。这个语料库采用综合方法设计,涵盖了国家、地区和省级官方公报、专业登记册、部长级文件、公共招标和合同,以及安达卢西亚议会的议会程序。这种多样性使得它可以全面覆盖规范机构、经济和社会活动的文献生态系统。语料库的范围,超过700万个实例和超过50亿个token,使其成为学术研究西班牙法规、比较立法分析、开发应用于法律行政语言的自然语言处理(NLP)工具以及研究机构开放数据的前所未有的来源。其开放和处理过的性质既便于法律专业人士和文献专家的手动探索,也便于在文本挖掘项目、语义建模、信息检索以及构建专门从事法律和公共管理的智能系统中的高级应用。
The ALIA Legal and Administrative Corpus is an open-access data resource that compiles and organizes an extensive collection of official documents from the Spanish legal and administrative domain. Its purpose is to provide a homogeneous, structured, and accessible documentary base for researchers, academics, legal professionals, and public administration practitioners interested in the analysis and exploitation of normative, legislative, and administrative texts in Spanish. This corpus has been designed with an integrative approach that encompasses state, regional, and provincial official bulletins, specialized registries, ministerial documents in key areas such as energy, environment, climate change, defense, and national security, public tenders and contracts, as well as parliamentary proceedings from the Andalusian Parliament. This diversity allows for comprehensive coverage of the documentary ecosystem that regulates institutional, economic, and social activity in Spain. The scope of the corpus, with over 7 million instances and more than 5 billion tokens, makes it an unprecedented source for academic study of Spanish regulations, comparative legislative analysis, development of natural language processing (NLP) tools applied to legal-administrative language, and research in institutional open data. Its open and processed nature facilitates both manual exploration by legal professionals and documentation specialists, as well as advanced utilization in text mining projects, semantic modeling, information retrieval, and construction of artificial intelligence systems specialized in law and public administration.
提供机构:
SINAI



