hordruma/ontario_laws_and_regs
Hugging Face · updated 2023-12-23 · indexed 2024-03-04
Download link: https://hf-mirror.com/datasets/hordruma/ontario_laws_and_regs
Description:
---
license: mit
language:
- en
tags:
- law
- legal
- canada
- ontario
- legislation
- regulations
language_details: en-CA, en-GB
pretty_name: Ontario Law and Regulations
size_categories:
- 1K<n<10K
source_datasets:
- Ontario eLaws, Legislation & Regulations
task_categories:
- text-generation
- fill-mask
- text-retrieval
task_ids:
- language-modeling
- masked-language-modeling
- document-retrieval
viewer: true
---
## Ontario Laws & Regulations Dataset
# **⚖️Ontario Laws & Regs⚖️**
The Ontario Laws & Regs dataset contains 5,096 Ontario laws and regulations.
It covers every current and revoked law and regulation, each in its most recent version.
The dataset is distributed under the MIT license and is intended to facilitate ML and data tasks involving Ontario legislation.
In addition, a [scraper](https://github.com/hordruma/elaws_scraper) is provided that can capture different configurations of the data directly from the Ontario eLaws website.
## Structure🏛️
Each law or regulation is stored in its own JSON file, which contains the following fields and sub-fields:
1. act_info/reg_info: Act/Reg Info, includes:
-- full_title: Full Title
-- act_name_text/reg_name_text: Act/Reg Name Text
-- citation: Citation
-- act_under: For regulations only, the parent act.
-- url: URL
-- date_scraped: Date Scraped
2. copyright: Crown copyright blurb
-- Copyright: As required by eLaws TOS
3. versions: Versions, includes a list of versions containing:
-- a_href: URL slug for given version
-- valid_from: Valid from
-- valid_to: Valid to
4. current_regs: Current Regs (for Statutes), being a list of associated regulations, containing:
-- a_href: URL slug for given regulation
-- Citation: Citation
-- title: Title
5. revoked_regs: Revoked Regs (for Statutes), being a list of associated revoked regulations, containing:
-- revoked_reg_a_href: URL slug for given revoked regulation
-- revoked_reg_citation: Citation
-- revoked_reg_title: Title
6. content: Contents of the act or regulation, being a list of sections, containing:
-- id: Section #
-- section: Section title
-- content: Section content, as UTF-8 text
-- raw_html: Raw HTML of the section content, containing extra features
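
To make the field layout above concrete, here is a minimal sketch that flattens one document into per-section records, the kind of shape a retrieval or language-modeling pipeline might consume. The `sample` document is hypothetical and its values are invented for illustration; the helper names `doc_info` and `section_records` are likewise assumptions and are not part of the dataset or the scraper.

```python
import json

# Hypothetical miniature document shaped like the field list above.
# All values are invented for illustration, not taken from the dataset.
sample = {
    "reg_info": {
        "full_title": "Example Regulation",
        "reg_name_text": "EXAMPLE",
        "citation": "O. Reg. 0/00",
        "act_under": "Example Act",
        "url": "https://www.ontario.ca/laws",
        "date_scraped": "2023-12-23",
    },
    "versions": [],
    "content": [
        {"id": "1", "section": "Definitions",
         "content": "In this Regulation, ...", "raw_html": "<p>In this Regulation, ...</p>"},
        {"id": "2", "section": "Application",
         "content": "This Regulation applies to ...", "raw_html": "<p>This Regulation applies to ...</p>"},
    ],
}


def doc_info(doc):
    """Return ("act" | "regulation", info dict) based on which info key is present."""
    if "act_info" in doc:
        return "act", doc["act_info"]
    if "reg_info" in doc:
        return "regulation", doc["reg_info"]
    raise KeyError("document has neither act_info nor reg_info")


def section_records(doc):
    """Flatten a document into one plain-text record per section."""
    kind, info = doc_info(doc)
    return [
        {
            "kind": kind,
            "citation": info.get("citation"),
            "section_id": sec.get("id"),
            "section_title": sec.get("section"),
            "text": sec.get("content", ""),
        }
        for sec in doc.get("content", [])
    ]


records = section_records(sample)
print(json.dumps(records[0], indent=2))
```

The sketch reads the plain-text `content` field and ignores `raw_html`; a pipeline that needs the extra markup could carry `raw_html` through in the same way.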
## Collection 📥
Documents were sourced from the [Ontario eLaws Website](https://www.ontario.ca/laws).
[`eLaws Scraper`](https://github.com/hordruma/elaws_scraper/) was used to extract the documents and parse them into JSON files.
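
Once the JSON files are available locally, they can be read with the standard library alone. The following is a rough sketch under stated assumptions: the directory layout is not documented here, so the code simply searches a caller-supplied folder recursively, and the function names `load_documents` and `count_by_kind` are hypothetical. It distinguishes acts from regulations by checking for the `act_info` or `reg_info` key.

```python
import json
from pathlib import Path


def load_documents(root):
    """Load every *.json file under `root` into a {file stem: document} dict."""
    docs = {}
    for path in sorted(Path(root).rglob("*.json")):
        with path.open(encoding="utf-8") as fh:
            docs[path.stem] = json.load(fh)
    return docs


def count_by_kind(docs):
    """Count acts and regulations by the info key each document carries."""
    counts = {"act": 0, "regulation": 0, "unknown": 0}
    for doc in docs.values():
        if "act_info" in doc:
            counts["act"] += 1
        elif "reg_info" in doc:
            counts["regulation"] += 1
        else:
            counts["unknown"] += 1
    return counts
```

A call such as `count_by_kind(load_documents("path/to/json_files"))` (the path is a placeholder) would then give a quick sanity check on how many files of each kind were loaded. Note that files sharing a stem in different subfolders would overwrite each other in this sketch.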
## Licence 📜
Both the dataset and its associated scraper are distributed under the MIT license.
Provider: hordruma



