quickmt/canadian_hansard
收藏Hugging Face2026-01-10 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/quickmt/canadian_hansard
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: fr
dtype: string
- name: en
dtype: string
- name: sco
dtype: float64
splits:
- name: train
num_bytes: 1728184700
num_examples: 2609639
download_size: 902906857
dataset_size: 1728184700
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
task_categories:
- translation
language:
- en
- fr
---
Contains paired English and French text from the Canadian Hansard (parliamentary debates) for the following sessions, downloaded on 2026-01-09:
* 45-1
* 44-1
* 43-2
* 43-1
* 42-1
* 41-2
* 41-1
* 40-3
* 40-2
* 40-1
* 39-2
* 39-1
* 38-1
Data was downloaded from the XML files, e.g.:
* https://www.ourcommons.ca/Content/House/451/Debates/072/HAN072-E.XML
* https://www.ourcommons.ca/Content/House/451/Debates/072/HAN072-F.XML
提供机构:
quickmt



