thangvip/thuvienphapluat-qa-normalize
收藏Hugging Face2024-04-10 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/thangvip/thuvienphapluat-qa-normalize
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个配置,每个配置都有相同的特征:标题(title)、问题(question)、内容(content)和标准化答案(normalize_answer)。每个配置的训练集包含1000个样本,数据集的大小和下载大小因配置不同而有所变化。
提供机构:
thangvip
原始信息汇总
数据集概述
数据集配置
| 配置名称 | 特征 | 训练集信息 |
|---|---|---|
| 0_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 10524665, num_examples: 1000 |
| 10_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 9191019, num_examples: 1000 |
| 11_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 7321159, num_examples: 1000 |
| 12_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 2924615, num_examples: 1000 |
| 13_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 2929844, num_examples: 1000 |
| 14_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 3604661, num_examples: 1000 |
| 15_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 6795095, num_examples: 1000 |
| 1_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 3401608, num_examples: 1000 |
| 2_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 3752252, num_examples: 1000 |
| 3_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 4094104, num_examples: 1000 |
| 4_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 4708074, num_examples: 1000 |
| 5_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 5070718, num_examples: 1000 |
| 6_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 3794901, num_examples: 1000 |
| 7_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 3920479, num_examples: 1000 |
| 8_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 8851748, num_examples: 1000 |
| 9_set | title: string, question: string, content: string, normalize_answer: string | num_bytes: 10266350, num_examples: 1000 |
| default | title: string, question: string, content: string, normalize_answer: string | num_bytes: 120745, num_examples: 10 |
数据集文件路径
| 配置名称 | 训练集路径 |
|---|---|
| 0_set | 0_set/train-* |
| 10_set | 10_set/train-* |
| 11_set | 11_set/train-* |
| 12_set | 12_set/train-* |
| 13_set | 13_set/train-* |
| 14_set | 14_set/train-* |
| 15_set | 15_set/train-* |
| 1_set | 1_set/train-* |
| 2_set | 2_set/train-* |
| 3_set | 3_set/train-* |
| 4_set | 4_set/train-* |
| 5_set | 5_set/train-* |
| 6_set | 6_set/train-* |
| 7_set | 7_set/train-* |
| 8_set | 8_set/train-* |
| 9_set | 9_set/train-* |
| default | data/train-* |



