sarahooker/legal-qa-pairs
收藏Hugging Face2026-04-21 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/sarahooker/legal-qa-pairs
下载链接
链接失效反馈官方服务:
资源简介:
---
annotations_creators: []
language:
- en
language_creators: []
license: []
multilinguality:
- monolingual
pretty_name: 'legal_qa_pairs'
size_categories:
- 10K<n<100K
source_datasets:
- 'original'
tags:
- adaption
- instruction-tuning
- legal
- governance
- language
task_categories: []
task_ids: []
---

This dataset is a remastered version prepared using [Adaption's](https://adaptionlabs.ai/app/auth) Adaptive Data platform.
# legal_qa_pairs
This dataset consists of question-and-answer pairs focused on various legal topics, including contract law, self-defense, property rights, and constitutional issues. Each sample features a user prompt describing a specific legal scenario or inquiry, followed by a detailed completion providing legal analysis, relevant statutes, or case law precedents. The content covers jurisdictions such as the US, UK, and international contexts, offering educational insights into legal reasoning and obligations.
### Dataset size
There are 18,378 data points in this dataset. This is an instruction tuning dataset.
### Quality of Remastered Dataset
The final quality is B, with a relative quality improvement of 48.3%.
### Domain
- Legal (92%)
- Governance (2%)
- Language (2%)
### Language
- English (100%)
### Tone
- Cautious (28%)
- Analytical (28%)
- Explanatory (12%)
### Evaluation Results
- **Quality Gains:**
<img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/6808f358-2af4-410e-91a1-3bedee08ef1f.png" alt="QualityGains" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />
- **Grade Improvement:**
<img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/c862f605-2071-43a0-902f-8790eafcc579.png" alt="Grade" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />
- **Percentile Chart:**
<img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/ba89dd25-521b-4377-a165-41d059b80b04.png" alt="Percentile Chart" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />
注释创建者:无
语言:英语
语言创建者:无
许可证:无
多语言属性:单语言
展示名称:legal_qa_pairs
数据量范围:10000 < 样本数 < 100000
源数据集:原创数据集
标签:
- Adaption
- 指令微调(instruction-tuning)
- 法律
- 治理
- 语言
任务类别:无
任务子类别:无

本数据集为经[Adaption](https://adaptionlabs.ai/app/auth)的自适应数据平台(Adaptive Data platform)重构优化后的版本。
# 法律问答对(legal_qa_pairs)
本数据集包含聚焦各类法律主题的问答样本,涵盖合同法、正当防卫、财产权以及宪法议题等内容。每条样本均包含一段用户提示,用于描述特定法律场景或法律疑问,随后附带一段详尽的补全内容,提供法律分析、相关法条或判例先例。数据集内容覆盖美国、英国及国际等多个法域,可为法律推理与法律责任相关的学习提供专业见解。
### 数据集规模
本数据集共包含18378条样本,属于指令微调(instruction tuning)数据集。
### 重构数据集的质量
最终质量评级为B级,相对质量提升幅度达48.3%。
### 领域分布
- 法律(92%)
- 治理(2%)
- 语言(2%)
### 语言属性
- 英语(100%)
### 语气风格
- 审慎严谨(28%)
- 分析论证(28%)
- 解释说明(12%)
### 评估结果
- **质量提升情况**
<img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/6808f358-2af4-410e-91a1-3bedee08ef1f.png" alt="质量提升曲线" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />
- **评级提升情况**
<img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/c862f605-2071-43a0-902f-8790eafcc579.png" alt="评级提升曲线" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />
- **百分位排名图表**
<img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/ba89dd25-521b-4377-a165-41d059b80b04.png" alt="百分位排名图表" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />
提供机构:
sarahooker



