five

sarahooker/legal-qa-pairs

收藏
Hugging Face2026-04-21 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/sarahooker/legal-qa-pairs
下载链接
链接失效反馈
官方服务:
资源简介:
--- annotations_creators: [] language: - en language_creators: [] license: [] multilinguality: - monolingual pretty_name: 'legal_qa_pairs' size_categories: - 10K<n<100K source_datasets: - 'original' tags: - adaption - instruction-tuning - legal - governance - language task_categories: [] task_ids: [] --- ![banner](https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/ac912029-7445-49dc-8b08-a7569191237e.png) This dataset is a remastered version prepared using [Adaption's](https://adaptionlabs.ai/app/auth) Adaptive Data platform. # legal_qa_pairs This dataset consists of question-and-answer pairs focused on various legal topics, including contract law, self-defense, property rights, and constitutional issues. Each sample features a user prompt describing a specific legal scenario or inquiry, followed by a detailed completion providing legal analysis, relevant statutes, or case law precedents. The content covers jurisdictions such as the US, UK, and international contexts, offering educational insights into legal reasoning and obligations. ### Dataset size There are 18,378 data points in this dataset. This is an instruction tuning dataset. ### Quality of Remastered Dataset The final quality is B, with a relative quality improvement of 48.3%. ### Domain - Legal (92%) - Governance (2%) - Language (2%) ### Language - English (100%) ### Tone - Cautious (28%) - Analytical (28%) - Explanatory (12%) ### Evaluation Results - **Quality Gains:** <img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/6808f358-2af4-410e-91a1-3bedee08ef1f.png" alt="QualityGains" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" /> - **Grade Improvement:** <img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/c862f605-2071-43a0-902f-8790eafcc579.png" alt="Grade" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" /> - **Percentile Chart:** <img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/ba89dd25-521b-4377-a165-41d059b80b04.png" alt="Percentile Chart" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />

注释创建者:无 语言:英语 语言创建者:无 许可证:无 多语言属性:单语言 展示名称:legal_qa_pairs 数据量范围:10000 < 样本数 < 100000 源数据集:原创数据集 标签: - Adaption - 指令微调(instruction-tuning) - 法律 - 治理 - 语言 任务类别:无 任务子类别:无 ![横幅图片](https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/ac912029-7445-49dc-8b08-a7569191237e.png) 本数据集为经[Adaption](https://adaptionlabs.ai/app/auth)的自适应数据平台(Adaptive Data platform)重构优化后的版本。 # 法律问答对(legal_qa_pairs) 本数据集包含聚焦各类法律主题的问答样本,涵盖合同法、正当防卫、财产权以及宪法议题等内容。每条样本均包含一段用户提示,用于描述特定法律场景或法律疑问,随后附带一段详尽的补全内容,提供法律分析、相关法条或判例先例。数据集内容覆盖美国、英国及国际等多个法域,可为法律推理与法律责任相关的学习提供专业见解。 ### 数据集规模 本数据集共包含18378条样本,属于指令微调(instruction tuning)数据集。 ### 重构数据集的质量 最终质量评级为B级,相对质量提升幅度达48.3%。 ### 领域分布 - 法律(92%) - 治理(2%) - 语言(2%) ### 语言属性 - 英语(100%) ### 语气风格 - 审慎严谨(28%) - 分析论证(28%) - 解释说明(12%) ### 评估结果 - **质量提升情况** <img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/6808f358-2af4-410e-91a1-3bedee08ef1f.png" alt="质量提升曲线" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" /> - **评级提升情况** <img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/c862f605-2071-43a0-902f-8790eafcc579.png" alt="评级提升曲线" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" /> - **百分位排名图表** <img src="https://proteus-prod-public.s3.us-east-1.amazonaws.com/temp/ba89dd25-521b-4377-a165-41d059b80b04.png" alt="百分位排名图表" style="max-width: 50%; display: block; margin-left: auto; margin-right: auto;" />
提供机构:
sarahooker
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作