DISC-Law-SFT
收藏Opencsg2025-05-22 更新2025-05-03 收录
下载链接:
https://www.opencsg.com/datasets/AIWizards/DISC-Law-SFT
下载链接
链接失效反馈官方服务:
资源简介:
DISC-Law-SFT数据集专注于中文法律智能系统,旨在提升法律文本理解和生成能力。它包含DISC-Law-SFT-Pair和DISC-Law-SFT-Triplet两个子集,总规模约为40万条数据,主要覆盖法律信息抽取、法律判决预测、法律文档摘要和法律问答等多种法律场景。数据来源于法律专业领域,并经过了标注处理,可用于监督微调任务,以增强模型对外部法律知识的利用能力。该数据集采用Apache-2.0授权许可。
The DISC-Law-SFT dataset focuses on Chinese legal intelligent systems, aiming to enhance the capabilities of legal text understanding and generation. It contains two subsets, DISC-Law-SFT-Pair and DISC-Law-SFT-Triplet, with a total scale of approximately 400,000 data instances. It mainly covers various legal scenarios including legal information extraction, legal judgment prediction, legal document summarization, and legal question answering. The data is sourced from the legal professional domain and has been annotated, which can be used for supervised fine-tuning tasks to enhance the model's ability to leverage external legal knowledge. This dataset is licensed under Apache-2.0.
创建时间:
2024-07-19
搜集汇总
数据集介绍

背景与挑战
背景概述
DISC-Law-SFT是一个高质量的中文法律智能系统数据集,包含约40万条数据,覆盖法律信息抽取、判决预测、文档摘要和问答等多种场景,旨在提升模型的法律文本理解和生成能力。数据集分为Pair和Triplet两个子集,分别用于增强法律推理能力和外部法律知识利用能力。
以上内容由遇见数据集搜集并总结生成



