five

Pile-FreeLaw

收藏
魔搭社区2025-09-28 更新2024-08-31 收录
下载链接:
https://modelscope.cn/datasets/OmniData/Pile-FreeLaw
下载链接
链接失效反馈
官方服务:
资源简介:
displayName: Pile-FreeLaw license: - MIT taskTypes: - Natural Language Generation - Language Modelling mediaTypes: - Text labelTypes: - English Corpus tags: [] publisher: - EleutherAI publishDate: '2023-07-18' publishUrl: https://pile.eleuther.ai/ paperUrl: '' --- # 数据介绍 ## 简介 Pile-FreeLaw数据集是The Pile项目的一部分,它是一个面向法律文本的开放源代码数据集。该数据集旨在提供大量的法律相关文本,以支持自然语言处理和机器学习研究。 Pile-FreeLaw数据集中包含了来自各种法律文档的文本,包括法律条款、法规、法庭判决、法律评论等。这些文本涵盖了各个法律领域,包括刑法、民法、商法、行政法等。 数据集的目的是为研究人员和开发者提供一个丰富的法律文本资源,以便用于文本分析、信息提取、问答系统、法律智能助手等应用的开发和训练。 ## 数据内容 ### 数据说明 Pile-FreeLaw数据集涵盖了50.1G的数据。 ### 数据示例 ``` { "id": "257999335", "source_id": "", "doc_id": "193256024", "data_type": "text", "data_source": "pile", "data_url": "enwiki-c4-pile-ccnews", "content": "\n557 S.E.2d 531 (2001)\n354 N.C. 368\nSTATE of North Carolina\nv.\nMichael NOLEN.\nNo. 391P01.\nSupreme Court of North Carolina.\nNovember 8, 2001.\nLisa Miles, Durham, for Michael Nolen.\nThomas F. Moffitt, Raleigh, Rex Gore, District Attorney, for State of North Carolina.\nPrior report: 144 N.CApp. 172, 550 S.E.2d 783.\n\nORDER\nUpon consideration of the notice of appeal from the North Carolina Court of Appeals, filed by the Defendant in this matter pursuant to G.S. 7A-30, and the motion to dismiss the appeal for lack of substantial constitutional *532 question filed by the Attorney General, the following order was entered and is hereby certified to the North Carolina Court of Appeals: the motion to dismiss the appeal is\n\"Allowed by order of the Court in conference, this the 8th day of November 2001.\"\nUpon consideration of the petition filed by Defendant in this matter for discretionary review of the decision of the North Carolina Court of Appeals pursuant to G.S. 7A-31, the following order was entered and is hereby certified to the North Carolina Court of Appeals:\n\"Denied by order of the Court in conference, this the 8th day of November 2001.\"\n", "remark": { "pile_set_name": "FreeLaw" }, "sub_path": "freelaw/train" } ``` ## 引文 ``` @misc{conghui2022opendatalab, title={OpenDataLab: Empowering General Artificial Intelligence with Open Datasets}, author={Conghui He, Wei Li, Zhenjiang Jin, Bin Wang, Chao Xu, Dahua Lin}, journal={https://opendatalab.com/}, year={2022} } ``` ## Download dataset :modelscope-code[]{type="git"}

displayName: Pile-FreeLaw license: - MIT taskTypes: - 自然语言生成 - 语言建模 mediaTypes: - 文本 labelTypes: - 英语语料库 tags: [] publisher: - EleutherAI publishDate: '2023-07-18' publishUrl: https://pile.eleuther.ai/ paperUrl: '' --- # 数据集介绍 ## 简介 Pile-FreeLaw数据集隶属于The Pile项目,是一款面向法律文本的开源数据集,旨在汇聚海量法律相关文本,为自然语言处理与机器学习研究提供支撑。 本数据集收录了各类法律文档文本,涵盖法律条款、法律法规、法庭判决、法律评论等内容,涉及刑法、民法、商法、行政法等多个法律领域。 其核心目标是为研究人员与开发者提供充足的法律文本资源,用于文本分析、信息抽取、问答系统、法律AI智能体(AI Agent)等应用的开发与训练。 ## 数据集内容 ### 数据集说明 Pile-FreeLaw数据集规模达50.1吉字节(GB)。 ### 数据集示例 { "id": "257999335", "source_id": "", "doc_id": "193256024", "data_type": "text", "data_source": "pile", "data_url": "enwiki-c4-pile-ccnews", "content": " 557 S.E.2d 531 (2001) 354 N.C. 368 STATE of North Carolina v. Michael NOLEN. No. 391P01. Supreme Court of North Carolina. November 8, 2001. Lisa Miles, Durham, for Michael Nolen. Thomas F. Moffitt, Raleigh, Rex Gore, District Attorney, for State of North Carolina. Prior report: 144 N.CApp. 172, 550 S.E.2d 783. ORDER Upon consideration of the notice of appeal from the North Carolina Court of Appeals, filed by the Defendant in this matter pursuant to G.S. 7A-30, and the motion to dismiss the appeal for lack of substantial constitutional *532 question filed by the Attorney General, the following order was entered and is hereby certified to the North Carolina Court of Appeals: the motion to dismiss the appeal is "Allowed by order of the Court in conference, this the 8th day of November 2001." Upon consideration of the petition filed by Defendant in this matter for discretionary review of the decision of the North Carolina Court of Appeals pursuant to G.S. 7A-31, the following order was entered and is hereby certified to the North Carolina Court of Appeals: "Denied by order of the Court in conference, this the 8th day of November 2001." ", "remark": { "pile_set_name": "FreeLaw" }, "sub_path": "freelaw/train" } ## 引用文献 @misc{conghui2022opendatalab, title={OpenDataLab: Empowering General Artificial Intelligence with Open Datasets}, author={Conghui He, Wei Li, Zhenjiang Jin, Bin Wang, Chao Xu, Dahua Lin}, journal={https://opendatalab.com/}, year={2022} } ## 数据集下载 :modelscope-code[]{type="git"}
提供机构:
maas
创建时间:
2024-07-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作