five

Contract Discovery

收藏
arXiv2020-10-08 更新2024-06-21 收录
下载链接:
https://github.com/applicaai/contract-discovery
下载链接
链接失效反馈
官方服务:
资源简介:
Contract Discovery数据集由波兰华沙的Applica.ai机构创建,专注于从法律文档中提取条款,以支持自动化合同发现任务。该数据集包含约2500个条款,来源于美国EDGAR数据库的债券发行说明书和保密协议文档,以及英国慈善机构年度报告。数据集旨在通过提供自然语言流,缺乏正式结构,模拟真实世界的使用场景,支持查询示例场景中的跨文档条款识别。该数据集适用于少样本学习设置,旨在通过多示例查询系统提高法律信息检索的效率和准确性。

The Contract Discovery dataset was created by Applica.ai, an institution based in Warsaw, Poland, which focuses on extracting clauses from legal documents to support automated contract discovery tasks. It contains approximately 2,500 clauses sourced from bond offering prospectuses and confidentiality agreement documents in the U.S. EDGAR database, as well as annual reports of UK charities. The dataset is designed to simulate real-world usage scenarios by providing unstructured natural language streams, supporting cross-document clause identification in query example scenarios. This dataset is suitable for few-shot learning settings, aiming to improve the efficiency and accuracy of legal information retrieval through multi-example query systems.
提供机构:
Applica.ai, 华沙, 波兰
创建时间:
2019-11-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作