infinite-dataset-hub/UncommonTermsLearningSet
Source: Hugging Face. Updated: 2024-08-28. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/infinite-dataset-hub/UncommonTermsLearningSet
Description:
---
license: mit
tags:
- infinite-dataset-hub
- synthetic
---
# UncommonTermsLearningSet
tags: machine learning, contract analysis, unique contracts
_Note: This is an AI-generated dataset so its content may be inaccurate or false_
**Dataset Description:** The 'UncommonTermsLearningSet' dataset is designed for machine learning applications in contract analysis, specifically the identification and classification of non-standard contractual terms. Each entry is a snippet of contract text, labeled by the type of term it contains. The dataset is meant to help train models to recognize and understand unconventional contractual language, which could be crucial for legal professionals and AI systems working in contract management.
**CSV Content Preview:**
```
id,text,label
1,"This agreement shall remain effective even if Party A relocates to a non-commercial zone",StandardTerms
2,"Party B shall not be held liable for losses caused by natural disasters, Act of God",ExclusionsAndLimitations
3,"The commencement date of this agreement is subject to the mutual agreement of both parties",TermsOfCommencement
4,"Party A agrees to provide confidential information to Party B under a non-disclosure agreement, to be determined by mutual consent",ConfidentialityClause
5,"This contract will be governed by the laws of the state of Nevada, regardless of the location of the parties",GoverningLaw
```
The preview shows five rows with three columns. The 'id' column is a unique identifier for each entry, the 'text' column holds the snippet of contract language, and the 'label' column assigns each snippet a term category (for example, StandardTerms or GoverningLaw). The 'label' values were deliberately chosen to represent non-standard terms without using the exact query phrases, aiming for diversity and relevance to contract analysis tasks.
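As a minimal sketch of how the columns described above might be consumed, the following parses the five preview rows with Python's standard `csv` module and builds a label-to-index mapping of the kind a classifier would need. The names `PREVIEW_CSV`, `load_rows`, and `label_to_index` are illustrative assumptions, not part of the dataset or its tooling, and the rows are copied from the preview rather than downloaded.

```python
import csv
import io
from collections import Counter

# The five preview rows from the dataset card, copied verbatim.
PREVIEW_CSV = '''id,text,label
1,"This agreement shall remain effective even if Party A relocates to a non-commercial zone",StandardTerms
2,"Party B shall not be held liable for losses caused by natural disasters, Act of God",ExclusionsAndLimitations
3,"The commencement date of this agreement is subject to the mutual agreement of both parties",TermsOfCommencement
4,"Party A agrees to provide confidential information to Party B under a non-disclosure agreement, to be determined by mutual consent",ConfidentialityClause
5,"This contract will be governed by the laws of the state of Nevada, regardless of the location of the parties",GoverningLaw
'''


def load_rows(csv_text):
    """Parse CSV text into a list of dicts, converting 'id' to int."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {"id": int(row["id"]), "text": row["text"], "label": row["label"]}
        for row in reader
    ]


rows = load_rows(PREVIEW_CSV)

# Count how often each label occurs (each appears once in the preview).
label_counts = Counter(row["label"] for row in rows)

# Hypothetical encoding: map each label to an integer index, sorted by name.
label_to_index = {label: i for i, label in enumerate(sorted(label_counts))}

for row in rows:
    print(row["id"], label_to_index[row["label"]], row["label"])
```

Because the 'text' values are quoted, commas inside a snippet (as in row 2) stay within a single field, which is why a CSV parser is used here instead of splitting each line on commas.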
**Source of the data:**
The dataset was generated using the [Infinite Dataset Hub](https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub) and microsoft/Phi-3-mini-4k-instruct using the query 'Contracts Non Standards terms':
- **Dataset Generation Page**: https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub?q=Contracts+Non+Standards+terms&dataset=UncommonTermsLearningSet&tags=machine+learning,+contract+analysis,+unique+contracts
- **Model**: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct
- **More Datasets**: https://huggingface.co/datasets?other=infinite-dataset-hub
Provider:
infinite-dataset-hub



