infinite-dataset-hub/EvidentialTextClass
收藏Hugging Face2024-08-23 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/infinite-dataset-hub/EvidentialTextClass
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
tags:
- infinite-dataset-hub
- synthetic
---
# EvidentialTextClass
tags: classification, forensics, legal
_Note: This is an AI-generated dataset so its content may be inaccurate or false_
**Dataset Description:**
The 'EvidentialTextClass' dataset is a curated collection of text excerpts from various sources, including legal documents, case reports, and online articles, which have been annotated by legal experts to indicate whether the text contains evidence related to a legal case. The labels used in the dataset include 'Evidence' for texts that are directly related to legal evidence, and 'Not Evidence' for texts that do not pertain to legal evidence. This dataset is useful for developing machine learning models aimed at text classification tasks within the legal domain, helping in automating the process of identifying potentially relevant documents in legal proceedings.
**CSV Content Preview:**
```
TextID,Text,Label
1,"The defendant's fingerprints were found on the murder weapon, indicating his involvement in the crime.","Evidence"
2,"The suspect was seen at a different location during the time of the crime.","Not Evidence"
3,"The weather on the day of the incident was unusually rainy.","Not Evidence"
4,"According to the eyewitness, the accused had a heated argument with the victim prior to the incident.","Evidence"
5,"The accused was previously convicted for similar crimes in another state.","Not Evidence"
```
**Source of the data:**
The dataset was generated using the [Infinite Dataset Hub](https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub) and microsoft/Phi-3-mini-4k-instruct using the query 'Evidence or not text classification ':
- **Dataset Generation Page**: https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub?q=Evidence+or+not+text+classification+&dataset=EvidentialTextClass&tags=classification,+forensics,+legal
- **Model**: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct
- **More Datasets**: https://huggingface.co/datasets?other=infinite-dataset-hub
许可证:MIT
标签:
- 无限数据集中心(Infinite Dataset Hub)
- 合成数据集
# 证据文本分类(EvidentialTextClass)
标签:分类、取证、法律
_注意:本数据集由AI生成,内容可能存在不准确或虚假情况_
**数据集描述:**
“证据文本分类(EvidentialTextClass)”数据集是经精选整理的文本节选合集,来源涵盖法律文书、案件报告与网络文章,所有文本均由法律专家标注,用以标识文本是否包含与法律案件相关的证据。数据集采用两类标签:「含证据(Evidence)」用于标记与法律证据直接相关的文本,「不含证据(Not Evidence)」用于标记与法律证据无关的文本。本数据集可用于开发法律领域的文本分类机器学习模型,助力自动化识别法律程序中潜在相关文档的流程。
**CSV内容预览:**
TextID,Text,Label
1,"The defendant's fingerprints were found on the murder weapon, indicating his involvement in the crime.","Evidence"
2,"The suspect was seen at a different location during the time of the crime.","Not Evidence"
3,"The weather on the day of the incident was unusually rainy.","Not Evidence"
4,"According to the eyewitness, the accused had a heated argument with the victim prior to the incident.","Evidence"
5,"The accused was previously convicted for similar crimes in another state.","Not Evidence"
**数据来源:**
本数据集通过[无限数据集中心(Infinite Dataset Hub)](https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub)与微软(Microsoft)Phi-3-mini-4k-instruct模型,以“文本是否属于证据分类”为查询指令生成。
- **数据集生成页面**:https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub?q=Evidence+or+not+text+classification+&dataset=EvidentialTextClass&tags=classification,+forensics,+legal
- **所用模型**:https://huggingface.co/microsoft/Phi-3-mini-4k-instruct
- **更多数据集**:https://huggingface.co/datasets?other=infinite-dataset-hub
提供机构:
infinite-dataset-hub



