infinite-dataset-hub/ValidateInText
收藏Hugging Face2024-08-23 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/infinite-dataset-hub/ValidateInText
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
tags:
- infinite-dataset-hub
- synthetic
---
# ValidateInText
tags: validation, text-analysis, extract-evidence
_Note: This is an AI-generated dataset so its content may be inaccurate or false_
**Dataset Description:**
The 'ValidateInText' dataset comprises paragraphs of text extracted from research papers, articles, or reports, with the objective of identifying whether certain pieces of evidence are explicitly stated within the text. The dataset labels each entry with a binary 'evidence_present' indicator (0 for 'no', 1 for 'yes'), and also includes a column for 'extracted_evidences' where relevant evidence sentences are listed. This dataset is structured to aid in the development and evaluation of text analysis models, particularly those focused on evidence extraction and validation.
**CSV Content Preview:**
```csv
paragraph,evidence_present,extracted_evidences
"The results indicate a significant reduction in tumor size after treatment.","1","Results show a reduction in tumor size."
"There is no evidence to support the claim that the medication improves memory.","0","N/A"
"The study found a strong correlation between exercise and improved mental health.","1","A strong correlation between exercise and improved mental health."
"While the theory is intriguing, there is insufficient evidence to conclude its effectiveness.","0","Insufficient evidence to conclude effectiveness."
"The research conclusively demonstrates that the new drug has a positive impact on patient recovery.","1","The new drug has a positive impact on patient recovery."
```
This CSV preview contains a selection of five rows of data, each corresponding to a unique paragraph. It labels whether evidence is present and lists the extracted evidence if available. This format facilitates easy processing for machine learning models focused on text analysis tasks.
**Source of the data:**
The dataset was generated using the [Infinite Dataset Hub](https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub) and microsoft/Phi-3-mini-4k-instruct using the query 'paragraph, evidence is there or not, extracted evidences from paragraph':
- **Dataset Generation Page**: https://huggingface.co/spaces/infinite-dataset-hub/infinite-dataset-hub?q=paragraph,+evidence+is+there+or+not,+extracted+evidences+from+paragraph&dataset=ValidateInText&tags=validation,+text-analysis,+extract-evidence
- **Model**: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct
- **More Datasets**: https://huggingface.co/datasets?other=infinite-dataset-hub
提供机构:
infinite-dataset-hub



