lxyuan/synthetic-nli-triplet
Hugging Face, updated 2024-09-25, indexed 2024-12-14
Download link:
https://hf-mirror.com/datasets/lxyuan/synthetic-nli-triplet
Description:
The synthetic-nli-triplet dataset was generated by iteratively sampling triplets from the existing nli-triplet dataset. Each sampled triplet was used as input to prompt the TheBloke/Mistral-7B-Instruct-v0.1-GPTQ model, which generated new synthetic triplets, expanding the data available for Natural Language Inference (NLI) tasks. The dataset consists of a training set with three features (anchor, positive, and negative) and 4,593,633 rows.
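As a minimal sketch of how the training split might be read, the snippet below streams a few rows through the Hugging Face `datasets` library rather than downloading all 4,593,633 rows. The repository ID comes from the download link and the column names from the description. The helper names `clean_triplets` and `stream_triplets`, the assumption that every column holds a plain string, and the decision to skip empty or malformed rows are illustrative choices, not part of the dataset's documentation.

```python
from typing import Iterable, Iterator, Mapping, Tuple

REPO_ID = "lxyuan/synthetic-nli-triplet"        # taken from the download link
COLUMNS = ("anchor", "positive", "negative")    # features named in the description


def clean_triplets(rows: Iterable[Mapping]) -> Iterator[Tuple[str, str, str]]:
    """Yield (anchor, positive, negative) tuples, skipping malformed rows.

    Assumption: each column is a string. Rows with a missing, non-string,
    or blank field are dropped, since generated data may contain such rows.
    """
    for row in rows:
        values = [row.get(column) for column in COLUMNS]
        if all(isinstance(v, str) and v.strip() for v in values):
            yield tuple(v.strip() for v in values)


def stream_triplets(limit: int = 5):
    """Fetch the first `limit` rows of the train split in streaming mode."""
    # Imported lazily so clean_triplets works without the `datasets` package.
    from datasets import load_dataset

    ds = load_dataset(REPO_ID, split="train", streaming=True)
    return list(clean_triplets(ds.take(limit)))
```

Calling `stream_triplets(3)` requires network access and the `datasets` package; `clean_triplets` can be applied to any iterable of row dictionaries, for example before passing triplets to a contrastive training loop.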
Provider:
lxyuan



