DT4LM/qqp_adv_deberta_leap
收藏Hugging Face2024-07-21 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/DT4LM/qqp_adv_deberta_leap
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个问题(question1和question2)以及一个标签(label),用于表示两个问题之间的关系。数据集分为训练集、验证集和测试集,分别包含335440、40428和40428个样本。训练集大小为45680437字节,验证集大小为5503609字节,测试集大小为5493911字节。整个数据集的下载大小为36225461字节,总大小为56677957字节。
This dataset is primarily used for natural language processing tasks, specifically for evaluating and training models on the ability to determine if two questions are similar. It includes two question fields (question1 and question2) and a label field (label), which indicates whether the two questions are similar. The dataset is divided into training, validation, and test sets, which are used for training, validation, and testing of the models, respectively.
提供机构:
DT4LM
原始信息汇总
数据集概述
特征
- question1: 字符串类型
- question2: 字符串类型
- label: 64位整数类型
数据分割
- train:
- 字节数: 45680437.0
- 样本数: 335440
- validation:
- 字节数: 5503609
- 样本数: 40428
- test:
- 字节数: 5493911
- 样本数: 40428
数据大小
- 下载大小: 36225461 字节
- 数据集总大小: 56677957.0 字节
配置
- config_name: default
- 数据文件路径:
- train: data/train-*
- validation: data/validation-*
- test: data/test-*
- 数据文件路径:



