mtl-dev/semantic-sim-unlabeled-shuffled-batchs-1-1000
收藏Hugging Face2024-10-14 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/mtl-dev/semantic-sim-unlabeled-shuffled-batchs-1-1000
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本、相似度分数、标签和报告名称等特征。数据集被分为训练集、测试集和验证集,其中训练集包含1600000个示例,测试集和验证集各包含200000个示例。数据集的下载大小为210542862字节,总大小为493219658字节。数据文件的路径分别为data/train-*、data/test-*和data/validation-*。
The dataset includes features such as text, similarity score, label, and report name. It is divided into training, test, and validation sets, with the training set containing 1,600,000 examples, and the test and validation sets each containing 200,000 examples. The download size of the dataset is 210,542,862 bytes, and the total size is 493,219,658 bytes. The data files are located at data/train-*, data/test-*, and data/validation-*.
提供机构:
mtl-dev



