SynthSTEL/styledistance_training_triplets_v2
收藏Hugging Face2024-07-22 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/SynthSTEL/styledistance_training_triplets_v2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是通过DataDreamer工具生成的合成数据集,包含320,400个样本。数据集的特征包括anchor、positive、negative、feature和feature_clean,均为字符串类型。数据集仅包含一个训练集分割,大小为119,247,863字节,下载大小为35,769,108字节。
This dataset is produced by DataDreamer, containing multiple features such as anchor, positive, negative, feature, feature_clean, all of which are string type. The dataset is divided into a training set, containing 320400 samples, with a total size of 119247863 bytes.
提供机构:
SynthSTEL
原始信息汇总
数据集概述
数据集信息
-
特征:
anchor: 类型为stringpositive: 类型为stringnegative: 类型为stringfeature: 类型为stringfeature_clean: 类型为string
-
分割:
train: 包含 320400 个样本,占用 119247863 字节
-
下载大小: 35769108 字节
-
数据集大小: 119247863 字节
配置
- 默认配置:
config_name:defaultdata_files:split:trainpath:data/train-*
标签
datadreamerdatadreamer-0.20.0synthetic



