dgambettaphd/D_llm3_gen1_run0_W_doc1000_synt64_RANDOM
收藏Hugging Face2025-04-08 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/dgambettaphd/D_llm3_gen1_run0_W_doc1000_synt64_RANDOM
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如唯一标识符(id)、文本内容(text)、数据集来源(dataset)、生成方式(gen)、句法特征(synt)以及三种概率值(TPP、MPP、FTP)。数据集被划分为训练集(train),包含5000个样本,大小为26827394字节。数据集的下载大小为15555415字节。提供了默认配置,指定了训练集的数据文件路径。
The dataset includes multiple fields such as unique identifier (id), text content (text), dataset source (dataset), generation method (gen), syntactic features (synt), and three probability values (TPP, MPP, FTP). The dataset is split into a training set (train) containing 5000 samples, with a size of 26827394 bytes. The download size of the dataset is 15555415 bytes. A default configuration is provided, specifying the data file path for the training set.
提供机构:
dgambettaphd



