dgambettaphd/D_llm3_run0_gen20_WXS_doc1000_synt64_lr1e-04_acm_SYNLAST
收藏Hugging Face2025-10-10 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/dgambettaphd/D_llm3_run0_gen20_WXS_doc1000_synt64_lr1e-04_acm_SYNLAST
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文档ID(id_doc)、文本内容(text)、数据集来源(dataset)、生成方式(gen)、句法分析标记(synt)以及一个名为MPP的浮点数值。数据集分为训练集(train),共有36000个样本。数据集的下载大小为13373363字节,解压后大小为22914724字节。
The dataset includes document ID (id_doc), text content (text), dataset source (dataset), generation method (gen), syntactic analysis tags (synt), and a floating-point value named MPP. The dataset is split into a training set (train) with a total of 36,000 samples. The download size of the dataset is 13,373,363 bytes, and the decompressed size is 22,914,724 bytes.
提供机构:
dgambettaphd



