gokulsrinivasagan/processed_wikitext-103-raw-v1-ld-5
收藏Hugging Face2024-11-18 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/gokulsrinivasagan/processed_wikitext-103-raw-v1-ld-5
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征字段,如input_ids、attention_mask、special_tokens_mask和lda_lables,分别表示输入ID、注意力掩码、特殊标记掩码和LDA标签。数据集被分为训练集、测试集和验证集,其中训练集包含228,639个样本,测试集包含549个样本,验证集包含479个样本。数据集的下载大小为249,431,278字节,总大小为718,398,376字节。
The dataset includes multiple feature fields such as input_ids, attention_mask, special_tokens_mask, and lda_lables, representing input IDs, attention masks, special token masks, and LDA labels, respectively. The dataset is divided into training, test, and validation sets, with the training set containing 228,639 samples, the test set containing 549 samples, and the validation set containing 479 samples. The download size of the dataset is 249,431,278 bytes, and the total size is 718,398,376 bytes.
提供机构:
gokulsrinivasagan



