Yusser/vi_sae_wiki_tokenized
Source: Hugging Face. Updated: 2025-03-11. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/Yusser/vi_sae_wiki_tokenized
Resource summary:
The dataset has a single feature, input_ids, which is a sequence of integers. It consists of one split, the training set, with 347,818 examples. The dataset size is 1,426,053,800 bytes and the download size is 690,813,596 bytes. The default configuration specifies the path to the training set data files.
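As a quick sanity check on the figures above, the sketch below derives the average bytes per example and the ratio of download size to dataset size. The constants come from this listing. The helper `load_train_split` is a hypothetical convenience wrapper, not something the dataset author provides; it assumes the Hugging Face `datasets` library is installed and that the repository is reachable under the ID shown in the title.

```python
# Figures quoted in the resource summary.
NUM_EXAMPLES = 347_818
DATASET_SIZE_BYTES = 1_426_053_800
DOWNLOAD_SIZE_BYTES = 690_813_596

# Repository ID taken from the listing title.
REPO_ID = "Yusser/vi_sae_wiki_tokenized"

# Average storage per example; the division happens to be exact.
bytes_per_example, remainder = divmod(DATASET_SIZE_BYTES, NUM_EXAMPLES)

# Fraction of the dataset size that must actually be downloaded.
download_ratio = DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES


def load_train_split(streaming: bool = True):
    """Hypothetical helper: load the train split with the `datasets` library.

    Streaming avoids fetching the full download before the first example
    can be read. Requires network access and `pip install datasets`.
    """
    from datasets import load_dataset  # imported lazily; optional dependency

    return load_dataset(REPO_ID, split="train", streaming=streaming)


if __name__ == "__main__":
    print(f"bytes per example: {bytes_per_example} (remainder {remainder})")
    print(f"download / dataset size: {download_ratio:.3f}")
```

The dataset size divides evenly into 4,100 bytes per example, which is consistent with every example holding an input_ids sequence of the same fixed length. The listing does not state the integer width or the storage overhead, so the exact sequence length cannot be derived from these numbers alone.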
Provider:
Yusser



