tim-lawson/mlsae-pythia-70m-deduped-x64-k16-examples
收藏Hugging Face2024-09-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/tim-lawson/mlsae-pythia-70m-deduped-x64-k16-examples
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如id、latent、layer、token_id、token、act、token_ids、tokens和acts,这些特征的数据类型分别为int64、int64、int64、int64、string、float64、string、string和string。数据集只有一个训练集(train),包含3,194,088个样本,总大小为750,124,384字节,下载大小为137,837,555字节。配置信息中指定了默认配置,数据文件路径为data/train-*。
The dataset includes multiple feature fields such as id, latent, layer, token_id, token, act, etc., with data types covering integers and strings. The dataset is split into a training set, containing 3,194,088 samples with a total size of 750,124,384 bytes. The default configuration of the dataset points to the path of the training data files.
提供机构:
tim-lawson



