Leonardo6/crosscoder-llama-3.2-1b-diff
Source: Hugging Face. Updated: 2024-12-03. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/Leonardo6/crosscoder-llama-3.2-1b-diff
Resource description:
The dataset has two features: input_ids, a sequence of integers, and original_text, the original text string. It has a single training split (train) with 24,534 examples. The dataset size is 391,513,728 bytes and the download size is 185,147,529 bytes. The configuration maps the train split to the data files at data/train-*.
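As a rough illustration of the figures above, the following Python sketch derives the average size per example and the download-to-dataset size ratio, and shows one way the train split might be loaded. It assumes the Hugging Face `datasets` library is installed and that the repository is publicly reachable; the helper names `size_summary` and `load_train_split` are hypothetical and are not part of the dataset or the library.

```python
# Figures copied from the resource description above.
REPO_ID = "Leonardo6/crosscoder-llama-3.2-1b-diff"
NUM_EXAMPLES = 24_534
DATASET_BYTES = 391_513_728
DOWNLOAD_BYTES = 185_147_529


def size_summary():
    """Derive simple statistics from the published sizes (hypothetical helper)."""
    return {
        "avg_bytes_per_example": DATASET_BYTES / NUM_EXAMPLES,
        "download_ratio": DOWNLOAD_BYTES / DATASET_BYTES,
    }


def load_train_split():
    """Load the train split (hypothetical helper).

    Assumes `pip install datasets` and network access to the Hugging Face Hub.
    The import is kept local so the size arithmetic works without the library.
    """
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train")


if __name__ == "__main__":
    summary = size_summary()
    print(f"average bytes per example: {summary['avg_bytes_per_example']:.0f}")
    print(f"download / dataset size:   {summary['download_ratio']:.2%}")
    # Uncomment to fetch the data (about 185 MB download):
    # train = load_train_split()
    # print(train.column_names)
```

By this arithmetic each example averages about 16 KB, and the download is a little under half the size of the dataset.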
Provider:
Leonardo6



