kothasuhas/llama-3b-gold-15M-student-generations_SNIS_2048_tune422v1_N15.00M_T4.0
Source: Hugging Face. Updated: 2025-04-26. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/kothasuhas/llama-3b-gold-15M-student-generations_SNIS_2048_tune422v1_N15.00M_T4.0
Description:
This dataset contains text data with four fields: text content (text), log weight (log_weight), scaled sampling probability (sampling_p_scaled), and temperature-scaled sampling probability (sampling_p_temperature_scaled). It is split into a training set of 15 million samples and a validation set of 1,000 samples.
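The record layout described above can be sketched in Python as follows. This is a minimal illustration, not the dataset author's code: the field names come from the listing, but the Python types (a string plus three floats), the split name "train", and the helper names `validate_record` and `stream_split` are assumptions made here. `stream_split` relies on the Hugging Face `datasets` package and network access, so it is defined but not called.

```python
from typing import Any, Dict

# Repository ID as given in the listing.
REPO_ID = (
    "kothasuhas/llama-3b-gold-15M-student-generations_"
    "SNIS_2048_tune422v1_N15.00M_T4.0"
)

# Field names are from the listing; the Python types are assumptions.
EXPECTED_FIELDS = {
    "text": str,
    "log_weight": float,
    "sampling_p_scaled": float,
    "sampling_p_temperature_scaled": float,
}


def validate_record(record: Dict[str, Any]) -> bool:
    """Return True if the record has exactly the four fields with the assumed types."""
    if set(record) != set(EXPECTED_FIELDS):
        return False
    return all(
        isinstance(record[name], expected_type)
        for name, expected_type in EXPECTED_FIELDS.items()
    )


def stream_split(split: str = "train"):
    """Stream one split from the Hugging Face Hub without downloading all of it.

    The split name is an assumption; check the dataset card for the real names.
    """
    # Imported lazily so the rest of this sketch runs without the package.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split, streaming=True)


if __name__ == "__main__":
    # Hypothetical record; the values are invented for illustration only.
    sample = {
        "text": "example generation",
        "log_weight": -1.5,
        "sampling_p_scaled": 0.5,
        "sampling_p_temperature_scaled": 0.25,
    }
    print(validate_record(sample))
```

Streaming is suggested here because the training split has 15 million samples; iterating lazily avoids materializing the whole split on disk before inspecting a few records.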
Provider:
kothasuhas



