TAUR-dev/D-ExpTracker__1113_newmodels__llama3b_ct3arg__v1
收藏Hugging Face2025-11-13 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/D-ExpTracker__1113_newmodels__llama3b_ct3arg__v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了用于强化学习(RL)的实验配置、日志、元数据和训练数据。具体包括模型的超参数设置、训练过程中的日志记录、实验的基本信息和描述、以及用于RL训练的元数据信息。
The dataset includes experiment configurations, logs, metadata, and training data for reinforcement learning (RL). It specifically contains model hyperparameters, logs during training, basic information and descriptions of the experiment, and metadata information for RL training.
提供机构:
TAUR-dev



