dogtooth/off-policy-0.1-with-on-policy-0.1-uf_iter2_generated_ultrafeedback_binarized_1730453795
收藏Hugging Face2024-11-01 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/dogtooth/off-policy-0.1-with-on-policy-0.1-uf_iter2_generated_ultrafeedback_binarized_1730453795
下载链接
链接失效反馈官方服务:
资源简介:
allenai/open_instruct数据集是一个生成数据集,主要用于生成任务。数据集的生成过程涉及多个配置参数,包括数据集混合列表、数据集分割、模型路径等。生成过程中使用了拒绝采样技术,并生成了多个完成样本。数据集的目标是生成高质量的对话数据,适用于聊天模型的训练和评估。
The allenai/open_instruct dataset is a generation dataset primarily used for generation tasks. The dataset generation process involves multiple configuration parameters, including dataset mixer list, dataset splits, model path, etc. The generation process employs rejection sampling techniques and generates multiple completion samples. The dataset aims to produce high-quality dialogue data suitable for training and evaluating chat models.
提供机构:
dogtooth



