cfpark00/toy-multistep-nn_5-na_20-nab_10-seed_2
收藏Hugging Face2025-04-07 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/cfpark00/toy-multistep-nn_5-na_20-nab_10-seed_2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个主要特征:提示(prompts)、完成(completions)、遮蔽数量(num_maskeds)和文本(texts)。数据集分为训练集(train)、测试集(test_rl和test)。每个数据集包含262144个示例。数据集主要用于文本生成任务,其中可能包含用于机器翻译、文本摘要或其他自然语言处理任务的预训练和微调。
The dataset includes four main features: prompts, completions, number of maskeds, and texts. It is split into three parts: training set (train), two test sets (test_rl and test), each containing 262144 examples. The dataset is primarily used for text generation tasks, possibly including pre-training and fine-tuning for machine translation, text summarization, or other natural language processing tasks.
提供机构:
cfpark00



