wisenut-nlp-team/dpo_v1_10000_kor_eng_splits
收藏Hugging Face2024-12-13 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/wisenut-nlp-team/dpo_v1_10000_kor_eng_splits
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如prompt、chosen、rejected、lang、domain和source,这些特征的数据类型均为字符串。数据集被分割为11个不同的子集,每个子集包含10000个样本,且每个子集的字节数和样本数均相同。此外,README还提供了数据集的下载大小和总大小,以及配置文件中每个子集的数据文件路径。
The dataset contains multiple features such as prompt, chosen, rejected, lang, domain, and source, all of which are of string data type. The dataset is divided into 11 different subsets, each containing 10,000 samples, with the same number of bytes and samples for each subset. Additionally, the README provides the download size and total size of the dataset, as well as the data file paths for each subset in the configuration file.
提供机构:
wisenut-nlp-team



