richard-park/horangi-336K-Filtered-split
收藏Hugging Face2024-12-05 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/richard-park/horangi-336K-Filtered-split
下载链接
链接失效反馈官方服务:
资源简介:
该数据集旨在通过提供多种对话格式的数据来提升호랑이 리더 보드(Tiger Leader Board)的评分。数据集支持axolotl格式,包括alpaca和gpteacher的对话格式。数据集包含多个子集,如STEM_alpaca_data、Applied Science_alpaca_data等,总数据量为1,078,260条,其中训练集和测试集的比例为300,000:36,000。
This dataset is used to improve the leaderboard score of 호랑이, containing various files with a total of 1,078,260 data entries. The dataset is in Korean.
提供机构:
richard-park



