schuler/open-orca-slimorca-deduped-cleaned-corrected-for-pascal-txt
收藏Hugging Face2024-11-17 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/schuler/open-orca-slimorca-deduped-cleaned-corrected-for-pascal-txt
下载链接
链接失效反馈官方服务:
资源简介:
Open Orca Slim for Pascal Developers数据集是Open Orca数据集的一个子集,专为Pascal开发者设计。该数据集包含纯英文字符,通过对原始数据集中的对话内容进行处理,去除特定字符并确保所有字符的ASCII码小于128,最终生成了训练集和验证集。
Open Orca Slim for Pascal Developers is a modified English-only dataset, which is a subset of the original Open Orca dataset. The dataset contains only English characters and has been processed and cleaned to ensure data quality. It is divided into training and validation sets, supporting tasks for Pascal developers.
提供机构:
schuler



