johannesack/alpaca_sft1738133725
收藏Hugging Face2025-01-29 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/johannesack/alpaca_sft1738133725
下载链接
链接失效反馈官方服务:
资源简介:
Alpaca-instructions数据集是一个适用于TLDR代码格式的数据集,由Costa Huang创建。该数据集经过筛选,只包含查询和参考响应的总令牌长度小于或等于615,且参考响应的令牌长度小于或等于106的示例。验证集也经过了最大长度的筛选。测试集是alpaca_farm_evaluation,也进行了轻微的筛选。
The Alpaca-instructions dataset is a dataset formatted for TLDR code by Costa Huang. It is filtered to include only examples where the sum of the token length of the query and the reference response is less than or equal to 615, and the token length of the reference response is less than or equal to 106. The validation dataset is also filtered to the max lengths. The test split is alpaca_farm_evaluation and is slightly filtered as well.
提供机构:
johannesack



