Jianshu001/arabic-daily-batch02-cascade-5k
收藏Hugging Face2026-04-23 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Jianshu001/arabic-daily-batch02-cascade-5k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含5001条阿拉伯语文本记录,通过Gemma-4-31B进行级联重写,并使用GPT-5.4-mini作为二元评判进行过滤(仅删除,不重写)。数据处理流程与batch01相同。
This dataset contains 5001 Arabic text records, processed via cascade rewrite using Gemma-4-31B and filtered by a GPT-5.4-mini binary judge (drop-only, no rewrites). The pipeline is the same as batch01.
提供机构:
Jianshu001



