microsoft/Updesh_beta
收藏Hugging Face2025-07-07 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/microsoft/Updesh_beta
下载链接
链接失效反馈官方服务:
资源简介:
Updesh是一个大规模的合成数据集,旨在推动印度语系的LLM后训练。它结合了翻译推理数据和合成的开放域生成内容,以支持基于印度语言和文化背景的多语言LLM适应。Updesh通过提供丰富的多语言指令调整数据,填补了高质量、文化基础资源在印度语系中的空白。
Updesh is a large-scale synthetic dataset designed to advance post-training of LLMs for Indic languages. It integrates translated reasoning data and synthesized open-domain generative content to support culturally-grounded multilingual adaptation of LLMs.
提供机构:
microsoft



