leklek02/pangasinan
收藏Hugging Face2026-04-23 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/leklek02/pangasinan
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是原始Alpaca指令跟随数据集的**Pangasinan语翻译**版本,旨在支持低资源菲律宾语言(特别是Pangasinan语)的指令调优语言模型的研究和开发。数据集保留了原始Alpaca的结构,同时提供了高质量的指令、输入和输出的翻译。每个示例包含Pangasinan语的指令、可选上下文和预期响应。数据集可用于Pangasinan语的指令调优、多语言NLP研究、低资源语言建模以及为Pangasinan语使用者开发聊天机器人和助手。
This dataset is a **Pangasinan translation** of the original Alpaca instruction-following dataset. It is designed to support research and development of **instruction-tuned language models** for low-resource Philippine languages, particularly Pangasinan. The dataset retains the original Alpaca structure while providing high-quality translations of instructions, inputs, and outputs. Each example includes an instruction, optional context, and expected response in Pangasinan. The dataset can be used for instruction tuning of LLMs in Pangasinan, multilingual NLP research, low-resource language modeling, and chatbot and assistant development for Pangasinan speakers.
提供机构:
leklek02



