LingoIITGN/PHINC
收藏Hugging Face2025-03-20 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/LingoIITGN/PHINC
下载链接
链接失效反馈官方服务:
资源简介:
PHINC(平行Hinglish社交媒体代码混合语料库)是一个针对机器翻译的低资源任务的手动注释的13,738个Hinglish-英语句子对的数据集。该数据集旨在解决翻译带噪声、非正式、代码混合的社交媒体文本的挑战。
PHINC (Parallel Hinglish Social Media Code-Mixed Corpus for Machine Translation) is a dataset of 13,738 manually annotated Hinglish-English sentence pairs for the low-resource machine translation task, designed to address the challenges of translating noisy, informal, code-mixed social media text.
提供机构:
LingoIITGN



