Hinglish Sentences
Source: arXiv (indexed 2025-09-30)
Download link:
https://github.com/lingo-iitgn/commentator
Description:
This dataset contains ten Hinglish (Hindi-English code-mixed) sentences annotated for word-level language tasks. Each sentence is labeled with both Language Identification (LID) tags and Part-of-Speech (POS) tags, so it can be used for word-level LID and POS tagging as well as broader linguistic analysis of the text.
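As a rough illustration of what word-level LID and POS annotation looks like in practice, here is a minimal Python sketch. The `Token` class, the tag values (`hi`, `en`, `NOUN`, and so on), and the example sentence are all assumptions made up for illustration; they are not taken from the dataset, whose actual file format and tagset are not described here.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    """One annotated word. Field names and tag values are hypothetical."""
    text: str
    lid: str  # language tag, e.g. "hi" (Hindi) or "en" (English)
    pos: str  # part-of-speech tag, e.g. "NOUN"


# Made-up Hinglish sentence for illustration only; NOT from the dataset.
sentence = [
    Token("yeh", "hi", "DET"),
    Token("movie", "en", "NOUN"),
    Token("bahut", "hi", "ADV"),
    Token("boring", "en", "ADJ"),
    Token("hai", "hi", "AUX"),
]


def lid_counts(tokens):
    """Count how many words carry each language tag."""
    return Counter(tok.lid for tok in tokens)


def switch_points(tokens):
    """Count positions where the language tag differs from the previous word."""
    return sum(1 for prev, cur in zip(tokens, tokens[1:]) if prev.lid != cur.lid)


if __name__ == "__main__":
    print(dict(lid_counts(sentence)))
    print(switch_points(sentence))
```

Keeping both tags on the same word record makes it straightforward to compute per-language statistics or to filter POS tags by language, which is the kind of combined analysis that paired LID and POS labels allow.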
Provided by:
COMMENTATOR project on GitHub



