five

damerajee/khasi-datasets

收藏
Hugging Face2023-11-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/damerajee/khasi-datasets
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - text-generation pretty_name: 'Tribal Language , language modeling ' size_categories: - 1K<n<10K --- # What is Khasi Language? ## Location: - Primarily spoken in the northeastern Indian state of Meghalaya. - Also spoken in parts of Assam, Tripura, and Bangladesh. ## Language Family: - Khasi is a member of the Austroasiatic language family. ## Script: - Traditionally written using the Khasi script, which is a script created specifically for the Khasi language. ## Culture and Identity: - The Khasi language is an integral part of the cultural identity of the Khasi people. - It plays a significant role in traditional Khasi folklore, rituals, and oral traditions. ## Grammar: - Khasi has a subject-verb-object (SVO) word order. - Nouns do not have gender, and there is no grammatical distinction between singular and plural. ## Vocabulary: - The vocabulary of Khasi reflects the cultural and natural environment of the Khasi people, including terms related to agriculture, nature, and social customs. - ## Multilingualism: - Many Khasi speakers are multilingual, often fluent in English and other languages due to the region's diverse linguistic landscape. ## Linguistic Features: - Khasi is known for its unique linguistic features, including a system of classifiers used in counting and categorizing objects. ## Language Preservation: - Efforts are made to preserve and promote the Khasi language through education, literature, and cultural programs. ## Cultural Significance: - The Khasi language is closely tied to the cultural and historical heritage of the Khasi people, contributing to their distinct identity in the northeastern region of India.
提供机构:
damerajee
原始信息汇总

数据集概述

数据集名称

  • 名称:Tribal Language, language modeling

任务类别

  • 文本生成

数据集大小

  • 1K<n<10K

语言信息

位置

  • 主要在印度东北部的梅加拉亚邦使用。
  • 也在阿萨姆邦、特里普拉邦和孟加拉国部分地区使用。

语言家族

  • 属于南亚语系。

文字

  • 传统上使用专门为卡西语创造的卡西文字。

文化和身份

  • 卡西语是卡西人民文化身份的重要组成部分。
  • 在传统的卡西民间传说、仪式和口头传统中扮演重要角色。

语法

  • 采用主谓宾(SVO)词序。
  • 名词没有性别,也没有单复数的语法区分。

词汇

  • 词汇反映了卡西人民的文化和自然环境,包括与农业、自然和社会习俗相关的术语。

多语言能力

  • 许多卡西语使用者是多语言者,通常精通英语和其他语言,这是该地区多样化的语言环境所致。

语言特征

  • 卡西语以其独特的语言特征而闻名,包括用于计数和分类对象的分类系统。

语言保护

  • 通过教育、文学和文化项目努力保护和推广卡西语。

文化意义

  • 卡西语与卡西人民的文化和历史遗产紧密相关,为其在印度东北部地区的独特身份做出贡献。
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作