kalixlouiis/myanmar-literature-corpus
收藏Hugging Face2025-10-20 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/kalixlouiis/myanmar-literature-corpus
下载链接
链接失效反馈官方服务:
资源简介:
MYLT Corpus 是一个包含缅甸语书籍和长篇小说文本的通用语料库。它主要用于研究和教育目的,特别是为了提高缅甸自然语言处理(NLP)模型在语言建模、文本分类和情感分析等任务中的性能。数据以 JSON Lines 文件组织,包含文本、作者和书名等字段。数据集遵循 CC BY-NC-ND 4.0 许可,这意味着它不能用于商业目的,并且需要标注出处。用户必须从原始作者或出版商那里获得商业使用的许可。
The Myanmar Literature Text (MYLT) Corpus is a general-purpose corpus containing texts from Myanmar books and novels. It is primarily intended for research and educational purposes, specifically to enhance the performance of Myanmar NLP models in tasks such as language modeling, text classification, and sentiment analysis. The data is structured in JSON Lines files, with fields for the text, author, and book title. The dataset is licensed under CC BY-NC-ND 4.0, which means it cannot be used for commercial purposes and requires attribution. Users must obtain permission from the original authors or publishers for commercial use.
提供机构:
kalixlouiis



