ai-forever/udmurt-corpora
收藏Hugging Face2024-12-11 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ai-forever/udmurt-corpora
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含各种乌德穆尔特语的文本材料。内容包括:文学作品(民间故事、诗歌和书籍摘录)、新闻文章(本地新闻和文化更新)以及对话数据(聊天记录和口语转录)。该数据集设计用于语言学研究、自然语言处理(NLP)任务以及构建乌德穆尔特语的语言模型。
This dataset comprises a variety of textual materials in the Udmurt language, including literary works (folk tales, poetry, and excerpts from books), news articles (local news and cultural updates), and conversational data (chat logs and spoken word transcripts). The dataset is designed for linguistic research, natural language processing (NLP) tasks, and building language models for the Udmurt language.
提供机构:
ai-forever



