five

z00logist/OldChurchSlavonic

收藏
Hugging Face2024-12-20 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/z00logist/OldChurchSlavonic
下载链接
链接失效反馈
官方服务:
资源简介:
Old Church Slavonic数据集是一个由互联网上公开资源和学术数据集中的手稿组成的预处理文本集合。数据集包含来自不同古教会斯拉夫语手稿的结构化文本,按来源分类(如Akafist、Triodpost、Evangelie等)。该数据集旨在解决Hugging Face上缺乏古教会斯拉夫语专用数据集的问题,保留了古教会斯拉夫语的丰富语言特征,并为解决不同的NLP任务提供了基础。该数据集的创建目的是保存和推广斯拉夫文化的丰富文化和语言遗产,使其能够被现代计算工具访问,并鼓励在科学研究、语言建模和教育环境中探索古教会斯拉夫语。

The Old Church Slavonic Dataset is a collection of preprocessed texts sourced from a combination of publicly accessible resources on the Internet and curated manuscripts from academic datasets. The dataset consists of structured texts from various Old Church Slavonic manuscripts, divided by sources (e.g., Akafist, Triodpost, Evangelie, etc.). This dataset is designed to address the lack of dedicated datasets for Old Church Slavonic on Hugging Face. It preserves the rich linguistic features of Old Church Slavonic and provides a foundation for solving different NLP tasks with it. This dataset was created with the goal of preserving and promoting the rich cultural and linguistic heritage of Slavic culture, making it accessible to modern computational tools. The aim is to encourage the exploration of Old Church Slavonic in scientific research, language modeling, and educational contexts.
提供机构:
z00logist
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作