
CLARIN.SI repository

Indexed from re3data.org on 2024-05-31
Download link:
https://www.re3data.org/repository/r3d100011922
Resource description:
CLARIN.SI is the Slovenian node of the European CLARIN (Common Language Resources and Technology Infrastructure) Centers. The CLARIN.SI repository is hosted at the Jožef Stefan Institute and offers long-term preservation of deposited linguistic resources, along with their descriptive metadata. Integrating the repository with the CLARIN infrastructure gives the deposited resources wide exposure, so that they can be known, used, and further developed beyond the lifetime of the projects in which they were produced. Among the resources currently available in the CLARIN.SI repository are the multilingual MULTEXT-East resources, the CC version of the Slovenian reference corpus Gigafida, the morphological lexicon Sloleks, the IMP corpora and lexicons of historical Slovenian, and many other resources for a variety of languages. In addition, several REST-based web services are provided for different corpus-linguistic and NLP tasks.

Provided by:
Slovenian CLARIN repository