five

EXCEPTIUS Corpus

收藏
DataCite Commons2025-07-03 更新2025-04-09 收录
下载链接:
https://dataverse.nl/citation?persistentId=doi:10.34894/ZUWAPS
下载链接
链接失效反馈
官方服务:
资源简介:
EXCEPTIUS Corpus v1.0, containing the following data:<br> - raw documents for 21 countries at national level<br> - pre-processed data with spacy-udpipe v1.0<br> - automatically annotated documents for the identification of exceptional measures at sentence level<br> <br> Country list (ISO 3166-1 alpha-2): AT, BE, HR, CY, CZ, DK, FR, DE, HU, IE, IT, LV, LT, NL, NO, PL, SI, SE, CH, UK <br> <br> Folder structure: each country has a dedicated folder. Inside each folder you will find the following subfolders: <br> - raw_text: the raw text data (.txt format) <br> - processed: the output of the spacy-udpipe v1.0 - each line is a sentence, containing the following info: tokens, lemma, POS, UD dependency relations<br> - model: the predictions of the trained model (XML pre@36 as reported in Table 4 of the paper). Each line is a sentence, separate by 9 tab - each for a exceptional measure class. 1: signals presence of a class. <br> <br> The Italy and Norway folder misses the predictions of the models.
提供机构:
DataverseNL
创建时间:
2021-09-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作