EXCEPTIUS Corpus
Source: DataCite Commons | Updated: 2025-07-03 | Indexed: 2025-04-09
Download link:
https://dataverse.nl/citation?persistentId=doi:10.34894/ZUWAPS
Description:
EXCEPTIUS Corpus v1.0, containing the following data:
- raw documents for 21 countries at national level
- pre-processed data with spacy-udpipe v1.0
- automatically annotated documents for the identification of exceptional measures at sentence level

Country list (ISO 3166-1 alpha-2): AT, BE, HR, CY, CZ, DK, FR, DE, HU, IE, IT, LV, LT, NL, NO, PL, SI, SE, CH, UK

Folder structure: each country has a dedicated folder. Inside each folder you will find the following subfolders:
- raw_text: the raw text data (.txt format)
- processed: the output of spacy-udpipe v1.0. Each line is a sentence, containing the following info: tokens, lemma, POS, UD dependency relations
- model: the predictions of the trained model (XML pre@36 as reported in Table 4 of the paper). Each line is a sentence, separated by 9 tabs, one for each exceptional measure class. A value of 1 signals the presence of a class.

The Italy and Norway folders do not contain the model predictions.
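
The folder layout and the prediction format described above could be read along the following lines. This is only a minimal sketch under stated assumptions: the helper names `available_subfolders` and `parse_prediction_line` are hypothetical, the class flags are assumed to be the last 9 tab-separated fields of a line, and any fields before them are assumed to hold the sentence text. Check these assumptions against the actual files before relying on them.

```python
from pathlib import Path

# Subfolder names as given in the corpus description.
SUBFOLDERS = ("raw_text", "processed", "model")


def available_subfolders(country_dir):
    """Return the documented subfolders that exist for one country.

    Per the description, the Italy and Norway folders have no model
    predictions, so "model" may be absent from the result.
    """
    country_dir = Path(country_dir)
    return [name for name in SUBFOLDERS if (country_dir / name).is_dir()]


def parse_prediction_line(line, n_classes=9):
    """Split one prediction line into (sentence_text, class_flags).

    ASSUMPTION: the last `n_classes` tab-separated fields are 0/1 flags,
    one per exceptional measure class; anything before them is treated
    as the sentence text (empty if the line holds only flags).
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) < n_classes:
        raise ValueError(f"expected at least {n_classes} fields, got {len(fields)}")
    flags = [int(value) for value in fields[-n_classes:]]
    text = "\t".join(fields[:-n_classes])
    return text, flags
```

For example, `parse_prediction_line("A sentence.\t0\t0\t1\t0\t0\t0\t0\t0\t0")` would give the sentence text and a list of nine flags in which only the third class is marked as present.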
Provider:
DataverseNL
Created:
2021-09-29



