Semantically tagged Europarl-pt.v7
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/4627153
下载链接
链接失效反馈官方服务:
资源简介:
Semantically tagged Europarl-pt.v7
55.9+ M lines
Lexical coverage of the tagging: 76.97%
No semantic ambiguity resolving, all the tags marked
POS tagging for semantic tagging performed with Treetagger: https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/
Output in base form
Documentation of the semantic tagger (for Finnish, but same principles hold for Portuguese, too):
https://www.aclweb.org/anthology/W19-0306/
https://zenodo.org/record/3676372#.YFNwIa8zY2w
Format: base form POS Semtag
Unknown words marked with tag Z99
Semantic tagging
Tagging of the data was performed in Puhti computing environment of the CSC – IT CENTER FOR SCIENCE LTD. https://research.csc.fi/-/puhti
Example output
reinício# Z99
de+a# Z99
sessão noun T1.3 T1.3/G1.1 T1.3/G2.1 T1.3/K1
declarar# Z99
reaberto# Z99
o pron Z8 Z8f Z8m Z8mf Z8mfn
sessão noun T1.3 T1.3/G1.1 T1.3/G2.1 T1.3/K1
de+o# Z99
parlamento noun G1.1
europeu noun Z2/S2mf Z3
, PUNCT
que conj A13.3 A6.1+ Z5 Z8
ter# Z99
ser verb A3+ Z5
interromper verb H4 T1.3+
em+a# Z99
创建时间:
2022-09-22



