
Semantically tagged Europarl-pt.v7

Source: NIAID Data Ecosystem (indexed 2026-03-14)
Download link:
https://zenodo.org/record/4627153
Description:
Size: 55.9+ M lines
Lexical coverage of the tagging: 76.97%
Semantic ambiguity is not resolved; all candidate tags are listed for each word.
POS tagging for the semantic tagging was performed with TreeTagger: https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/
Output is in base form.

Documentation of the semantic tagger (written for Finnish, but the same principles hold for Portuguese):
https://www.aclweb.org/anthology/W19-0306/
https://zenodo.org/record/3676372#.YFNwIa8zY2w

Format: base form, POS, semantic tag(s)
Unknown words are marked with the tag Z99.

Tagging of the data was performed in the Puhti computing environment of CSC – IT Center for Science Ltd.: https://research.csc.fi/-/puhti

Example output:

    reinício# Z99
    de+a# Z99
    sessão    noun    T1.3 T1.3/G1.1 T1.3/G2.1 T1.3/K1
    declarar# Z99
    reaberto# Z99
    o    pron    Z8 Z8f Z8m Z8mf Z8mfn
    sessão    noun    T1.3 T1.3/G1.1 T1.3/G2.1 T1.3/K1
    de+o# Z99
    parlamento    noun    G1.1
    europeu    noun    Z2/S2mf Z3
    ,     PUNCT
    que    conj    A13.3 A6.1+ Z5 Z8
    ter# Z99
    ser    verb    A3+ Z5
    interromper    verb    H4 T1.3+
    em+a# Z99
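The format described above (base form, POS, semantic tags, with Z99 for unknown words) could be read with a small parser along the following lines. This is only a sketch inferred from the example output, not code shipped with the dataset: the names `Token`, `parse_line` and `known_ratio` are hypothetical, columns are assumed to be whitespace-separated, and the trailing `#` on unknown words is treated as a marker to strip because that is how it appears in the sample.

```python
from typing import Iterable, List, NamedTuple, Optional


class Token(NamedTuple):
    lemma: str            # base form as it appears in the first column
    pos: Optional[str]    # POS label, or None for unknown words
    semtags: List[str]    # all candidate semantic tags (no disambiguation)
    unknown: bool         # True when the word carries only the Z99 tag


def parse_line(line: str) -> Optional[Token]:
    """Parse one output line; return None for blank lines."""
    fields = line.split()
    if not fields:
        return None
    lemma, rest = fields[0], fields[1:]
    # Assumption: unknown words look like "reinício# Z99" (no POS column).
    if rest == ["Z99"]:
        return Token(lemma.rstrip("#"), None, ["Z99"], True)
    pos = rest[0] if rest else None
    return Token(lemma, pos, rest[1:], False)


def known_ratio(lines: Iterable[str]) -> float:
    """Share of parsed tokens that are not marked Z99.

    Assumption: this simple ratio is for illustration only and may not
    match how the published lexical coverage figure was computed.
    """
    tokens = [t for t in map(parse_line, lines) if t is not None]
    if not tokens:
        return 0.0
    return sum(not t.unknown for t in tokens) / len(tokens)
```

For instance, `parse_line("parlamento    noun    G1.1")` yields a token with lemma `parlamento`, POS `noun` and the single candidate tag `G1.1`, while punctuation lines such as `,     PUNCT` come back with an empty tag list.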
Created:
2022-09-22