PrevDistro
收藏DataCite Commons2025-11-13 更新2026-01-12 收录
下载链接:
https://hdl.handle.net/21.15109/CONCORDA/XTVX3U
下载链接
链接失效反馈官方服务:
资源简介:
PrevDistro (Preverb Distributions) is an open-source dataset containing 41.5 million corpus occurrences of 49 preverb-verb construction types. It consists of 10 columns which are as follows:
1st: ID
2nd: construction type
3rd: construction subtype
4th: preverb position
5th: preverb
6th: verb lemma
7th: intervening words (as lemmas)
8th: actual form
9th: document ID
10th: actual sentence from the Hungarian Gigaword Corpus, the actual form (KWIC) stands between < ... >
提供机构:
ARP
创建时间:
2025-11-13



