W2C – Web to Corpus – tool
Source: B2FIND (indexed 2026-04-25)
Download link:
https://b2find.eudat.eu/dataset/4e55a7bc-ed0f-5393-a6ed-76a4350051b0
Description:
A tool for building multilingual corpora from Wikipedia. It downloads web pages, converts them to plain text, identifies their language, etc. A set of 120 corpora collected using this...
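The steps named in the description (convert pages to plain text, then identify the language) can be illustrated with a minimal sketch. This is not W2C's actual implementation: the names `TextExtractor`, `html_to_text`, `guess_language`, and the tiny `STOPWORDS` table are hypothetical, and the stopword-counting heuristic is a stand-in for whatever language identifier the tool really uses. The download step is left out so the sketch runs offline.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Strip markup and collapse runs of whitespace into single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


# Hypothetical, deliberately tiny stopword lists; real language
# identification needs far more evidence than this.
STOPWORDS = {
    "en": {"the", "and", "of", "is", "in", "to"},
    "de": {"der", "die", "und", "ist", "das", "nicht"},
    "fr": {"le", "la", "et", "est", "les", "des"},
}


def guess_language(text):
    """Return the language whose stopwords occur most often, or None."""
    tokens = text.lower().split()
    scores = {
        lang: sum(token in words for token in tokens)
        for lang, words in STOPWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    page = "<html><body><p>The cat is in the house</p></body></html>"
    text = html_to_text(page)
    print(text, "->", guess_language(text))
```

In a real pipeline the HTML would first be fetched (for example with `urllib.request.urlopen`), and the pages would then be grouped into one corpus per detected language.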



