Spanish CBOW Word Embeddings in FastText

NIAID Data Ecosystem2026-03-14 收录

下载链接：

https://zenodo.org/record/5044987

下载链接

链接失效反馈

官方服务：

资源简介：

These Spanish word embeddings in FastText have been generated from the largest corpus ever made in Spanish till date. The corpus has more than 2TB of high-quality text, compiled from the different web crawlings done by the National Library of Spain from 2009 to 2019. These are the CBOW embeddings, for the SKIP-GRAM embeddings see: https://zenodo.org/record/5046525 Citation @article{gutierrezfandino2022, author = {Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Marc Pàmies and Joan Llop-Palao and Joaquin Silveira-Ocampo and Casimiro Pio Carrino and Carme Armentano-Oller and Carlos Rodriguez-Penagos and Aitor Gonzalez-Agirre and Marta Villegas}, title = {MarIA: Spanish Language Models}, journal = {Procesamiento del Lenguaje Natural}, volume = {68}, number = {0}, year = {2022}, issn = {1989-7553}, url = {http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6405}, pages = {39--60} } Copyright Copyright (c) 2021 Secretaría de Estado de Digitalización e Inteligencia Artificial

创建时间：

2022-11-04

5,000+

优质数据集

54 个

任务类型

进入经典数据集