Spanish CBOW Word Embeddings in FastText
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/5044987
下载链接
链接失效反馈官方服务:
资源简介:
These Spanish word embeddings in FastText have been generated from the largest corpus ever made in Spanish till date. The corpus has more than 2TB of high-quality text, compiled from the different web crawlings done by the National Library of Spain from 2009 to 2019.
These are the CBOW embeddings, for the SKIP-GRAM embeddings see: https://zenodo.org/record/5046525
Citation
@article{gutierrezfandino2022,
author = {Asier Gutiérrez-Fandiño and Jordi Armengol-Estapé and Marc Pàmies and Joan Llop-Palao and Joaquin Silveira-Ocampo and Casimiro Pio Carrino and Carme Armentano-Oller and Carlos Rodriguez-Penagos and Aitor Gonzalez-Agirre and Marta Villegas},
title = {MarIA: Spanish Language Models},
journal = {Procesamiento del Lenguaje Natural},
volume = {68},
number = {0},
year = {2022},
issn = {1989-7553},
url = {http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6405},
pages = {39--60}
}
Copyright
Copyright (c) 2021 Secretaría de Estado de Digitalización e Inteligencia Artificial
创建时间:
2022-11-04



