
Nos_MeteoGalicia-GL Data-to-Text

Indexed from NIAID Data Ecosystem on 2026-05-01
Download link:
https://zenodo.org/record/7661649
Description:
MeteoGalicia-GL is the first known data-to-text dataset in the Galician language. It consists of 3,302 records, each pairing tabular meteorological forecast data with a handwritten textual description in Galician. All data were collected from the Galician Official Meteorological Agency (MeteoGalicia): the tables contain real forecast data, and the texts were written by MeteoGalicia's expert meteorologists. The texts were additionally curated by CiTIUS staff to check that each caption is consistent with its data table.

The dataset is stored in the "dataset" directory, which holds an individual file for each data table and each caption, in their respective folders.

Each data table represents the state of the sky for one day in Galicia (NW Spain) using categorical values in Galician, e.g., "despexado" ("sunny") or "nubrado" ("cloudy"). A table has 4 columns and 32 rows. The first column ("Zona") identifies one of the 32 geographical zones of Galicia covered by the forecast; the remaining columns give the state-of-the-sky value for each period of the day: morning ("Mañá"), afternoon ("Tarde"), and night ("Noite").
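The table layout described above (a "Zona" column plus one column per period of the day, 32 rows) can be checked with a short Python sketch. The description does not state the on-disk file format, so the comma-separated layout with a header row, the function names `parse_forecast_table` and `count_states`, and the placeholder zone names in the sample are all assumptions made for illustration, not part of the dataset's documentation.

```python
import csv
import io
from collections import Counter

# Column names and row count as given in the dataset description.
EXPECTED_COLUMNS = ["Zona", "Mañá", "Tarde", "Noite"]
EXPECTED_ROWS = 32


def parse_forecast_table(text, delimiter=","):
    """Parse one table into a list of dicts and check its shape.

    Assumption: the table is delimiter-separated text with a header row.
    The real files may use a different format or delimiter.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    if reader.fieldnames != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected columns: {reader.fieldnames}")
    rows = list(reader)
    if len(rows) != EXPECTED_ROWS:
        raise ValueError(f"expected {EXPECTED_ROWS} rows, got {len(rows)}")
    return rows


def count_states(rows, period):
    """Count how many zones report each state-of-the-sky value in a period."""
    return Counter(row[period] for row in rows)


# Synthetic sample with placeholder zone names (not real MeteoGalicia zones).
sample = "Zona,Mañá,Tarde,Noite\n" + "".join(
    f"zona_{i},despexado,nubrado,despexado\n" for i in range(1, 33)
)
rows = parse_forecast_table(sample)
morning = count_states(rows, "Mañá")
```

A per-period count like `morning` is one simple way to sanity-check a caption against its table, for example to confirm that a text describing mostly clear skies matches a table dominated by "despexado".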
Created:
2023-06-22