five

ainhoa-vivel/MeteoGalEus

收藏
Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/ainhoa-vivel/MeteoGalEus
下载链接
链接失效反馈
官方服务:
资源简介:
MeteoGalEus数据集是一个多语言、跨区域的天气语料库,结合了结构化的气象数据和人类撰写的天气预报。它涵盖三种语言:西班牙语、加利西亚语和巴斯克语。整个数据集以JSON格式分发,便于集成和分析。每个条目对应一个特定地区和单日的天气情况,包括气象预测和专业气象学家撰写的预报文本。数据覆盖每个地区的33至36个城市,提供全面的覆盖范围。该数据集支持多种研究应用,包括跨区域天气分析、多语言自然语言生成、气候趋势研究、大型语言模型评估以及对低资源语言(特别是巴斯克语和加利西亚语)的研究。

MeteoGalEus dataset is a multilingual, cross-regional weather corpus that combines structured meteorological data and human-written forecasts. It covers three languages: Spanish, Galician, and Euskera. The entire dataset is distributed in JSON format, facilitating easy integration and analysis. Each entry corresponds to a single day and a specific region, comprising both meteorological predictions and forecast texts authored by professional meteorologists. The data spans 33 to 36 cities per region, providing comprehensive coverage. This dataset supports a range of research applications, including cross-regional weather analysis, multilingual natural language generation, climate trend studies, evaluation of large language models, and investigations into low-resource languages, particularly Euskera and Galician.
提供机构:
ainhoa-vivel
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作