GreenNode/nano-climate-fever-vn
Source: Hugging Face | Updated: 2025-12-30 | Indexed: 2026-01-03
Download link:
https://hf-mirror.com/datasets/GreenNode/nano-climate-fever-vn
Description:
NanoClimateFEVER-VN is an MTEB (Massive Text Embedding Benchmark) dataset translated from CLIMATE-FEVER. It consists of 1,535 real-world claims about climate change and follows the FEVER methodology. The translation was produced with large language models (LLMs), and the results were filtered with embedding models. Task categories: text retrieval and fact-checking. Language: Vietnamese (vie). License: cc-by-sa-4.0. The dataset has three configurations, corpus, qrels (query relevance judgments), and queries, each with its own features and split information.
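To illustrate how the three configurations relate in a retrieval task, here is a minimal sketch using toy in-memory data. All IDs, texts, and the field layout are hypothetical stand-ins, not values from the actual dataset, and the token-overlap scorer is only a placeholder for a real embedding model.

```python
from collections import defaultdict

# Toy stand-ins for the three configurations. IDs, texts, and layout are
# hypothetical and NOT taken from the actual dataset.
corpus = {
    "d1": "Băng ở Bắc Cực đang tan nhanh hơn.",
    "d2": "Mực nước biển toàn cầu đang dâng.",
    "d3": "Cà phê được trồng ở Tây Nguyên.",
}
queries = {"q1": "Băng Bắc Cực đang tan", "q2": "Mực nước biển dâng"}
qrels = [("q1", "d1", 1), ("q2", "d2", 1)]  # (query id, document id, relevance)


def build_relevance(qrels):
    """Map each query id to the set of document ids judged relevant."""
    relevant = defaultdict(set)
    for qid, did, score in qrels:
        if score > 0:
            relevant[qid].add(did)
    return relevant


def token_overlap(query, doc):
    """Placeholder scorer: number of lowercase tokens shared by query and doc."""
    q_tokens = set(query.lower().split())
    d_tokens = set(doc.lower().replace(".", "").split())
    return len(q_tokens & d_tokens)


def retrieve(query, corpus, k):
    """Return the ids of the k best-scoring documents (ties broken by id)."""
    ranked = sorted(corpus, key=lambda did: (-token_overlap(query, corpus[did]), did))
    return ranked[:k]


def recall_at_k(queries, corpus, relevant, k):
    """Average fraction of relevant documents found in the top k per query."""
    scores = []
    for qid, text in queries.items():
        if not relevant[qid]:
            continue  # skip queries without relevance judgments
        hits = set(retrieve(text, corpus, k)) & relevant[qid]
        scores.append(len(hits) / len(relevant[qid]))
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    print(recall_at_k(queries, corpus, build_relevance(qrels), k=1))
```

The real data could presumably be loaded one configuration at a time with the Hugging Face `datasets` library, for example `load_dataset("GreenNode/nano-climate-fever-vn", "corpus")`; the actual column names should be checked against the dataset card before adapting the sketch.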
Provider:
GreenNode



