sionic-ai/NanoBEIR-th
Source: Hugging Face. Updated: 2025-12-19. Indexed: 2026-02-07.
Download link:
https://hf-mirror.com/datasets/sionic-ai/NanoBEIR-th
Description:
NanoBEIR-th is a Thai translation of the NanoBEIR benchmark dataset for evaluating information retrieval. It has three main configurations: corpus (the documents), queries, and qrels (query-document relevance judgments). Each configuration contains multiple subsets, such as NanoClimateFEVER and NanoDBPedia, and the dataset card lists the byte count and number of examples for each subset. The dataset was translated with GPT-4o-mini, and the translation quality was verified with GPT-4o.
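As a rough illustration of how the three configurations fit together in a retrieval evaluation, the sketch below scores a toy retriever with recall@k. Everything in it is an assumption made for illustration: the in-memory layout (dicts keyed by ID, qrels as ID pairs), the English toy data, and the helper names `rank` and `recall_at_k` are hypothetical and are not taken from the dataset card. The real data would presumably be loaded with the Hugging Face `datasets` library, whose column names should be checked on the dataset page.

```python
# Toy stand-ins for the three configurations. The layout is assumed,
# not copied from the dataset card.
corpus = {
    "d1": "climate change raises sea levels",
    "d2": "bangkok is the capital of thailand",
    "d3": "sea turtles nest on beaches",
}
queries = {
    "q1": "capital of thailand",
    "q2": "sea levels and climate",
}
# qrels: (query ID, relevant document ID) pairs.
qrels = [("q1", "d2"), ("q2", "d1")]


def rank(query, corpus, k):
    """Toy retriever: rank documents by the number of tokens shared with the query."""
    q_tokens = set(query.split())
    scored = [
        (len(q_tokens & set(text.split())), doc_id)
        for doc_id, text in corpus.items()
    ]
    # Highest overlap first; ties broken by document ID for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored[:k]]


def recall_at_k(queries, corpus, qrels, k):
    """Mean fraction of each query's relevant documents found in its top k."""
    relevant = {}
    for query_id, doc_id in qrels:
        relevant.setdefault(query_id, set()).add(doc_id)

    per_query = []
    for query_id, text in queries.items():
        wanted = relevant.get(query_id)
        if not wanted:
            continue  # skip queries without any relevance judgment
        retrieved = set(rank(text, corpus, k))
        per_query.append(len(retrieved & wanted) / len(wanted))
    return sum(per_query) / len(per_query) if per_query else 0.0


if __name__ == "__main__":
    print(recall_at_k(queries, corpus, qrels, k=1))
```

The whitespace tokenizer is only a placeholder. Thai is written without spaces between words, so a real evaluation on this dataset would need an embedding model or a Thai-aware tokenizer in place of `rank`.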
Provider:
sionic-ai



