neuclir/news-topics
Source: Hugging Face. Updated 2025-10-01. Indexed 2025-10-18.
Download link:
https://hf-mirror.com/datasets/neuclir/news-topics
Description:
The NeuCLIR News Topics dataset is a multilingual dataset of queries and documents for news retrieval tasks. It covers four languages (English, Chinese, Persian, and Russian) and is suited to text retrieval and text ranking. The dataset is small (size category under 1K) and provides a default configuration. Its data files are stored as tab-delimited CSV with two fields, id and query.
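As an illustration of the file layout described above, here is a minimal sketch of parsing tab-delimited rows with id and query fields using Python's standard csv module. It assumes the file begins with a header row naming the two fields, which the description does not confirm. The function name read_topics and the sample rows are hypothetical and are not taken from the dataset.

```python
import csv
import io


def read_topics(stream):
    """Parse tab-delimited topic rows into a dict mapping id -> query.

    Assumes a header row with the column names "id" and "query".
    """
    # QUOTE_NONE keeps quote characters inside queries as literal text.
    reader = csv.DictReader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    return {row["id"]: row["query"] for row in reader}


# Hypothetical sample rows, invented for this example.
sample = "id\tquery\n1\texample query one\n2\texample query two\n"
topics = read_topics(io.StringIO(sample))
print(topics)
```

To read a downloaded file instead, pass an open file object, for example one created with open(path, encoding="utf-8", newline=""), where path is wherever the file was saved. UTF-8 is assumed here because the queries span several scripts.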
Provider:
neuclir



