five

lightretriever/lightretriever-finetune-data

收藏
Hugging Face2025-07-10 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/lightretriever/lightretriever-finetune-data
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了用于研究论文《LightRetriever:一种1000倍快速查询推断的基于LLM的混合检索架构》的所有训练数据集。数据集用于句子相似度和文本排名等任务,提供英文和中文两种语言版本。数据集以Parquet文件形式组织,有多种配置,如agnews、All_classification、AllArxiv_clustering等。README文件中没有提供数据集的具体内容或用途描述,但提到未来将更新更多信息。根据仓库名称和论文标题,这些数据集与训练一个名为LightRetriever的检索系统相关。

This dataset includes all training datasets for the research paper LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference. The datasets are used for tasks such as sentence similarity and text ranking, and are available in both English and Chinese. The datasets are organized in Parquet files, with multiple configurations like agnews, All_classification, AllArxiv_clustering, etc. The README file does not provide a specific description of the datasets content or purpose, but it mentions that more information will be provided in the future. Based on the repository name and the paper title, these datasets are related to training a retrieval system called LightRetriever.
提供机构:
lightretriever
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作