lightretriever/lightretriever-finetune-data
收藏Hugging Face2025-07-10 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/lightretriever/lightretriever-finetune-data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了用于研究论文《LightRetriever:一种1000倍快速查询推断的基于LLM的混合检索架构》的所有训练数据集。数据集用于句子相似度和文本排名等任务,提供英文和中文两种语言版本。数据集以Parquet文件形式组织,有多种配置,如agnews、All_classification、AllArxiv_clustering等。README文件中没有提供数据集的具体内容或用途描述,但提到未来将更新更多信息。根据仓库名称和论文标题,这些数据集与训练一个名为LightRetriever的检索系统相关。
This dataset includes all training datasets for the research paper LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference. The datasets are used for tasks such as sentence similarity and text ranking, and are available in both English and Chinese. The datasets are organized in Parquet files, with multiple configurations like agnews, All_classification, AllArxiv_clustering, etc. The README file does not provide a specific description of the datasets content or purpose, but it mentions that more information will be provided in the future. Based on the repository name and the paper title, these datasets are related to training a retrieval system called LightRetriever.
提供机构:
lightretriever



