mteb/COIRCodeSearchNetRetrieval
收藏Hugging Face2025-05-06 更新2025-05-31 收录
下载链接:
https://hf-mirror.com/datasets/mteb/COIRCodeSearchNetRetrieval
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了Go、Java、JavaScript、PHP、Python和Ruby等编程语言的相关文本和查询数据。每个数据集都分为语料库(corpus)、查询(queries)和相关性评分(qrels)三个部分。语料库部分包含文本内容和标题,查询部分包含查询文本,相关性评分部分包含查询与语料库之间的相关性评分。数据集适用于信息检索、文本挖掘和自然语言处理等领域的研究和应用。
The dataset includes text and query data related to programming languages such as Go, Java, JavaScript, PHP, Python, and Ruby. Each dataset is divided into three parts: corpus, queries, and qrels. The corpus part contains text content and titles, the queries part contains query text, and the qrels part contains relevance scores between queries and corpus. The datasets are suitable for research and applications in information retrieval, text mining, and natural language processing.
提供机构:
mteb



