fyaronskiy/code_search_net_ru_en
收藏Hugging Face2025-11-24 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/fyaronskiy/code_search_net_ru_en
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: repository_name
dtype: string
- name: func_path_in_repository
dtype: string
- name: func_name
dtype: string
- name: whole_func_string
dtype: string
- name: language
dtype: string
- name: func_code_string
dtype: string
- name: func_code_tokens
list: string
- name: func_documentation_string
dtype: string
- name: ru_func_documentation_string
dtype: string
- name: func_documentation_tokens
list: string
- name: split_name
dtype: string
- name: func_code_url
dtype: string
splits:
- name: train
num_bytes: 6457176369
num_examples: 1880853
- name: validataion
num_bytes: 304733239
num_examples: 89154
- name: test
num_bytes: 339955840
num_examples: 100529
download_size: 2216518338
dataset_size: 7101865448
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validataion
path: data/validataion-*
- split: test
path: data/test-*
language:
- ru
- en
tags:
- code
- code_retrieval
- text_retrieval
task_categories:
- sentence-similarity
- text-retrieval
---
The [CodeSearhNet Dataset](https://huggingface.co/datasets/code-search-net/code_search_net) translated into Russian. Translation was done with [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) model.
提供机构:
fyaronskiy



