FudanSELab/SO_KGXQR_DUPLICATE
收藏Hugging Face2023-11-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/FudanSELab/SO_KGXQR_DUPLICATE
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
- config_name: duplicate_csharp
features:
- name: query
dtype: string
- name: relevant
sequence: string
splits:
- name: test
num_bytes: 91485
num_examples: 1200
download_size: 61619
dataset_size: 91485
- config_name: duplicate_java
features:
- name: query
dtype: string
- name: relevant
sequence: string
splits:
- name: test
num_bytes: 102838
num_examples: 1200
download_size: 69239
dataset_size: 102838
- config_name: duplicate_javascript
features:
- name: query
dtype: string
- name: relevant
sequence: string
splits:
- name: test
num_bytes: 107321
num_examples: 1200
download_size: 69456
dataset_size: 107321
- config_name: duplicate_python
features:
- name: query
dtype: string
- name: relevant
sequence: string
splits:
- name: test
num_bytes: 109709
num_examples: 1200
download_size: 73833
dataset_size: 109709
configs:
- config_name: duplicate_csharp
data_files:
- split: test
path: duplicate_csharp/test-*
- config_name: duplicate_java
data_files:
- split: test
path: duplicate_java/test-*
- config_name: duplicate_javascript
data_files:
- split: test
path: duplicate_javascript/test-*
- config_name: duplicate_python
data_files:
- split: test
path: duplicate_python/test-*
language:
- en
size_categories:
- 1K<n<10K
---
## Dataset Description
- **Repository:** [GitHub Repository](https://kgxqr.github.io/)
提供机构:
FudanSELab
原始信息汇总
数据集描述
配置信息
duplicate_csharp
- 特征:
query: 字符串类型relevant: 字符串序列
- 分割:
test:- 字节数: 91485
- 样本数: 1200
- 下载大小: 61619 字节
- 数据集大小: 91485 字节
duplicate_java
- 特征:
query: 字符串类型relevant: 字符串序列
- 分割:
test:- 字节数: 102838
- 样本数: 1200
- 下载大小: 69239 字节
- 数据集大小: 102838 字节
duplicate_javascript
- 特征:
query: 字符串类型relevant: 字符串序列
- 分割:
test:- 字节数: 107321
- 样本数: 1200
- 下载大小: 69456 字节
- 数据集大小: 107321 字节
duplicate_python
- 特征:
query: 字符串类型relevant: 字符串序列
- 分割:
test:- 字节数: 109709
- 样本数: 1200
- 下载大小: 73833 字节
- 数据集大小: 109709 字节
数据文件
- duplicate_csharp:
test:duplicate_csharp/test-*
- duplicate_java:
test:duplicate_java/test-*
- duplicate_javascript:
test:duplicate_javascript/test-*
- duplicate_python:
test:duplicate_python/test-*
语言
- 英语
大小类别
- 1K < n < 10K



