five

AISE-TUDelft/multilingual-code-comments-fixed-6

收藏
Hugging Face2026-03-14 更新2026-04-05 收录
下载链接:
https://hf-mirror.com/datasets/AISE-TUDelft/multilingual-code-comments-fixed-6
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: Chinese features: - name: file_id dtype: string - name: content dtype: string - name: repo dtype: string - name: path dtype: string - name: original_comment dtype: string - name: masked_data_Qwen/CodeQwen1.5-7B dtype: string - name: predict_Qwen/CodeQwen1.5-7B dtype: string - name: predicted_comment_Qwen/CodeQwen1.5-7B dtype: string - name: masked_data_bigcode/starcoder2-7b dtype: string - name: predict_bigcode/starcoder2-7b dtype: string - name: predicted_comment_bigcode/starcoder2-7b dtype: string - name: masked_data_ibm-granite/granite-8b-code-base dtype: string - name: predict_ibm-granite/granite-8b-code-base dtype: string - name: predicted_comment_ibm-granite/granite-8b-code-base dtype: string - name: masked_data_meta-llama/CodeLlama-7b-hf dtype: string - name: predict_meta-llama/CodeLlama-7b-hf dtype: string - name: predicted_comment_meta-llama/CodeLlama-7b-hf dtype: string - name: masked_data_google/codegemma-7b dtype: string - name: predict_google/codegemma-7b dtype: string - name: predicted_comment_google/codegemma-7b dtype: string - name: expert_accuracy_Qwen/CodeQwen1.5-7B dtype: string - name: error_codes_Qwen/CodeQwen1.5-7B dtype: string - name: expert_accuracy_bigcode/starcoder2-7b dtype: string - name: error_codes_bigcode/starcoder2-7b dtype: string - name: expert_accuracy_google/codegemma-7b dtype: string - name: error_codes_google/codegemma-7b dtype: string - name: expert_accuracy_ibm-granite/granite-8b-code-base dtype: string - name: error_codes_ibm-granite/granite-8b-code-base dtype: string - name: expert_accuracy_meta-llama/CodeLlama-7b-hf dtype: string - name: error_codes_meta-llama/CodeLlama-7b-hf dtype: string splits: - name: train num_bytes: 21795233 num_examples: 500 download_size: 8998671 dataset_size: 21795233 - config_name: Dutch features: - name: file_id dtype: string - name: content dtype: string - name: repo dtype: string - name: path dtype: string - name: original_comment dtype: string - name: masked_data_Qwen/CodeQwen1.5-7B dtype: string - name: predict_Qwen/CodeQwen1.5-7B dtype: string - name: predicted_comment_Qwen/CodeQwen1.5-7B dtype: string - name: masked_data_bigcode/starcoder2-7b dtype: string - name: expert_accuracy_Qwen/CodeQwen1.5-7B dtype: string - name: error_codes_Qwen/CodeQwen1.5-7B dtype: string - name: predict_bigcode/starcoder2-7b dtype: string - name: predicted_comment_bigcode/starcoder2-7b dtype: string - name: masked_data_ibm-granite/granite-8b-code-base dtype: string - name: expert_accuracy_bigcode/starcoder2-7b dtype: string - name: error_codes_bigcode/starcoder2-7b dtype: string - name: predict_ibm-granite/granite-8b-code-base dtype: string - name: predicted_comment_ibm-granite/granite-8b-code-base dtype: string - name: masked_data_meta-llama/CodeLlama-7b-hf dtype: string - name: expert_accuracy_ibm-granite/granite-8b-code-base dtype: string - name: error_codes_ibm-granite/granite-8b-code-base dtype: string - name: predict_meta-llama/CodeLlama-7b-hf dtype: string - name: predicted_comment_meta-llama/CodeLlama-7b-hf dtype: string - name: masked_data_google/codegemma-7b dtype: string - name: expert_accuracy_meta-llama/CodeLlama-7b-hf dtype: string - name: error_codes_meta-llama/CodeLlama-7b-hf dtype: string - name: predict_google/codegemma-7b dtype: string - name: predicted_comment_google/codegemma-7b dtype: string - name: expert_accuracy_google/codegemma-7b dtype: string - name: error_codes_google/codegemma-7b dtype: string splits: - name: train num_bytes: 24649260 num_examples: 500 download_size: 9132990 dataset_size: 24649260 - config_name: English features: - name: file_id dtype: string - name: content dtype: string - name: repo dtype: string - name: path dtype: string - name: original_comment dtype: string - name: masked_data_Qwen/CodeQwen1.5-7B dtype: string - name: predict_Qwen/CodeQwen1.5-7B dtype: string - name: predicted_comment_Qwen/CodeQwen1.5-7B dtype: string - name: masked_data_bigcode/starcoder2-7b dtype: string - name: predict_bigcode/starcoder2-7b dtype: string - name: predicted_comment_bigcode/starcoder2-7b dtype: string - name: masked_data_ibm-granite/granite-8b-code-base dtype: string - name: predict_ibm-granite/granite-8b-code-base dtype: string - name: predicted_comment_ibm-granite/granite-8b-code-base dtype: string - name: masked_data_meta-llama/CodeLlama-7b-hf dtype: string - name: predict_meta-llama/CodeLlama-7b-hf dtype: string - name: predicted_comment_meta-llama/CodeLlama-7b-hf dtype: string - name: masked_data_google/codegemma-7b dtype: string - name: predict_google/codegemma-7b dtype: string - name: predicted_comment_google/codegemma-7b dtype: string - name: error_codes_Qwen/CodeQwen1.5-7B dtype: string - name: expert_accuracy_Qwen/CodeQwen1.5-7B dtype: string - name: error_codes_bigcode/starcoder2-7b dtype: string - name: expert_accuracy_bigcode/starcoder2-7b dtype: string - name: error_codes_ibm-granite/granite-8b-code-base dtype: string - name: expert_accuracy_ibm-granite/granite-8b-code-base dtype: string - name: error_codes_meta-llama/CodeLlama-7b-hf dtype: string - name: expert_accuracy_meta-llama/CodeLlama-7b-hf dtype: string - name: error_codes_google/codegemma-7b dtype: string - name: expert_accuracy_google/codegemma-7b dtype: string splits: - name: train num_bytes: 20650722 num_examples: 500 download_size: 8170649 dataset_size: 20650722 - config_name: Greek features: - name: file_id dtype: string - name: content dtype: string - name: repo dtype: string - name: path dtype: string - name: original_comment dtype: string - name: masked_data_Qwen/CodeQwen1.5-7B dtype: string - name: predict_Qwen/CodeQwen1.5-7B dtype: string - name: predicted_comment_Qwen/CodeQwen1.5-7B dtype: string - name: masked_data_bigcode/starcoder2-7b dtype: string - name: predict_bigcode/starcoder2-7b dtype: string - name: predicted_comment_bigcode/starcoder2-7b dtype: string - name: masked_data_ibm-granite/granite-8b-code-base dtype: string - name: predict_ibm-granite/granite-8b-code-base dtype: string - name: predicted_comment_ibm-granite/granite-8b-code-base dtype: string - name: masked_data_meta-llama/CodeLlama-7b-hf dtype: string - name: predict_meta-llama/CodeLlama-7b-hf dtype: string - name: predicted_comment_meta-llama/CodeLlama-7b-hf dtype: string - name: masked_data_google/codegemma-7b dtype: string - name: predict_google/codegemma-7b dtype: string - name: predicted_comment_google/codegemma-7b dtype: string - name: error_codes_bigcode/starcoder2-7b dtype: string - name: error_codes_ibm-granite/granite-8b-code-base dtype: string - name: error_codes_meta-llama/CodeLlama-7b-hf dtype: string - name: error_codes_google/codegemma-7b dtype: string - name: error_codes_Qwen/CodeQwen1.5-7B dtype: string - name: expert_accuracy_bigcode/starcoder2-7b dtype: string - name: expert_accuracy_ibm-granite/granite-8b-code-base dtype: string - name: expert_accuracy_meta-llama/CodeLlama-7b-hf dtype: string - name: expert_accuracy_google/codegemma-7b dtype: string - name: expert_accuracy_Qwen/CodeQwen1.5-7B dtype: string splits: - name: train num_bytes: 25755336 num_examples: 500 download_size: 9211744 dataset_size: 25755336 - config_name: Polish features: - name: file_id dtype: string - name: repo dtype: string - name: path dtype: string - name: content dtype: string - name: original_comment dtype: string - name: masked_data_Qwen/CodeQwen1.5-7B dtype: string - name: predict_Qwen/CodeQwen1.5-7B dtype: string - name: predicted_comment_Qwen/CodeQwen1.5-7B dtype: string - name: masked_data_bigcode/starcoder2-7b dtype: string - name: predict_bigcode/starcoder2-7b dtype: string - name: predicted_comment_bigcode/starcoder2-7b dtype: string - name: masked_data_ibm-granite/granite-8b-code-base dtype: string - name: predict_ibm-granite/granite-8b-code-base dtype: string - name: predicted_comment_ibm-granite/granite-8b-code-base dtype: string - name: masked_data_meta-llama/CodeLlama-7b-hf dtype: string - name: predict_meta-llama/CodeLlama-7b-hf dtype: string - name: predicted_comment_meta-llama/CodeLlama-7b-hf dtype: string - name: masked_data_google/codegemma-7b dtype: string - name: predict_google/codegemma-7b dtype: string - name: predicted_comment_google/codegemma-7b dtype: string - name: error_codes_Qwen/CodeQwen1.5-7B dtype: string - name: expert_accuracy_Qwen/CodeQwen1.5-7B dtype: string - name: error_codes_bigcode/starcoder2-7b dtype: string - name: expert_accuracy_bigcode/starcoder2-7b dtype: string - name: error_codes_ibm-granite/granite-8b-code-base dtype: string - name: expert_accuracy_ibm-granite/granite-8b-code-base dtype: string - name: error_codes_meta-llama/CodeLlama-7b-hf dtype: string - name: expert_accuracy_meta-llama/CodeLlama-7b-hf dtype: string - name: error_codes_google/codegemma-7b dtype: string - name: expert_accuracy_google/codegemma-7b dtype: string splits: - name: train num_bytes: 17931795 num_examples: 500 download_size: 7295966 dataset_size: 17931795 configs: - config_name: Chinese data_files: - split: train path: Chinese/train-* - config_name: Dutch data_files: - split: train path: Dutch/train-* - config_name: English data_files: - split: train path: English/train-* - config_name: Greek data_files: - split: train path: Greek/train-* - config_name: Polish data_files: - split: train path: Polish/train-* ---
提供机构:
AISE-TUDelft
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作