patrykbart/code_search_net_tree_enhanced_with_language
收藏Hugging Face2025-01-24 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/patrykbart/code_search_net_tree_enhanced_with_language
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如语言类型、输入ID、注意力掩码、深度和兄弟索引等。数据集被划分为训练集、测试集和验证集,分别包含大约135万、7万和6万3千多条样本。数据集的总大小约为20GB。
The dataset contains multiple fields such as language type, input IDs, attention masks, depths, and sibling indices. It is split into training, test, and validation sets, containing approximately 1,350,000, 72,000, and 63,000 samples respectively. The total size of the dataset is about 20GB.
提供机构:
patrykbart



