
ObscuraCoder/research_code

Source: Hugging Face. Updated 2024-02-20. Indexed 2025-04-26.
Download link:
https://hf-mirror.com/datasets/ObscuraCoder/research_code
Resource description:
---
dataset_info:
- config_name: c
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 10791957059.976545
    num_examples: 802216
  download_size: 1425530604
  dataset_size: 10791957059.976545
- config_name: cpp
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 5309598856.331665
    num_examples: 394687
  download_size: 608841955
  dataset_size: 5309598856.331665
- config_name: go
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 957925155.2820561
    num_examples: 71207
  download_size: 109696524
  dataset_size: 957925155.2820561
- config_name: haskell
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 58747864.01781762
    num_examples: 4367
  download_size: 5301713
  dataset_size: 58747864.01781762
- config_name: java
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 5608208047.447648
    num_examples: 416884
  download_size: 430136156
  dataset_size: 5608208047.447648
- config_name: python
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 7255394837.906463
    num_examples: 539327
  download_size: 965554391
  dataset_size: 7255394837.906463
- config_name: rust
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 226812225.1752702
    num_examples: 16860
  download_size: 30445667
  dataset_size: 226812225.1752702
- config_name: typescript
  features:
  - name: repo
    dtype: string
  - name: file
    dtype: string
  - name: code
    dtype: string
  - name: file_length
    dtype: int64
  - name: avg_line_length
    dtype: float64
  - name: max_line_length
    dtype: int64
  - name: extension_type
    dtype: string
  splits:
  - name: train
    num_bytes: 374939711.0237222
    num_examples: 27871
  download_size: 28836400
  dataset_size: 374939711.0237222
configs:
- config_name: c
  data_files:
  - split: train
    path: c/train-*
- config_name: cpp
  data_files:
  - split: train
    path: cpp/train-*
- config_name: go
  data_files:
  - split: train
    path: go/train-*
- config_name: haskell
  data_files:
  - split: train
    path: haskell/train-*
- config_name: java
  data_files:
  - split: train
    path: java/train-*
- config_name: python
  data_files:
  - split: train
    path: python/train-*
- config_name: rust
  data_files:
  - split: train
    path: rust/train-*
- config_name: typescript
  data_files:
  - split: train
    path: typescript/train-*
---
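The metadata above lists eight per-language configurations, each with a single `train` split. As a minimal sketch of how one of them might be loaded, the snippet below assumes the Hugging Face `datasets` library is installed and that the dataset is reachable under the repository ID shown in the title. The helper names `total_examples` and `load_language` are hypothetical and are not part of the dataset or of any library.

```python
# Example counts copied from the `num_examples` fields in the metadata above.
NUM_EXAMPLES = {
    "c": 802216,
    "cpp": 394687,
    "go": 71207,
    "haskell": 4367,
    "java": 416884,
    "python": 539327,
    "rust": 16860,
    "typescript": 27871,
}

# Repository ID taken from the page title.
REPO_ID = "ObscuraCoder/research_code"


def total_examples() -> int:
    """Sum the listed example counts over all configurations."""
    return sum(NUM_EXAMPLES.values())


def load_language(config: str, streaming: bool = True):
    """Load the train split of one configuration (hypothetical helper).

    Streaming is on by default here because the listed download sizes
    range up to about 1.4 GB per configuration; this is a convenience
    choice, not something the dataset card prescribes.
    """
    if config not in NUM_EXAMPLES:
        raise ValueError(
            f"unknown config {config!r}; expected one of {sorted(NUM_EXAMPLES)}"
        )
    # Imported lazily so the metadata helpers above work without `datasets`.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split="train", streaming=streaming)
```

Calling `load_language("rust")` should yield records with the fields listed in the metadata (`repo`, `file`, `code`, `file_length`, `avg_line_length`, `max_line_length`, `extension_type`), provided the hosted data still matches this card. `total_examples()` gives 2,273,419 across all eight configurations.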
Provided by:
ObscuraCoder