mike-ravkine/rosettacode-parsed
收藏Hugging Face2023-06-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/mike-ravkine/rosettacode-parsed
下载链接
链接失效反馈官方服务:
资源简介:
---
license: gfdl
task_categories:
- text-generation
language:
- en
- code
---
## Data Origins
Original dataset: https://huggingface.co/datasets/jondurbin/rosettacode-raw/
Cleaner code: https://github.com/the-crypt-keeper/rosettacode-parser
## Data Fields
|Field|Type|Description|
|---|---|---|
|title|string|problem title|
|task|string|problem description|
|language|string|solution language/variant|
|soulution|string|solution source code|
## Languages
One .jsonl is provided per language group, the sublanguage field in the data denotes the specific language version/variant or the source language the example was ported from.
```
Language Python problems 510 rows 621
Language C problems 350 rows 350
Language C++ problems 403 rows 416
Language C sharp problems 322 rows 342
Language Go problems 496 rows 503
Language JavaScript problems 269 rows 301
Language Java problems 470 rows 512
Language Lua problems 335 rows 339
Language Kotlin problems 435 rows 435
Language Ruby problems 418 rows 444
Total 4894 done 565 skip 4329 failed 0 rows 4263
```
提供机构:
mike-ravkine
原始信息汇总
数据集概述
数据集来源
- 原始数据集:jondurbin/rosettacode-raw
- 数据清洗代码:rosettacode-parser
数据字段
| 字段 | 类型 | 描述 |
|---|---|---|
| title | string | 问题标题 |
| task | string | 问题描述 |
| language | string | 解决方案语言/变种 |
| solution | string | 解决方案源代码 |
语言分布
- Python: 问题数510,行数621
- C: 问题数350,行数350
- C++: 问题数403,行数416
- C sharp: 问题数322,行数342
- Go: 问题数496,行数503
- JavaScript: 问题数269,行数301
- Java: 问题数470,行数512
- Lua: 问题数335,行数339
- Kotlin: 问题数435,行数435
- Ruby: 问题数418,行数444
数据集统计
- 完成:4894
- 跳过:565
- 失败:4329
- 总行数:4263



