python-edu-annotations
ModelScope community. Updated 2025-12-05. Indexed 2025-11-03.
Download link:
https://modelscope.cn/datasets/HuggingFaceTB/python-edu-annotations
Description:
## Annotations for 📚 Python-Edu classifier
This dataset contains the annotations used to train the [Python-Edu](https://huggingface.co/datasets/HuggingFaceTB/smollm-corpus) educational quality [classifier](https://huggingface.co/HuggingFaceTB/python-edu-scorer). We prompt [Llama-3-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) to score Python programs from [StarCoderData](https://huggingface.co/datasets/bigcode/starcoderdata) based on their educational value.
**Note:** the dataset contains the Python program, the prompt (using the first 1000 characters of the program), and the scores, but it does not contain the full Llama 3 generation.
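
As a rough illustration of the truncation described in the note, the sketch below builds a prompt from only the first 1000 characters of a program. The template wording and the names `PROMPT_TEMPLATE` and `build_prompt` are hypothetical; the actual prompt sent to Llama-3-70B-Instruct is stored in the dataset itself and is not reproduced here.

```python
# Minimal sketch, assuming a simple text template.
# Only the 1000-character limit comes from the dataset description;
# the template text and function name are illustrative assumptions.

MAX_PROMPT_CHARS = 1000

# Hypothetical template, NOT the prompt actually used for annotation.
PROMPT_TEMPLATE = (
    "Rate the educational value of the following Python program.\n\n"
    "{extract}\n"
)


def build_prompt(program: str, max_chars: int = MAX_PROMPT_CHARS) -> str:
    """Return a scoring prompt that embeds at most `max_chars` characters of `program`."""
    extract = program[:max_chars]  # slicing is safe even when the program is shorter
    return PROMPT_TEMPLATE.format(extract=extract)


if __name__ == "__main__":
    long_program = "print('hello')\n" * 500  # well over 1000 characters
    prompt = build_prompt(long_program)
    print(len(long_program), len(prompt))
```

Because only a prefix of each file is scored, a program whose instructive parts appear after the first 1000 characters may receive a score that does not reflect the whole file.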
Provider:
maas
Created:
2025-09-08



