
styal/filtered-python-edu

Source: Hugging Face. Updated 2026-02-16. Indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/styal/filtered-python-edu
Description:
---
dataset_info:
  features:
  - name: blob_id
    dtype: string
  - name: repo_name
    dtype: string
  - name: path
    dtype: string
  - name: length_bytes
    dtype: int64
  - name: score
    dtype: float64
  - name: int_score
    dtype: int64
  - name: text
    dtype: string
  - name: is_english
    dtype: bool
  splits:
  - name: train
    num_bytes: 1226088650
    num_examples: 1018270
  download_size: 626200552
  dataset_size: 1226088650
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---

Dataset created from Avelina/python-edu with this filter:

```
def my_filter(example):
    # Keep only high-scoring samples.
    score = example["score"] > 4.1
    # Keep only files strictly between 200 and 3000 bytes.
    length = 200 < example["length_bytes"] < 3000
    return score and length
```

An is_english boolean column has been added using the model papluca/xlm-roberta-base-language-detection.
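To make the filter's boundaries concrete, here is a minimal sketch that applies the same predicate to a few in-memory records. The sample rows and their values are invented for illustration and are not taken from the dataset; only the `score` and `length_bytes` field names come from the card.

```python
def my_filter(example):
    # Same thresholds as the card: score above 4.1,
    # length strictly between 200 and 3000 bytes.
    high_score = example["score"] > 4.1
    right_length = 200 < example["length_bytes"] < 3000
    return high_score and right_length


# Hypothetical rows, made up to exercise each condition.
samples = [
    {"blob_id": "a", "score": 4.5, "length_bytes": 1200},  # passes both checks
    {"blob_id": "b", "score": 3.9, "length_bytes": 1200},  # score too low
    {"blob_id": "c", "score": 4.8, "length_bytes": 150},   # file too short
    {"blob_id": "d", "score": 4.2, "length_bytes": 3000},  # 3000 is excluded
]

kept = [row["blob_id"] for row in samples if my_filter(row)]
print(kept)
```

With the Hugging Face `datasets` library, a predicate of this shape can presumably be passed to `Dataset.filter` in the same way, and a further pass on the `is_english` column would restrict the result to English-language samples.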
Provider:
styal