five

argilla-warehouse/python-seed-tools

收藏
Hugging Face2024-10-01 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/argilla-warehouse/python-seed-tools
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: func_name dtype: string - name: func_desc dtype: string - name: tools dtype: string splits: - name: train num_bytes: 21930734 num_examples: 41028 download_size: 7990917 dataset_size: 21930734 configs: - config_name: default data_files: - split: train path: data/train-* license: apache-2.0 task_categories: - text-generation language: - en tags: - synthetic - distilabel pretty_name: Python Seed Tools size_categories: - 10K<n<100K --- # Dataset card for argilla-warehouse/python-seed-tools This dataset consists of function names, descriptions and their tool definitions to be used as seeds for an ["APIGen like"](https://huggingface.co/datasets/Salesforce/xlam-function-calling-60k) dataset. These are the seed functions used for the following datasets: - [argilla-warehouse/synth-apigen-llama](https://huggingface.co/datasets/argilla-warehouse/synth-apigen-llama) - [argilla-warehouse/synth-apigen-qwen](https://huggingface.co/datasets/argilla-warehouse/synth-apigen-qwen) It was built using the following script: [create_seed_dataset.py](https://huggingface.co/datasets/argilla-warehouse/python-seed-tools/blob/main/create_seed_dataset.py), using the tools defined in the [tools.jsonl](https://huggingface.co/datasets/argilla-warehouse/python-seed-tools/blob/main/tools/tools.jsonl) file. Take a look at [argilla-warehouse/python-lib-tools-v0.1](https://huggingface.co/datasets/argilla-warehouse/python-lib-tools-v0.1) to see how these tools were obtained in the first place. ## Dataset structure ```json { "func_name": "create_positive_negative_train_and_test_set", "func_desc": "Creates a positive and negative training and test set from a data set.", "tools": "[{\"type\": \"function\", \"function\": {\"name\": \"create_positive_negative_train_and_test_set\", \"description\": \"Creates a positive and negative training and test set from a data set.\", \"parameters\": {\"type\": \"object\", \"properties\": {\"data\": {\"type\": \"object\", \"description\": \"The input data set containing 'label' and 'split' columns.\"}}, \"required\": [\"data\"]}}}]" } ```
提供机构:
argilla-warehouse
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作