
Voxel51/Coursera_homework_dataset_train

Source: Hugging Face. Updated: 2024-07-31. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/Voxel51/Coursera_homework_dataset_train
Resource description:
---
annotations_creators: []
language: en
size_categories:
- 10K<n<100K
task_categories:
- object-detection
task_ids: []
pretty_name: homework_dataset_train
tags:
- fiftyone
- image
- object-detection
---

# Dataset Card for Homework Training Set for Coursera MOOC - Hands-on Data Centric Visual AI

This dataset is the **training dataset for the homework assignments** of the Hands-on Data Centric AI Coursera course.

This is a [FiftyOne](https://github.com/voxel51/fiftyone) dataset with 18287 samples.

## Installation

If you haven't already, install FiftyOne:

```bash
pip install -U fiftyone
```

## Usage

```python
import fiftyone as fo
import fiftyone.utils.huggingface as fouh

# Load the dataset
# Note: other available arguments include 'max_samples', etc
dataset = fouh.load_from_hub("Voxel51/Coursera_homework_dataset_train")

# Launch the App
session = fo.launch_app(dataset)
```

## Dataset Details

### Dataset Description

This dataset is a modified subset of the [LVIS dataset](https://www.lvisdataset.org/).

It contains only detections, some of which have been artificially perturbed and altered to demonstrate data centric AI techniques and methodologies for the course.

This dataset has the following labels:

- 'bolt'
- 'knob'
- 'tag'
- 'button'
- 'bottle_cap'
- 'belt'
- 'strap'
- 'necktie'
- 'shirt'
- 'sweater'
- 'streetlight'
- 'pole'
- 'reflector'
- 'headlight'
- 'taillight'
- 'traffic_light'
- 'rearview_mirror'

### Dataset Sources

- **Repository:** https://www.lvisdataset.org/
- **Paper:** https://arxiv.org/abs/1908.03195

## Uses

The labels in this dataset have been perturbed to illustrate data centric AI techniques for the Hands-on Data Centric AI Coursera MOOC.

## Dataset Structure

Each image in the dataset comes with detailed annotations in FiftyOne detection format. A typical annotation looks like this:

```python
<Detection: {
    'id': '66a2f24cce2f9d11d98d3a21',
    'attributes': {},
    'tags': [],
    'label': 'shirt',
    'bounding_box': [
        0.25414,
        0.35845238095238097,
        0.041960000000000004,
        0.051011904761904765,
    ],
    'mask': None,
    'confidence': None,
    'index': None,
}>
```

## Dataset Creation

### Curation Rationale

These labels were selected because the objects they describe are easily confused by a model, which makes them a good choice for demonstrating data centric AI techniques.

### Source Data

This is a subset of the [LVIS dataset](https://www.lvisdataset.org/).

## Citation

**BibTeX:**

```bibtex
@inproceedings{gupta2019lvis,
  title={{LVIS}: A Dataset for Large Vocabulary Instance Segmentation},
  author={Gupta, Agrim and Dollar, Piotr and Girshick, Ross},
  booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},
  year={2019}
}
```
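
The `bounding_box` in the Dataset Structure example is stored in FiftyOne's relative format: `[top-left-x, top-left-y, width, height]`, each value a fraction of the image size in `[0, 1]`. The sketch below shows one way to turn such a box into pixel corner coordinates. The helper name `to_pixel_box` and the 640x480 image size are illustrative assumptions, not part of the dataset card or the FiftyOne API.

```python
from typing import Sequence, Tuple


def to_pixel_box(
    bounding_box: Sequence[float],
    image_width: int,
    image_height: int,
) -> Tuple[int, int, int, int]:
    """Convert a relative [x, y, w, h] box to pixel (x1, y1, x2, y2).

    `to_pixel_box` is a hypothetical helper written for this example.
    """
    x, y, w, h = bounding_box
    # Scale horizontal values by the width and vertical values by the height.
    x1 = round(x * image_width)
    y1 = round(y * image_height)
    x2 = round((x + w) * image_width)
    y2 = round((y + h) * image_height)
    return x1, y1, x2, y2


# Box copied from the 'shirt' annotation shown in the card.
shirt_box = [
    0.25414,
    0.35845238095238097,
    0.041960000000000004,
    0.051011904761904765,
]

# The image size here is assumed; the real size comes from each sample's metadata.
print(to_pixel_box(shirt_box, image_width=640, image_height=480))
```

Keeping boxes in relative units means the same annotation stays valid if an image is resized; the pixel conversion is only needed when drawing or cropping.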
Provided by:
Voxel51