five

markytools/goosyntheticv3

收藏
Hugging Face2023-06-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/markytools/goosyntheticv3
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: image dtype: image - name: split dtype: string - name: width dtype: int64 - name: height dtype: int64 - name: bboxes dtype: string - name: labels dtype: string - name: cab dtype: int64 - name: hum dtype: int64 - name: light dtype: float64 - name: cam dtype: int64 - name: env dtype: int64 - name: gaze_item dtype: int64 - name: gazeIdx dtype: int64 - name: gaze_cx dtype: int64 - name: gaze_cy dtype: int64 - name: hx dtype: int64 - name: hy dtype: int64 - name: pitch dtype: float64 - name: yaw dtype: float64 - name: roll dtype: float64 - name: seg dtype: string - name: segm_gazeIdx dtype: int64 - name: occluded dtype: int64 splits: - name: train num_bytes: 99500978350.0 num_examples: 172800 - name: test num_bytes: 11081866319.6 num_examples: 19200 download_size: 110113558133 dataset_size: 110582844669.6 --- The dataset features/columns here are almost similar to the original github instruction (please read the github documentation first to understand the dataset): https://github.com/upeee/GOO-GAZE2021/blob/main/dataset/goosynth-download.txt To download goosynthtrain in huggingface, run the code below (https://huggingface.co/docs/datasets/v1.10.0/loading_datasets.html#from-the-huggingface-hub): from datasets import load_dataset</br> dataset = load_dataset("markytools/goosyntheticv3") The image datasets will be stored in ""~/.cache/huggingface", so you need to delete the files here if you want to free up space. The only difference here is that there is a new feature name called "splits", ["train", "test"] </br> The "bboxes" and "labels" features are in string format, so you can use the code below to convert the string into list:</br> import ast</br> listOfBboxes = ast.literal_eval(dataset["test"]["bboxes"][0])</br> </br> The feature "seg" is now in string format instead of numpy ndarray. This is an optional feature, and you can manually download the files here (https://huggingface.co/datasets/markytools/goosegmv3) using wget commandline. The files are in .npy so load it using np.load (https://numpy.org/doc/stable/reference/generated/numpy.load.html).
提供机构:
markytools
原始信息汇总

数据集概述

数据集特征

  • image: 图像数据
  • split: 字符串,表示数据分割类型
  • width: 整数,图像宽度
  • height: 整数,图像高度
  • bboxes: 字符串,边界框数据
  • labels: 字符串,标签数据
  • cab: 整数
  • hum: 整数
  • light: 浮点数
  • cam: 整数
  • env: 整数
  • gaze_item: 整数
  • gazeIdx: 整数
  • gaze_cx: 整数
  • gaze_cy: 整数
  • hx: 整数
  • hy: 整数
  • pitch: 浮点数
  • yaw: 浮点数
  • roll: 浮点数
  • seg: 字符串,分割数据
  • segm_gazeIdx: 整数
  • occluded: 整数

数据分割

  • train: 训练集,包含172800个样本,总大小为99500978350.0字节
  • test: 测试集,包含19200个样本,总大小为11081866319.6字节

数据集大小

  • 下载大小: 110113558133字节
  • 数据集总大小: 110582844669.6字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作