yjernite/prof_report__SD_v1.4_random_seeds__sd_21__24
Source: Hugging Face. Updated 2023-06-03; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/yjernite/prof_report__SD_v1.4_random_seeds__sd_21__24
Description:
---
dataset_info:
  features:
  - name: cluster_id
    dtype: int64
  - name: cluster_size
    dtype: int64
  - name: img_ids
    sequence: int64
  - name: img_cluster_scores
    sequence: float64
  splits:
  - name: paralegal
    num_bytes: 3600
    num_examples: 10
  - name: bartender
    num_bytes: 3504
    num_examples: 6
  - name: facilities_manager
    num_bytes: 3600
    num_examples: 10
  - name: accountant
    num_bytes: 3600
    num_examples: 10
  - name: graphic_designer
    num_bytes: 3672
    num_examples: 13
  - name: network_administrator
    num_bytes: 3408
    num_examples: 2
  - name: financial_manager
    num_bytes: 3624
    num_examples: 11
  - name: baker
    num_bytes: 3720
    num_examples: 15
  - name: security_guard
    num_bytes: 3648
    num_examples: 12
  - name: artist
    num_bytes: 3840
    num_examples: 20
  - name: author
    num_bytes: 3600
    num_examples: 10
  - name: printing_press_operator
    num_bytes: 3552
    num_examples: 8
  - name: public_relations_specialist
    num_bytes: 3648
    num_examples: 12
  - name: sheet_metal_worker
    num_bytes: 3576
    num_examples: 9
  - name: clergy
    num_bytes: 3648
    num_examples: 12
  - name: payroll_clerk
    num_bytes: 3552
    num_examples: 8
  - name: teller
    num_bytes: 3816
    num_examples: 19
  - name: real_estate_broker
    num_bytes: 3552
    num_examples: 8
  - name: customer_service_representative
    num_bytes: 3600
    num_examples: 10
  - name: painter
    num_bytes: 3768
    num_examples: 17
  - name: tractor_operator
    num_bytes: 3480
    num_examples: 5
  - name: dental_hygienist
    num_bytes: 3504
    num_examples: 6
  - name: industrial_engineer
    num_bytes: 3576
    num_examples: 9
  - name: electrician
    num_bytes: 3480
    num_examples: 5
  - name: head_cook
    num_bytes: 3744
    num_examples: 16
  - name: health_technician
    num_bytes: 3600
    num_examples: 10
  - name: carpet_installer
    num_bytes: 3456
    num_examples: 4
  - name: purchasing_agent
    num_bytes: 3624
    num_examples: 11
  - name: supervisor
    num_bytes: 3696
    num_examples: 14
  - name: civil_engineer
    num_bytes: 3648
    num_examples: 12
  - name: lawyer
    num_bytes: 3720
    num_examples: 15
  - name: language_pathologist
    num_bytes: 3600
    num_examples: 10
  - name: ceo
    num_bytes: 3672
    num_examples: 13
  - name: computer_support_specialist
    num_bytes: 3600
    num_examples: 10
  - name: postal_worker
    num_bytes: 3672
    num_examples: 13
  - name: mechanical_engineer
    num_bytes: 3648
    num_examples: 12
  - name: nursing_assistant
    num_bytes: 3552
    num_examples: 8
  - name: dentist
    num_bytes: 3624
    num_examples: 11
  - name: tutor
    num_bytes: 3720
    num_examples: 15
  - name: butcher
    num_bytes: 3648
    num_examples: 12
  - name: insurance_agent
    num_bytes: 3528
    num_examples: 7
  - name: courier
    num_bytes: 3720
    num_examples: 15
  - name: computer_programmer
    num_bytes: 3624
    num_examples: 11
  - name: truck_driver
    num_bytes: 3504
    num_examples: 6
  - name: mechanic
    num_bytes: 3528
    num_examples: 7
  - name: marketing_manager
    num_bytes: 3528
    num_examples: 7
  - name: sales_manager
    num_bytes: 3528
    num_examples: 7
  - name: correctional_officer
    num_bytes: 3696
    num_examples: 14
  - name: manager
    num_bytes: 3648
    num_examples: 12
  - name: underwriter
    num_bytes: 3672
    num_examples: 13
  - name: executive_assistant
    num_bytes: 3600
    num_examples: 10
  - name: designer
    num_bytes: 3648
    num_examples: 12
  - name: groundskeeper
    num_bytes: 3480
    num_examples: 5
  - name: mental_health_counselor
    num_bytes: 3672
    num_examples: 13
  - name: aerospace_engineer
    num_bytes: 3648
    num_examples: 12
  - name: taxi_driver
    num_bytes: 3696
    num_examples: 14
  - name: nurse
    num_bytes: 3576
    num_examples: 9
  - name: data_entry_keyer
    num_bytes: 3624
    num_examples: 11
  - name: musician
    num_bytes: 3696
    num_examples: 14
  - name: event_planner
    num_bytes: 3552
    num_examples: 8
  - name: writer
    num_bytes: 3672
    num_examples: 13
  - name: cook
    num_bytes: 3792
    num_examples: 18
  - name: welder
    num_bytes: 3624
    num_examples: 11
  - name: producer
    num_bytes: 3744
    num_examples: 16
  - name: hairdresser
    num_bytes: 3600
    num_examples: 10
  - name: farmer
    num_bytes: 3528
    num_examples: 7
  - name: construction_worker
    num_bytes: 3504
    num_examples: 6
  - name: air_conditioning_installer
    num_bytes: 3432
    num_examples: 3
  - name: electrical_engineer
    num_bytes: 3648
    num_examples: 12
  - name: occupational_therapist
    num_bytes: 3624
    num_examples: 11
  - name: career_counselor
    num_bytes: 3600
    num_examples: 10
  - name: interior_designer
    num_bytes: 3624
    num_examples: 11
  - name: jailer
    num_bytes: 3744
    num_examples: 16
  - name: office_clerk
    num_bytes: 3624
    num_examples: 11
  - name: market_research_analyst
    num_bytes: 3576
    num_examples: 9
  - name: laboratory_technician
    num_bytes: 3624
    num_examples: 11
  - name: social_assistant
    num_bytes: 3744
    num_examples: 16
  - name: medical_records_specialist
    num_bytes: 3576
    num_examples: 9
  - name: machinery_mechanic
    num_bytes: 3552
    num_examples: 8
  - name: police_officer
    num_bytes: 3672
    num_examples: 13
  - name: software_developer
    num_bytes: 3528
    num_examples: 7
  - name: clerk
    num_bytes: 3720
    num_examples: 15
  - name: salesperson
    num_bytes: 3648
    num_examples: 12
  - name: social_worker
    num_bytes: 3744
    num_examples: 16
  - name: director
    num_bytes: 3720
    num_examples: 15
  - name: fast_food_worker
    num_bytes: 3696
    num_examples: 14
  - name: singer
    num_bytes: 3792
    num_examples: 18
  - name: metal_worker
    num_bytes: 3576
    num_examples: 9
  - name: cleaner
    num_bytes: 3792
    num_examples: 18
  - name: computer_systems_analyst
    num_bytes: 3600
    num_examples: 10
  - name: dental_assistant
    num_bytes: 3504
    num_examples: 6
  - name: psychologist
    num_bytes: 3696
    num_examples: 14
  - name: machinist
    num_bytes: 3648
    num_examples: 12
  - name: therapist
    num_bytes: 3648
    num_examples: 12
  - name: veterinarian
    num_bytes: 3576
    num_examples: 9
  - name: teacher
    num_bytes: 3720
    num_examples: 15
  - name: architect
    num_bytes: 3720
    num_examples: 15
  - name: office_worker
    num_bytes: 3672
    num_examples: 13
  - name: drywall_installer
    num_bytes: 3480
    num_examples: 5
  - name: nutritionist
    num_bytes: 3480
    num_examples: 5
  - name: librarian
    num_bytes: 3672
    num_examples: 13
  - name: childcare_worker
    num_bytes: 3576
    num_examples: 9
  - name: school_bus_driver
    num_bytes: 3696
    num_examples: 14
  - name: file_clerk
    num_bytes: 3600
    num_examples: 10
  - name: logistician
    num_bytes: 3576
    num_examples: 9
  - name: scientist
    num_bytes: 3648
    num_examples: 12
  - name: teaching_assistant
    num_bytes: 3672
    num_examples: 13
  - name: radiologic_technician
    num_bytes: 3600
    num_examples: 10
  - name: manicurist
    num_bytes: 3576
    num_examples: 9
  - name: community_manager
    num_bytes: 3576
    num_examples: 9
  - name: carpenter
    num_bytes: 3480
    num_examples: 5
  - name: claims_appraiser
    num_bytes: 3576
    num_examples: 9
  - name: dispatcher
    num_bytes: 3528
    num_examples: 7
  - name: cashier
    num_bytes: 3600
    num_examples: 10
  - name: roofer
    num_bytes: 3504
    num_examples: 6
  - name: photographer
    num_bytes: 3792
    num_examples: 18
  - name: detective
    num_bytes: 3648
    num_examples: 12
  - name: financial_advisor
    num_bytes: 3576
    num_examples: 9
  - name: wholesale_buyer
    num_bytes: 3672
    num_examples: 13
  - name: it_specialist
    num_bytes: 3552
    num_examples: 8
  - name: pharmacy_technician
    num_bytes: 3504
    num_examples: 6
  - name: engineer
    num_bytes: 3648
    num_examples: 12
  - name: mover
    num_bytes: 3768
    num_examples: 17
  - name: plane_mechanic
    num_bytes: 3624
    num_examples: 11
  - name: interviewer
    num_bytes: 3672
    num_examples: 13
  - name: massage_therapist
    num_bytes: 3624
    num_examples: 11
  - name: dishwasher
    num_bytes: 3672
    num_examples: 13
  - name: fitness_instructor
    num_bytes: 3600
    num_examples: 10
  - name: credit_counselor
    num_bytes: 3624
    num_examples: 11
  - name: stocker
    num_bytes: 3816
    num_examples: 19
  - name: pharmacist
    num_bytes: 3672
    num_examples: 13
  - name: doctor
    num_bytes: 3672
    num_examples: 13
  - name: compliance_officer
    num_bytes: 3648
    num_examples: 12
  - name: aide
    num_bytes: 3768
    num_examples: 17
  - name: bus_driver
    num_bytes: 3672
    num_examples: 13
  - name: financial_analyst
    num_bytes: 3624
    num_examples: 11
  - name: receptionist
    num_bytes: 3504
    num_examples: 6
  - name: janitor
    num_bytes: 3672
    num_examples: 13
  - name: plumber
    num_bytes: 3504
    num_examples: 6
  - name: physical_therapist
    num_bytes: 3600
    num_examples: 10
  - name: inventory_clerk
    num_bytes: 3552
    num_examples: 8
  - name: firefighter
    num_bytes: 3600
    num_examples: 10
  - name: coach
    num_bytes: 3696
    num_examples: 14
  - name: maid
    num_bytes: 3648
    num_examples: 12
  - name: pilot
    num_bytes: 3696
    num_examples: 14
  - name: repair_worker
    num_bytes: 3624
    num_examples: 11
  download_size: 871516
  dataset_size: 529248
---
# Dataset Card for "prof_report__SD_v1.4_random_seeds__sd_21__24"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
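# Dataset information (dataset_info)
## Features (features)
The dataset contains the following features:
1. Cluster ID (cluster_id): 64-bit integer (int64)
2. Cluster size (cluster_size): 64-bit integer (int64)
3. Image IDs (img_ids): a sequence of 64-bit integers (int64)
4. Image cluster scores (img_cluster_scores): a sequence of 64-bit floats (float64)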
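As a rough illustration of the four features listed above, a row can be modeled with a small Python dataclass and a type check. The class name `ClusterRow`, the helper `schema_problems`, and the sample values are hypothetical and only exercise the schema; the card does not document how the fields relate to one another, so no such relation is checked here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ClusterRow:
    """One row of a profession split, mirroring the four features on the card."""
    cluster_id: int                   # int64 on the card
    cluster_size: int                 # int64 on the card
    img_ids: List[int]                # sequence of int64
    img_cluster_scores: List[float]   # sequence of float64


def schema_problems(row: ClusterRow) -> List[str]:
    """Return a list of type problems; an empty list means the row fits the schema."""
    problems = []
    if not isinstance(row.cluster_id, int):
        problems.append("cluster_id is not an integer")
    if not isinstance(row.cluster_size, int):
        problems.append("cluster_size is not an integer")
    if not all(isinstance(i, int) for i in row.img_ids):
        problems.append("img_ids contains a non-integer")
    if not all(isinstance(s, float) for s in row.img_cluster_scores):
        problems.append("img_cluster_scores contains a non-float")
    return problems


# Hypothetical values, invented only to exercise the check.
example = ClusterRow(cluster_id=0, cluster_size=3,
                     img_ids=[4, 17, 42],
                     img_cluster_scores=[0.12, 0.34, 0.56])
print(schema_problems(example))
```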
## Splits (splits)
The dataset is divided by occupation into the following splits:
- paralegal: 3600 bytes, 10 examples
- bartender: 3504 bytes, 6 examples
- facilities_manager: 3600 bytes, 10 examples
- accountant: 3600 bytes, 10 examples
- graphic_designer: 3672 bytes, 13 examples
- network_administrator: 3408 bytes, 2 examples
- financial_manager: 3624 bytes, 11 examples
- baker: 3720 bytes, 15 examples
- security_guard: 3648 bytes, 12 examples
- artist: 3840 bytes, 20 examples
- author: 3600 bytes, 10 examples
- printing_press_operator: 3552 bytes, 8 examples
- public_relations_specialist: 3648 bytes, 12 examples
- sheet_metal_worker: 3576 bytes, 9 examples
- clergy: 3648 bytes, 12 examples
- payroll_clerk: 3552 bytes, 8 examples
- teller: 3816 bytes, 19 examples
- real_estate_broker: 3552 bytes, 8 examples
- customer_service_representative: 3600 bytes, 10 examples
- painter: 3768 bytes, 17 examples
- tractor_operator: 3480 bytes, 5 examples
- dental_hygienist: 3504 bytes, 6 examples
- industrial_engineer: 3576 bytes, 9 examples
- electrician: 3480 bytes, 5 examples
- head_cook: 3744 bytes, 16 examples
- health_technician: 3600 bytes, 10 examples
- carpet_installer: 3456 bytes, 4 examples
- purchasing_agent: 3624 bytes, 11 examples
- supervisor: 3696 bytes, 14 examples
- civil_engineer: 3648 bytes, 12 examples
- lawyer: 3720 bytes, 15 examples
- language_pathologist: 3600 bytes, 10 examples
- ceo: 3672 bytes, 13 examples
- computer_support_specialist: 3600 bytes, 10 examples
- postal_worker: 3672 bytes, 13 examples
- mechanical_engineer: 3648 bytes, 12 examples
- nursing_assistant: 3552 bytes, 8 examples
- dentist: 3624 bytes, 11 examples
- tutor: 3720 bytes, 15 examples
- butcher: 3648 bytes, 12 examples
- insurance_agent: 3528 bytes, 7 examples
- courier: 3720 bytes, 15 examples
- computer_programmer: 3624 bytes, 11 examples
- truck_driver: 3504 bytes, 6 examples
- mechanic: 3528 bytes, 7 examples
- marketing_manager: 3528 bytes, 7 examples
- sales_manager: 3528 bytes, 7 examples
- correctional_officer: 3696 bytes, 14 examples
- manager: 3648 bytes, 12 examples
- underwriter: 3672 bytes, 13 examples
- executive_assistant: 3600 bytes, 10 examples
- designer: 3648 bytes, 12 examples
- groundskeeper: 3480 bytes, 5 examples
- mental_health_counselor: 3672 bytes, 13 examples
- aerospace_engineer: 3648 bytes, 12 examples
- taxi_driver: 3696 bytes, 14 examples
- nurse: 3576 bytes, 9 examples
- data_entry_keyer: 3624 bytes, 11 examples
- musician: 3696 bytes, 14 examples
- event_planner: 3552 bytes, 8 examples
- writer: 3672 bytes, 13 examples
- cook: 3792 bytes, 18 examples
- welder: 3624 bytes, 11 examples
- producer: 3744 bytes, 16 examples
- hairdresser: 3600 bytes, 10 examples
- farmer: 3528 bytes, 7 examples
- construction_worker: 3504 bytes, 6 examples
- air_conditioning_installer: 3432 bytes, 3 examples
- electrical_engineer: 3648 bytes, 12 examples
- occupational_therapist: 3624 bytes, 11 examples
- career_counselor: 3600 bytes, 10 examples
- interior_designer: 3624 bytes, 11 examples
- jailer: 3744 bytes, 16 examples
- office_clerk: 3624 bytes, 11 examples
- market_research_analyst: 3576 bytes, 9 examples
- laboratory_technician: 3624 bytes, 11 examples
- social_assistant: 3744 bytes, 16 examples
- medical_records_specialist: 3576 bytes, 9 examples
- machinery_mechanic: 3552 bytes, 8 examples
- police_officer: 3672 bytes, 13 examples
- software_developer: 3528 bytes, 7 examples
- clerk: 3720 bytes, 15 examples
- salesperson: 3648 bytes, 12 examples
- social_worker: 3744 bytes, 16 examples
- director: 3720 bytes, 15 examples
- fast_food_worker: 3696 bytes, 14 examples
- singer: 3792 bytes, 18 examples
- metal_worker: 3576 bytes, 9 examples
- cleaner: 3792 bytes, 18 examples
- computer_systems_analyst: 3600 bytes, 10 examples
- dental_assistant: 3504 bytes, 6 examples
- psychologist: 3696 bytes, 14 examples
- machinist: 3648 bytes, 12 examples
- therapist: 3648 bytes, 12 examples
- veterinarian: 3576 bytes, 9 examples
- teacher: 3720 bytes, 15 examples
- architect: 3720 bytes, 15 examples
- office_worker: 3672 bytes, 13 examples
- drywall_installer: 3480 bytes, 5 examples
- nutritionist: 3480 bytes, 5 examples
- librarian: 3672 bytes, 13 examples
- childcare_worker: 3576 bytes, 9 examples
- school_bus_driver: 3696 bytes, 14 examples
- file_clerk: 3600 bytes, 10 examples
- logistician: 3576 bytes, 9 examples
- scientist: 3648 bytes, 12 examples
- teaching_assistant: 3672 bytes, 13 examples
- radiologic_technician: 3600 bytes, 10 examples
- manicurist: 3576 bytes, 9 examples
- community_manager: 3576 bytes, 9 examples
- carpenter: 3480 bytes, 5 examples
- claims_appraiser: 3576 bytes, 9 examples
- dispatcher: 3528 bytes, 7 examples
- cashier: 3600 bytes, 10 examples
- roofer: 3504 bytes, 6 examples
- photographer: 3792 bytes, 18 examples
- detective: 3648 bytes, 12 examples
- financial_advisor: 3576 bytes, 9 examples
- wholesale_buyer: 3672 bytes, 13 examples
- it_specialist: 3552 bytes, 8 examples
- pharmacy_technician: 3504 bytes, 6 examples
- engineer: 3648 bytes, 12 examples
- mover: 3768 bytes, 17 examples
- plane_mechanic: 3624 bytes, 11 examples
- interviewer: 3672 bytes, 13 examples
- massage_therapist: 3624 bytes, 11 examples
- dishwasher: 3672 bytes, 13 examples
- fitness_instructor: 3600 bytes, 10 examples
- credit_counselor: 3624 bytes, 11 examples
- stocker: 3816 bytes, 19 examples
- pharmacist: 3672 bytes, 13 examples
- doctor: 3672 bytes, 13 examples
- compliance_officer: 3648 bytes, 12 examples
- aide: 3768 bytes, 17 examples
- bus_driver: 3672 bytes, 13 examples
- financial_analyst: 3624 bytes, 11 examples
- receptionist: 3504 bytes, 6 examples
- janitor: 3672 bytes, 13 examples
- plumber: 3504 bytes, 6 examples
- physical_therapist: 3600 bytes, 10 examples
- inventory_clerk: 3552 bytes, 8 examples
- firefighter: 3600 bytes, 10 examples
- coach: 3696 bytes, 14 examples
- maid: 3648 bytes, 12 examples
- pilot: 3696 bytes, 14 examples
- repair_worker: 3624 bytes, 11 examples
The download size is 871516 bytes and the total dataset size is 529248 bytes.
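The per-split figures above follow a simple linear pattern: each listed split satisfies num_bytes = 3360 + 24 × num_examples. The card does not explain these constants, so the names `BASE_BYTES` and `BYTES_PER_EXAMPLE` below are labels chosen here for illustration, not documented quantities. The sketch checks the pattern on a handful of splits copied from the list.

```python
# (num_bytes, num_examples) copied from the split list for a few professions.
SPLITS = {
    "network_administrator": (3408, 2),
    "bartender": (3504, 6),
    "paralegal": (3600, 10),
    "baker": (3720, 15),
    "artist": (3840, 20),
}

BASE_BYTES = 3360        # constant part observed in every split (meaning undocumented)
BYTES_PER_EXAMPLE = 24   # observed growth per additional example


def predicted_num_bytes(num_examples: int) -> int:
    """Size predicted by the observed linear pattern."""
    return BASE_BYTES + BYTES_PER_EXAMPLE * num_examples


for name, (num_bytes, num_examples) in SPLITS.items():
    # Fails loudly if a split does not follow the pattern.
    assert predicted_num_bytes(num_examples) == num_bytes, name
```

The same check can be used as a quick sanity test when copying split metadata by hand: a mistyped byte count or example count will usually break the relation.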
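Provided by:
yjernite
Summary of original information
Dataset overview
Dataset name
- Name: prof_report__SD_v1.4_random_seeds__sd_21__24
Dataset size
- Download size: 871516 bytes
- Dataset size: 529248 bytes
Dataset features
- Feature list:
  - cluster_id: integer (int64)
  - cluster_size: integer (int64)
  - img_ids: sequence of integers (int64)
  - img_cluster_scores: sequence of floats (float64)
Dataset splits
- Split details:
  - Name: one split per occupation (146 in total)
  - Number of examples: varies by occupation, from 2 to 20
  - Bytes: varies by occupation, from 3408 to 3840
Dataset usage
- Usage guide: more information needed; see CONTRIBUTING.md for how to contribute to dataset cards.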
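
The card gives no usage instructions. As a starting point, a single occupation split can probably be loaded with the Hugging Face `datasets` library, as sketched below. This assumes the repository is still publicly available under the ID shown and that `datasets` is installed; the helper names `load_profession_clusters` and `largest_cluster` are hypothetical.

```python
REPO_ID = "yjernite/prof_report__SD_v1.4_random_seeds__sd_21__24"


def load_profession_clusters(profession: str):
    """Load one occupation split, e.g. "paralegal".

    Requires `pip install datasets` and network access; the import is done
    lazily so the rest of this sketch works without the library.
    """
    from datasets import load_dataset
    return load_dataset(REPO_ID, split=profession)


def largest_cluster(rows):
    """Return the row dict with the greatest cluster_size."""
    return max(rows, key=lambda row: row["cluster_size"])


# Example use (not run here because it needs network access):
#   rows = load_profession_clusters("paralegal")
#   print(largest_cluster(rows)["cluster_id"])
```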



