five

yjernite/prof_report__SD_v1.4_random_seeds__sd_21__24

收藏
Hugging Face2023-06-03 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/yjernite/prof_report__SD_v1.4_random_seeds__sd_21__24
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: cluster_id dtype: int64 - name: cluster_size dtype: int64 - name: img_ids sequence: int64 - name: img_cluster_scores sequence: float64 splits: - name: paralegal num_bytes: 3600 num_examples: 10 - name: bartender num_bytes: 3504 num_examples: 6 - name: facilities_manager num_bytes: 3600 num_examples: 10 - name: accountant num_bytes: 3600 num_examples: 10 - name: graphic_designer num_bytes: 3672 num_examples: 13 - name: network_administrator num_bytes: 3408 num_examples: 2 - name: financial_manager num_bytes: 3624 num_examples: 11 - name: baker num_bytes: 3720 num_examples: 15 - name: security_guard num_bytes: 3648 num_examples: 12 - name: artist num_bytes: 3840 num_examples: 20 - name: author num_bytes: 3600 num_examples: 10 - name: printing_press_operator num_bytes: 3552 num_examples: 8 - name: public_relations_specialist num_bytes: 3648 num_examples: 12 - name: sheet_metal_worker num_bytes: 3576 num_examples: 9 - name: clergy num_bytes: 3648 num_examples: 12 - name: payroll_clerk num_bytes: 3552 num_examples: 8 - name: teller num_bytes: 3816 num_examples: 19 - name: real_estate_broker num_bytes: 3552 num_examples: 8 - name: customer_service_representative num_bytes: 3600 num_examples: 10 - name: painter num_bytes: 3768 num_examples: 17 - name: tractor_operator num_bytes: 3480 num_examples: 5 - name: dental_hygienist num_bytes: 3504 num_examples: 6 - name: industrial_engineer num_bytes: 3576 num_examples: 9 - name: electrician num_bytes: 3480 num_examples: 5 - name: head_cook num_bytes: 3744 num_examples: 16 - name: health_technician num_bytes: 3600 num_examples: 10 - name: carpet_installer num_bytes: 3456 num_examples: 4 - name: purchasing_agent num_bytes: 3624 num_examples: 11 - name: supervisor num_bytes: 3696 num_examples: 14 - name: civil_engineer num_bytes: 3648 num_examples: 12 - name: lawyer num_bytes: 3720 num_examples: 15 - name: language_pathologist num_bytes: 3600 num_examples: 10 - name: ceo num_bytes: 3672 num_examples: 13 - name: computer_support_specialist num_bytes: 3600 num_examples: 10 - name: postal_worker num_bytes: 3672 num_examples: 13 - name: mechanical_engineer num_bytes: 3648 num_examples: 12 - name: nursing_assistant num_bytes: 3552 num_examples: 8 - name: dentist num_bytes: 3624 num_examples: 11 - name: tutor num_bytes: 3720 num_examples: 15 - name: butcher num_bytes: 3648 num_examples: 12 - name: insurance_agent num_bytes: 3528 num_examples: 7 - name: courier num_bytes: 3720 num_examples: 15 - name: computer_programmer num_bytes: 3624 num_examples: 11 - name: truck_driver num_bytes: 3504 num_examples: 6 - name: mechanic num_bytes: 3528 num_examples: 7 - name: marketing_manager num_bytes: 3528 num_examples: 7 - name: sales_manager num_bytes: 3528 num_examples: 7 - name: correctional_officer num_bytes: 3696 num_examples: 14 - name: manager num_bytes: 3648 num_examples: 12 - name: underwriter num_bytes: 3672 num_examples: 13 - name: executive_assistant num_bytes: 3600 num_examples: 10 - name: designer num_bytes: 3648 num_examples: 12 - name: groundskeeper num_bytes: 3480 num_examples: 5 - name: mental_health_counselor num_bytes: 3672 num_examples: 13 - name: aerospace_engineer num_bytes: 3648 num_examples: 12 - name: taxi_driver num_bytes: 3696 num_examples: 14 - name: nurse num_bytes: 3576 num_examples: 9 - name: data_entry_keyer num_bytes: 3624 num_examples: 11 - name: musician num_bytes: 3696 num_examples: 14 - name: event_planner num_bytes: 3552 num_examples: 8 - name: writer num_bytes: 3672 num_examples: 13 - name: cook num_bytes: 3792 num_examples: 18 - name: welder num_bytes: 3624 num_examples: 11 - name: producer num_bytes: 3744 num_examples: 16 - name: hairdresser num_bytes: 3600 num_examples: 10 - name: farmer num_bytes: 3528 num_examples: 7 - name: construction_worker num_bytes: 3504 num_examples: 6 - name: air_conditioning_installer num_bytes: 3432 num_examples: 3 - name: electrical_engineer num_bytes: 3648 num_examples: 12 - name: occupational_therapist num_bytes: 3624 num_examples: 11 - name: career_counselor num_bytes: 3600 num_examples: 10 - name: interior_designer num_bytes: 3624 num_examples: 11 - name: jailer num_bytes: 3744 num_examples: 16 - name: office_clerk num_bytes: 3624 num_examples: 11 - name: market_research_analyst num_bytes: 3576 num_examples: 9 - name: laboratory_technician num_bytes: 3624 num_examples: 11 - name: social_assistant num_bytes: 3744 num_examples: 16 - name: medical_records_specialist num_bytes: 3576 num_examples: 9 - name: machinery_mechanic num_bytes: 3552 num_examples: 8 - name: police_officer num_bytes: 3672 num_examples: 13 - name: software_developer num_bytes: 3528 num_examples: 7 - name: clerk num_bytes: 3720 num_examples: 15 - name: salesperson num_bytes: 3648 num_examples: 12 - name: social_worker num_bytes: 3744 num_examples: 16 - name: director num_bytes: 3720 num_examples: 15 - name: fast_food_worker num_bytes: 3696 num_examples: 14 - name: singer num_bytes: 3792 num_examples: 18 - name: metal_worker num_bytes: 3576 num_examples: 9 - name: cleaner num_bytes: 3792 num_examples: 18 - name: computer_systems_analyst num_bytes: 3600 num_examples: 10 - name: dental_assistant num_bytes: 3504 num_examples: 6 - name: psychologist num_bytes: 3696 num_examples: 14 - name: machinist num_bytes: 3648 num_examples: 12 - name: therapist num_bytes: 3648 num_examples: 12 - name: veterinarian num_bytes: 3576 num_examples: 9 - name: teacher num_bytes: 3720 num_examples: 15 - name: architect num_bytes: 3720 num_examples: 15 - name: office_worker num_bytes: 3672 num_examples: 13 - name: drywall_installer num_bytes: 3480 num_examples: 5 - name: nutritionist num_bytes: 3480 num_examples: 5 - name: librarian num_bytes: 3672 num_examples: 13 - name: childcare_worker num_bytes: 3576 num_examples: 9 - name: school_bus_driver num_bytes: 3696 num_examples: 14 - name: file_clerk num_bytes: 3600 num_examples: 10 - name: logistician num_bytes: 3576 num_examples: 9 - name: scientist num_bytes: 3648 num_examples: 12 - name: teaching_assistant num_bytes: 3672 num_examples: 13 - name: radiologic_technician num_bytes: 3600 num_examples: 10 - name: manicurist num_bytes: 3576 num_examples: 9 - name: community_manager num_bytes: 3576 num_examples: 9 - name: carpenter num_bytes: 3480 num_examples: 5 - name: claims_appraiser num_bytes: 3576 num_examples: 9 - name: dispatcher num_bytes: 3528 num_examples: 7 - name: cashier num_bytes: 3600 num_examples: 10 - name: roofer num_bytes: 3504 num_examples: 6 - name: photographer num_bytes: 3792 num_examples: 18 - name: detective num_bytes: 3648 num_examples: 12 - name: financial_advisor num_bytes: 3576 num_examples: 9 - name: wholesale_buyer num_bytes: 3672 num_examples: 13 - name: it_specialist num_bytes: 3552 num_examples: 8 - name: pharmacy_technician num_bytes: 3504 num_examples: 6 - name: engineer num_bytes: 3648 num_examples: 12 - name: mover num_bytes: 3768 num_examples: 17 - name: plane_mechanic num_bytes: 3624 num_examples: 11 - name: interviewer num_bytes: 3672 num_examples: 13 - name: massage_therapist num_bytes: 3624 num_examples: 11 - name: dishwasher num_bytes: 3672 num_examples: 13 - name: fitness_instructor num_bytes: 3600 num_examples: 10 - name: credit_counselor num_bytes: 3624 num_examples: 11 - name: stocker num_bytes: 3816 num_examples: 19 - name: pharmacist num_bytes: 3672 num_examples: 13 - name: doctor num_bytes: 3672 num_examples: 13 - name: compliance_officer num_bytes: 3648 num_examples: 12 - name: aide num_bytes: 3768 num_examples: 17 - name: bus_driver num_bytes: 3672 num_examples: 13 - name: financial_analyst num_bytes: 3624 num_examples: 11 - name: receptionist num_bytes: 3504 num_examples: 6 - name: janitor num_bytes: 3672 num_examples: 13 - name: plumber num_bytes: 3504 num_examples: 6 - name: physical_therapist num_bytes: 3600 num_examples: 10 - name: inventory_clerk num_bytes: 3552 num_examples: 8 - name: firefighter num_bytes: 3600 num_examples: 10 - name: coach num_bytes: 3696 num_examples: 14 - name: maid num_bytes: 3648 num_examples: 12 - name: pilot num_bytes: 3696 num_examples: 14 - name: repair_worker num_bytes: 3624 num_examples: 11 download_size: 871516 dataset_size: 529248 --- # Dataset Card for "prof_report__SD_v1.4_random_seeds__sd_21__24" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

# 数据集信息(dataset_info) ## 特征(features) 该数据集包含以下特征: 1. 簇ID(cluster_id):数据类型为64位整数(int64) 2. 簇大小(cluster_size):数据类型为64位整数(int64) 3. 图像ID序列(img_ids):由多个64位整数组成的序列 4. 图像簇得分序列(img_cluster_scores):由多个64位浮点数(float64)组成的序列 ## 数据拆分(splits) 本数据集按职业划分为多个子集,各子集详情如下: - 律师助理(paralegal):占用字节数3600,样本量10 - 调酒师(bartender):占用字节数3504,样本量6 - 设施经理(facilities_manager):占用字节数3600,样本量10 - 会计师(accountant):占用字节数3600,样本量10 - 平面设计师(graphic_designer):占用字节数3672,样本量13 - 网络管理员(network_administrator):占用字节数3408,样本量2 - 财务经理(financial_manager):占用字节数3624,样本量11 - 面包师(baker):占用字节数3720,样本量15 - 保安(security_guard):占用字节数3648,样本量12 - 艺术家(artist):占用字节数3840,样本量20 - 作家(author):占用字节数3600,样本量10 - 印刷机操作员(printing_press_operator):占用字节数3552,样本量8 - 公关专员(public_relations_specialist):占用字节数3648,样本量12 - 钣金工(sheet_metal_worker):占用字节数3576,样本量9 - 神职人员(clergy):占用字节数3648,样本量12 - 薪资文员(payroll_clerk):占用字节数3552,样本量8 - 银行柜员(teller):占用字节数3816,样本量19 - 房地产经纪人(real_estate_broker):占用字节数3552,样本量8 - 客服代表(customer_service_representative):占用字节数3600,样本量10 - 油漆工(painter):占用字节数3768,样本量17 - 拖拉机操作员(tractor_operator):占用字节数3480,样本量5 - 牙科保健师(dental_hygienist):占用字节数3504,样本量6 - 工业工程师(industrial_engineer):占用字节数3576,样本量9 - 电工(electrician):占用字节数3480,样本量5 - 主厨(head_cook):占用字节数3744,样本量16 - 卫生技术员(health_technician):占用字节数3600,样本量10 - 地毯安装工(carpet_installer):占用字节数3456,样本量4 - 采购专员(purchasing_agent):占用字节数3624,样本量11 - 主管(supervisor):占用字节数3696,样本量14 - 土木工程师(civil_engineer):占用字节数3648,样本量12 - 律师(lawyer):占用字节数3720,样本量15 - 语言病理学家(language_pathologist):占用字节数3600,样本量10 - 首席执行官(ceo):占用字节数3672,样本量13 - 计算机支持专员(computer_support_specialist):占用字节数3600,样本量10 - 邮政人员(postal_worker):占用字节数3672,样本量13 - 机械工程师(mechanical_engineer):占用字节数3648,样本量12 - 护理助理(nursing_assistant):占用字节数3552,样本量8 - 牙医(dentist):占用字节数3624,样本量11 - 家教(tutor):占用字节数3720,样本量15 - 屠夫(butcher):占用字节数3648,样本量12 - 保险代理人(insurance_agent):占用字节数3528,样本量7 - 快递员(courier):占用字节数3720,样本量15 - 计算机程序员(computer_programmer):占用字节数3624,样本量11 - 卡车司机(truck_driver):占用字节数3504,样本量6 - 机修工(mechanic):占用字节数3528,样本量7 - 营销经理(marketing_manager):占用字节数3528,样本量7 - 销售经理(sales_manager):占用字节数3528,样本量7 - 狱警(correctional_officer):占用字节数3696,样本量14 - 经理(manager):占用字节数3648,样本量12 - 保险核保员(underwriter):占用字节数3672,样本量13 - 行政助理(executive_assistant):占用字节数3600,样本量10 - 设计师(designer):占用字节数3648,样本量12 - 园丁(groundskeeper):占用字节数3480,样本量5 - 心理健康咨询师(mental_health_counselor):占用字节数3672,样本量13 - 航空航天工程师(aerospace_engineer):占用字节数3648,样本量12 - 出租车司机(taxi_driver):占用字节数3696,样本量14 - 护士(nurse):占用字节数3576,样本量9 - 数据录入员(data_entry_keyer):占用字节数3624,样本量11 - 音乐家(musician):占用字节数3696,样本量14 - 活动策划师(event_planner):占用字节数3552,样本量8 - 写手(writer):占用字节数3672,样本量13 - 厨师(cook):占用字节数3792,样本量18 - 焊工(welder):占用字节数3624,样本量11 - 制作人(producer):占用字节数3744,样本量16 - 美发师(hairdresser):占用字节数3600,样本量10 - 农民(farmer):占用字节数3528,样本量7 - 建筑工人(construction_worker):占用字节数3504,样本量6 - 空调安装工(air_conditioning_installer):占用字节数3432,样本量3 - 电气工程师(electrical_engineer):占用字节数3648,样本量12 - 职业治疗师(occupational_therapist):占用字节数3624,样本量11 - 职业咨询师(career_counselor):占用字节数3600,样本量10 - 室内设计师(interior_designer):占用字节数3624,样本量11 - 看守所看守(jailer):占用字节数3744,样本量16 - 办公室文员(office_clerk):占用字节数3624,样本量11 - 市场调研分析师(market_research_analyst):占用字节数3576,样本量9 - 实验室技术员(laboratory_technician):占用字节数3624,样本量11 - 社会服务助理(social_assistant):占用字节数3744,样本量16 - 病历专员(medical_records_specialist):占用字节数3576,样本量9 - 机械维修工(machinery_mechanic):占用字节数3552,样本量8 - 警察(police_officer):占用字节数3672,样本量13 - 软件开发人员(software_developer):占用字节数3528,样本量7 - 文员(clerk):占用字节数3720,样本量15 - 销售人员(salesperson):占用字节数3648,样本量12 - 社工(social_worker):占用字节数3744,样本量16 - 总监(director):占用字节数3720,样本量15 - 快餐员工(fast_food_worker):占用字节数3696,样本量14 - 歌手(singer):占用字节数3792,样本量18 - 金属加工工(metal_worker):占用字节数3576,样本量9 - 清洁工(cleaner):占用字节数3792,样本量18 - 计算机系统分析师(computer_systems_analyst):占用字节数3600,样本量10 - 牙科助理(dental_assistant):占用字节数3504,样本量6 - 心理学家(psychologist):占用字节数3696,样本量14 - 机械师(machinist):占用字节数3648,样本量12 - 治疗师(therapist):占用字节数3648,样本量12 - 兽医(veterinarian):占用字节数3576,样本量9 - 教师(teacher):占用字节数3720,样本量15 - 建筑师(architect):占用字节数3720,样本量15 - 办公室职员(office_worker):占用字节数3672,样本量13 - 石膏板安装工(drywall_installer):占用字节数3480,样本量5 - 营养师(nutritionist):占用字节数3480,样本量5 - 图书管理员(librarian):占用字节数3672,样本量13 - 儿童保育工作者(childcare_worker):占用字节数3576,样本量9 - 校车司机(school_bus_driver):占用字节数3696,样本量14 - 档案文员(file_clerk):占用字节数3600,样本量10 - 物流师(logistician):占用字节数3576,样本量9 - 科学家(scientist):占用字节数3648,样本量12 - 教学助理(teaching_assistant):占用字节数3672,样本量13 - 放射科技师(radiologic_technician):占用字节数3600,样本量10 - 美甲师(manicurist):占用字节数3576,样本量9 - 社区经理(community_manager):占用字节数3576,样本量9 - 木匠(carpenter):占用字节数3480,样本量5 - 理赔评估师(claims_appraiser):占用字节数3576,样本量9 - 调度员(dispatcher):占用字节数3528,样本量7 - 收银员(cashier):占用字节数3600,样本量10 - 屋面工(roofer):占用字节数3504,样本量6 - 摄影师(photographer):占用字节数3792,样本量18 - 侦探(detective):占用字节数3648,样本量12 - 财务顾问(financial_advisor):占用字节数3576,样本量9 - 批发采购员(wholesale_buyer):占用字节数3672,样本量13 - IT专员(it_specialist):占用字节数3552,样本量8 - 药房技术员(pharmacy_technician):占用字节数3504,样本量6 - 工程师(engineer):占用字节数3648,样本量12 - 搬家工人(mover):占用字节数3768,样本量17 - 飞机维修工(plane_mechanic):占用字节数3624,样本量11 - 访调员(interviewer):占用字节数3672,样本量13 - 按摩治疗师(massage_therapist):占用字节数3624,样本量11 - 洗碗工(dishwasher):占用字节数3672,样本量13 - 健身教练(fitness_instructor):占用字节数3600,样本量10 - 信贷咨询师(credit_counselor):占用字节数3624,样本量11 - 库存员(stocker):占用字节数3816,样本量19 - 药剂师(pharmacist):占用字节数3672,样本量13 - 医生(doctor):占用字节数3672,样本量13 - 合规专员(compliance_officer):占用字节数3648,样本量12 - 助理(aide):占用字节数3768,样本量17 - 巴士司机(bus_driver):占用字节数3672,样本量13 - 财务分析师(financial_analyst):占用字节数3624,样本量11 - 前台接待(receptionist):占用字节数3504,样本量6 - 保洁员(janitor):占用字节数3672,样本量13 - 水管工(plumber):占用字节数3504,样本量6 - 物理治疗师(physical_therapist):占用字节数3600,样本量10 - 库存文员(inventory_clerk):占用字节数3552,样本量8 - 消防员(firefighter):占用字节数3600,样本量10 - 教练(coach):占用字节数3696,样本量14 - 女佣(maid):占用字节数3648,样本量12 - 飞行员(pilot):占用字节数3696,样本量14 - 维修工人(repair_worker):占用字节数3624,样本量11 本数据集下载大小为871516字节,总数据集大小为529248字节。 # 数据集卡片(Dataset Card):"prof_report__SD_v1.4_random_seeds__sd_21__24" [需补充更多信息](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
yjernite
原始信息汇总

数据集概述

数据集名称

  • 名称: prof_report__SD_v1.4_random_seeds__sd_21__24

数据集大小

  • 下载大小: 871516
  • 数据集大小: 529248

数据集特征

  • 特征列表:
    • cluster_id: 整数类型 (int64)
    • cluster_size: 整数类型 (int64)
    • img_ids: 序列类型,整数 (sequence: int64)
    • img_cluster_scores: 序列类型,浮点数 (sequence: float64)

数据集分割

  • 分割详情:
    • 名称: 多种职业
    • 示例数量: 每个职业的示例数量不同,范围从2到20不等
    • 字节数: 每个职业的字节数也不同,范围从3408到3840不等

数据集使用

  • 使用指南: 需要更多信息,参考CONTRIBUTING.md获取如何贡献数据集卡片的指南。
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作