five

HUVER|无人机数据集|多模态数据数据集

收藏
huggingface2024-07-02 更新2024-12-12 收录
无人机
多模态数据
下载链接:
https://huggingface.co/datasets/raiselab/HUVER
下载链接
链接失效反馈
资源简介:
HUVER数据集包含6,051个独特的无人机配置,每个配置通过多种格式描述,包括语法字符串、RGB图像和GLB文件。此外,每个配置还附带一个英文文本描述符,用自然语言详细描述无人机的特征。该数据集支持图像到文本、图像到3D和特征提取等多种任务,由Abhiram Karri、Gary Stump、Christopher McComb和Binyang Song策划,并采用MIT许可证。
创建时间:
2024-07-02
原始信息汇总

数据集卡片 for HUVER

概述

HUVER数据集包含6,051个独特的无人机配置,每个配置通过多种数据格式描述,包括语法字符串、RGB图像和GLB文件。此外,还提供了基于配置的描述,即使用自然语言描述每个无人机的特征。

语言

  • 语言(NLP): 英语, en

许可证

  • 许可证: MIT

用途

直接用途

  • 该多模态无人机数据集包含多种无人机表示形式,如GLB格式的3D模型、语法表示、文本描述和参数数据。这种多样性有助于开发利用不同无人机表示形式的代理模型,以更准确地预测性能。
  • 该多方面无人机数据集支持通过各种生成模型创建多样化的无人机设计。模型如GANs、LSTMs、transformers和GNNs可以生成新的无人机图像和不同格式的设计,包括GLB。数据集中包含的负面示例有助于早期识别和纠正潜在的设计缺陷,增强模型细化和确保无人机设计的可行性和安全性。

超出范围的用途

  • 该数据集不适用于飞行准备系统的详细设计。

数据集结构

数据实例

json { Image: <0001.png>, glb_file: https://huggingface.co/datasets/raiselab/HUVER/resolve/main/train/glb/0001.glb?download=true, Grammar_string": <*aMM0-*bNM2++*cMN1++dLM2eML1^ab^ac^ad^ae>, Cost ($): <1877.19>, Number of Batteries: <1>, Number of Motor-Rotor Pairs: <4>, Number of Airfoils: <0>, Number of Connectors: <4>, Weight of Batteries (lb): <19.40347644>, Weight of Motor-Rotor Pair (lb): <3.858051314>, Weight of Airfoils (lb): <0.0>, Total Weight (lb): <23.26152854>, Total Thrust (lb): <82.50002518>, Normalized Average Structure Size: <0.324324324>, Normalized Average Motor Size: <0.259259259>, Normalized Average Foil Size: <0.0>, Design Descriptor: <This drone is made up of 1 part and has 4 engines that help it move. It also has 0 wings for better flying. It has 4 links that connect everything together securely. The drone weighs 23.2615285432816 pounds in total and can lift itself and more, thanks to its strong thrust of 82.500025177002 pounds.>, Operations Descriptor: <This drone configuration has a feasible flying range of 0.0-0.0 miles, evaluated over the payload range of 0-0 pounds. This configuration has a velocity range of 0.06352621-0.06352621 mph. It is observed when payload increases, the flying range and velocity decrease. The drones achieve highest values of velocity and range for the lowest payloads. It can be interpreted from the data that the drone can fly as far as 0.0 miles, and can reach maximum speeds up to 0.06352621 mph. This means that while the drone does well in many situations, how far and fast it can fly can vary with how much payload it carries. This drone costs around $1877.19, adding up costs of all the components used to achieve this configuration.>, Performance: <Feasibilty": "CouldNotStabilize", "Flying Range": 0.0, "Payload Capacity (lb)": 0, "Velocity (mph)": 0.06352621, "Performance Descriptor": "This drone could not hover. The drone for a payload of 0 pounds, could not accomplish a successful run, the reason being either the motors could not provide enough lift or the drone did not balance properly after flight".> }

数据字段

  • Grammar String: 每个无人机配置可以通过一个语法字符串完全描述,该字符串根据特定的预定义语法规则结构化。
  • Image: 对应于无人机配置(语法字符串)的RGB图像的俯视图。
  • glb: 对应无人机配置的详细空间结构的3D网格表示。
  • 配置参数字段: 包括电池数量、电机-旋翼对数量、翼片数量、连接器数量、电池重量、电机-旋翼对重量、翼片重量、总重量、总推力、归一化平均结构尺寸、归一化平均电机尺寸、归一化平均翼片尺寸。
  • 文本描述:
    • Design Descriptor: 基于无人机配置的设计描述。
    • Performance Descriptor: 基于无人机模拟结果的性能描述。
    • Operational Descriptor: 基于无人机操作范围的性能曲线描述。
AI搜集汇总
数据集介绍
main_image_url
构建方式
HUVER数据集通过整合多种数据格式,构建了一个包含6,051种独特无人机配置的多样化数据集。每种配置均以语法字符串、RGB图像和GLB文件的形式进行描述,并辅以基于配置的自然语言文本描述。这种多模态的数据结构不仅涵盖了无人机的设计特征,还通过详细的参数化数据提供了性能评估的基础。数据集的构建过程注重多样性与实用性,旨在为无人机设计领域的模型开发提供丰富的训练资源。
特点
HUVER数据集以其多模态特性脱颖而出,涵盖了语法字符串、RGB图像、GLB文件以及自然语言描述等多种数据形式。此外,数据集还提供了详细的配置参数,如电池数量、电机-转子对数量、翼型数量等,以及性能描述和操作范围曲线。这些特征使得数据集能够支持从图像生成到3D建模、从特征提取到性能预测的多种任务。数据集中的负例设计也有助于识别潜在的设计缺陷,提升模型的鲁棒性与实用性。
使用方法
HUVER数据集适用于多种生成模型和预测任务,如生成对抗网络(GANs)、长短期记忆网络(LSTMs)、Transformer模型和图神经网络(GNNs)。用户可以通过数据集中的多模态数据生成新的无人机设计,或利用参数化数据训练性能预测模型。数据集中的负例设计可用于模型优化,帮助识别并修正设计中的潜在问题。此外,数据集还提供了可视化脚本,便于用户直观地探索和分析数据。
背景与挑战
背景概述
HUVER数据集由Abhiram Karri、Gary Stump、Christopher McComb和Binyang Song等研究人员于近期创建,旨在为无人机(UAV)设计领域提供多模态数据支持。该数据集包含6,051种独特的无人机配置,每种配置通过多种数据格式进行描述,包括语法字符串、RGB图像和GLB文件。此外,数据集还提供了基于配置的自然语言描述,详细说明了每种无人机的特征。HUVER的创建旨在推动无人机设计的自动化与优化,特别是在生成模型和性能预测方面,具有重要的研究价值和应用潜力。
当前挑战
HUVER数据集在解决无人机设计自动化问题时面临多重挑战。首先,无人机设计的复杂性要求数据集能够准确捕捉多种设计参数及其相互关系,这对数据的多样性和完整性提出了较高要求。其次,构建过程中,如何将不同模态的数据(如3D模型、图像和文本描述)有效整合,并确保其一致性和可解释性,是一个技术难点。此外,数据集中包含的负例设计虽然有助于识别潜在设计缺陷,但也增加了数据标注和验证的复杂性。这些挑战不仅影响了数据集的构建过程,也对后续模型的训练和性能预测提出了更高的要求。
常用场景
经典使用场景
在无人机设计领域,HUVER数据集提供了一个多模态的无人机配置库,涵盖了从语法描述到3D模型的多种数据格式。这一数据集特别适用于开发能够预测无人机性能的代理模型,通过整合不同的无人机表示形式,如RGB图像、GLB文件和自然语言描述,研究人员能够更精确地模拟和预测无人机的飞行性能。
衍生相关工作
基于HUVER数据集,许多研究工作得以展开,特别是在无人机设计自动化和性能预测领域。例如,研究人员利用该数据集开发了基于深度学习的无人机设计生成模型,这些模型能够自动生成符合特定性能要求的无人机配置。此外,该数据集还促进了无人机设计中的多模态数据融合技术的研究,推动了无人机设计领域的智能化发展。
数据集最近研究
最新研究方向
近年来,HUVER数据集在无人机设计与性能预测领域引起了广泛关注。该数据集通过多模态数据(包括语法字符串、RGB图像和GLB文件)提供了丰富的无人机配置信息,为生成模型和性能预测模型的研究提供了坚实的基础。特别是在生成对抗网络(GANs)、长短期记忆网络(LSTMs)、变压器模型(Transformers)和图神经网络(GNNs)等技术的支持下,研究者能够生成多样化的无人机设计,并通过负样本优化设计缺陷。此外,HUVER数据集在无人机设计的早期阶段识别潜在问题,提升了设计的可行性和安全性,推动了无人机工程领域的创新与进步。
以上内容由AI搜集并总结生成
用户留言
有没有相关的论文或文献参考?
这个数据集是基于什么背景创建的?
数据集的作者是谁?
能帮我联系到这个数据集的作者吗?
这个数据集如何下载?
点击留言
数据主题
具身智能
数据集  4098个
机构  8个
大模型
数据集  439个
机构  10个
无人机
数据集  37个
机构  6个
指令微调
数据集  36个
机构  6个
蛋白质结构
数据集  50个
机构  8个
空间智能
数据集  21个
机构  5个
5,000+
优质数据集
54 个
任务类型
进入经典数据集
热门数据集

中国劳动力动态调查

“中国劳动力动态调查” (China Labor-force Dynamics Survey,简称 CLDS)是“985”三期“中山大学社会科学特色数据库建设”专项内容,CLDS的目的是通过对中国城乡以村/居为追踪范围的家庭、劳动力个体开展每两年一次的动态追踪调查,系统地监测村/居社区的社会结构和家庭、劳动力个体的变化与相互影响,建立劳动力、家庭和社区三个层次上的追踪数据库,从而为进行实证导向的高质量的理论研究和政策研究提供基础数据。

中国学术调查数据资料库 收录

TCIA

TCIA(The Cancer Imaging Archive)是一个公开的癌症影像数据集,包含多种癌症类型的医学影像数据,如CT、MRI、PET等。这些数据通常与临床和病理信息相结合,用于癌症研究和临床试验。

www.cancerimagingarchive.net 收录

flames-and-smoke-datasets

该仓库总结了多个公开的火焰和烟雾数据集,包括DFS、D-Fire dataset、FASDD、FLAME、BoWFire、VisiFire、fire-smoke-detect-yolov4、Forest Fire等数据集。每个数据集都有详细的描述,包括数据来源、图像数量、标注信息等。

github 收录

Canadian Census

**Overview** The data package provides demographics for Canadian population groups according to multiple location categories: Forward Sortation Areas (FSAs), Census Metropolitan Areas (CMAs) and Census Agglomerations (CAs), Federal Electoral Districts (FEDs), Health Regions (HRs) and provinces. **Description** The data are available through the Canadian Census and the National Household Survey (NHS), separated or combined. The main demographic indicators provided for the population groups, stratified not only by location but also for the majority by demographical and socioeconomic characteristics, are population number, females and males, usual residents and private dwellings. The primary use of the data at the Health Region level is for health surveillance and population health research. Federal and provincial departments of health and human resources, social service agencies, and other types of government agencies use the information to monitor, plan, implement and evaluate programs to improve the health of Canadians and the efficiency of health services. Researchers from various fields use the information to conduct research to improve health. Non-profit health organizations and the media use the health region data to raise awareness about health, an issue of concern to all Canadians. The Census population counts for a particular geographic area representing the number of Canadians whose usual place of residence is in that area, regardless of where they happened to be on Census Day. Also included are any Canadians who were staying in that area on Census Day and who had no usual place of residence elsewhere in Canada, as well as those considered to be 'non-permanent residents'. National Household Survey (NHS) provides demographic data for various levels of geography, including provinces and territories, census metropolitan areas/census agglomerations, census divisions, census subdivisions, census tracts, federal electoral districts and health regions. In order to provide a comprehensive overview of an area, this product presents data from both the NHS and the Census. NHS data topics include immigration and ethnocultural diversity; aboriginal peoples; education and labor; mobility and migration; language of work; income and housing. 2011 Census data topics include population and dwelling counts; age and sex; families, households and marital status; structural type of dwelling and collectives; and language. The data are collected for private dwellings occupied by usual residents. A private dwelling is a dwelling in which a person or a group of persons permanently reside. Information for the National Household Survey does not include information for collective dwellings. Collective dwellings are dwellings used for commercial, institutional or communal purposes, such as a hotel, a hospital or a work camp. **Benefits** - Useful for canada public health stakeholders, for public health specialist or specialized public and other interested parties. for health surveillance and population health research. for monitoring, planning, implementation and evaluation of health-related programs. media agencies may use the health regions data to raise awareness about health, an issue of concern to all canadians. giving the addition of longitude and latitude in some of the datasets the data can be useful to transpose the values into geographical representations. the fields descriptions along with the dataset description are useful for the user to quickly understand the data and the dataset. **License Information** The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the [Data Library](https://www.johnsnowlabs.com/marketplace/) on John Snow Labs website. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes. **Included Datasets** - [Canadian Population and Dwelling by FSA 2011](https://www.johnsnowlabs.com/marketplace/canadian-population-and-dwelling-by-fsa-2011) - This Canadian Census dataset covers data on population, total private dwellings and private dwellings occupied by usual residents by forward sortation area (FSA). It is enriched with the percentage of the population or dwellings versus the total amount as well as the geographical area, province, and latitude and longitude. The whole Canada's population is marked as 100, referring to 100% for the percentages. - [Detailed Canadian Population Statistics by CMAs and CAs 2011](https://www.johnsnowlabs.com/marketplace/detailed-canadian-population-statistics-by-cmas-and-cas-2011) - This dataset covers the population statistics of Canada by Census Metropolitan Areas (CMAs) and Census Agglomerations (CAs). It is categorized also by citizen/immigration status, ethnic origin, religion, mobility, education, language, work, housing, income etc. There is detailed characteristics categorization within these stated categories that are in 5 layers. - [Detailed Canadian Population Statistics by FED 2011](https://www.johnsnowlabs.com/marketplace/detailed-canadian-population-statistics-by-fed-2011) - This dataset covers the population statistics of Canada from 2011 by Federal Electoral District of 2013 Representation Order. It is categorized also by citizen/immigration status, ethnic origin, religion, mobility, education, language, work, housing, income etc. There is detailed characteristics categorization within these stated categories that are in 5 layers. - [Detailed Canadian Population Statistics by Health Region 2011](https://www.johnsnowlabs.com/marketplace/detailed-canadian-population-statistics-by-health-region-2011) - This dataset covers the population statistics of Canada by health region. It is categorized also by citizen/immigration status, ethnic origin, religion, mobility, education, language, work, housing, income etc. There is detailed characteristics categorization within these stated categories that are in 5 layers. - [Detailed Canadian Population Statistics by Province 2011](https://www.johnsnowlabs.com/marketplace/detailed-canadian-population-statistics-by-province-2011) - This dataset covers the population statistics of Canada by provinces and territories. It is categorized also by citizen/immigration status, ethnic origin, religion, mobility, education, language, work, housing, income etc. There is detailed characteristics categorization within these stated categories that are in 5 layers. **Data Engineering Overview** **We deliver high-quality data** - Each dataset goes through 3 levels of quality review - 2 Manual reviews are done by domain experts - Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints - Data is normalized into one unified type system - All dates, unites, codes, currencies look the same - All null values are normalized to the same value - All dataset and field names are SQL and Hive compliant - Data and Metadata - Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters - Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated - Data Updates - Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted **Our data is curated and enriched by domain experts** Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts: - Field names, descriptions, and normalized values are chosen by people who actually understand their meaning - Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset - Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations - The data is always kept up to date – even when the source requires manual effort to get updates - Support for data subscribers is provided directly by the domain experts who curated the data sets - Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution. **Need Help?** If you have questions about our products, contact us at [info@johnsnowlabs.com](mailto:info@johnsnowlabs.com).

Databricks 收录

URPC系列数据集, S-URPC2019, UDD

URPC系列数据集包括URPC2017至URPC2020DL,主要用于水下目标的检测和分类。S-URPC2019专注于水下环境的特定检测任务。UDD数据集信息未在README中详细描述。

github 收录