five

Data used in Machine learning reveals the waggle drift's role in the honey bee dance communication system|蜜蜂行为数据集|机器学习数据集

收藏
Mendeley Data2024-05-10 更新2024-06-27 收录
蜜蜂行为
机器学习
下载链接:
https://zenodo.org/records/7928121
下载链接
链接失效反馈
资源简介:
Data and metadata used in "Machine learning reveals the waggle drift’s role in the honey bee dance communication system" All timestamps are given in ISO 8601 format. The following files are included: Berlin2019_waggle_phases.csv, Berlin2021_waggle_phases.csv Automatic individual detections of waggle phases during our recording periods in 2019 and 2021. timestamp: Date and time of the detection. cam_id: Camera ID (0: left side of the hive, 1: right side of the hive). x_median, y_median: Median position of the bee during the waggle phase (for 2019 given in millimeters after applying a homography, for 2021 in the original image coordinates). waggle_angle: Body orientation of the bee during the waggle phase in radians (0: oriented to the right, PI / 4: oriented upwards). Berlin2019_dances.csv Automatic detections of dance behavior during our recording period in 2019. dancer_id: Unique ID of the individual bee. dance_id: Unique ID of the dance. ts_from, ts_to: Date and time of the beginning and end of the dance. cam_id: Camera ID (0: left side of the hive, 1: right side of the hive). median_x, median_y: Median position of the individual during the dance. feeder_cam_id: ID of the feeder that the bee was detected at prior to the dance. Berlin2019_followers.csv Automatic detections of attendance and following behavior, corresponding to the dances in Berlin2019_dances.csv. dance_id: Unique ID of the dance being attended or followed. follower_id: Unique ID of the individual attending or following the dance. ts_from, ts_to: Date and time of the beginning and end of the interaction. label: “attendance” or “follower” cam_id: Camera ID (0: left side of the hive, 1: right side of the hive). Berlin2019_dances_with_manually_verified_times.csv A sample of dances from Berlin2019_dances.csv where the exact timestamps have been manually verified to correspond to the beginning of the first and last waggle phase down to a precision of ca. 166 ms (video material was recorded at 6 FPS). dance_id: Unique ID of the dance. dancer_id: Unique ID of the dancing individual. cam_id: Camera ID (0: left side of the hive, 1: right side of the hive). feeder_cam_id: ID of the feeder that the bee was detected at prior to the dance. dance_start, dance_end: Manually verified date and times of the beginning and end of the dance. Berlin2019_dance_classifier_labels.csv Manually annotated waggle phases or following behavior for our recording season in 2019 that was used to train the dancing and following classifier. Can be merged with the supplied individual detections. timestamp: Timestamp of the individual frame the behavior was observed in. frame_id: Unique ID of the video frame the behavior was observed in. bee_id: Unique ID of the individual bee. label: One of “nothing”, “waggle”, “follower” Berlin2019_dance_classifier_unlabeled.csv Additional unlabeled samples of timestamp and individual ID with the same format as Berlin2019_dance_classifier_labels.csv, but without a label. The data points have been sampled close to detections of our waggle phase classifier, so behaviors related to the waggle dance are likely overrepresented in that sample. Berlin2021_waggle_phase_classifier_labels.csv Manually annotated detections of our waggle phase detector (bb_wdd2) that were used to train the neural network filter (bb_wdd_filter) for the 2021 data. detection_id: Unique ID of the waggle phase. label: One of “waggle”, “activating”, “ventilating”, “trembling”, “other”. Where “waggle” denoted a waggle phase, “activating” is the shaking signal, “ventilating” is a bee fanning her wings. “trembling” denotes a tremble dance, but the distinction from the “other” class was often not clear, so “trembling” was merged into “other” for training. orientation: The body orientation of the bee that triggered the detection in radians (0: facing to the right, PI /4: facing up). metadata_path: Path to the individual detection in the same directory structure as created by the waggle dance detector. Berlin2021_waggle_phase_classifier_ground_truth.zip The output of the waggle dance detector (bb_wdd2) that corresponds to Berlin2021_waggle_phase_classifier_labels.csv and is used for training. The archive includes a directory structure as output by the bb_wdd2 and each directory includes the original image sequence that triggered the detection in an archive and the corresponding metadata. The training code supplied in bb_wdd_filter directly works with this directory structure. Berlin2019_tracks.zip Detections and tracks from the recording season in 2019 as produced by our tracking system. As the full data is several terabytes in size, we include the subset of our data here that is relevant for our publication which comprises over 46 million detections. We included tracks for all detected behaviors (dancing, following, attending) including one minute before and after the behavior. We also included all tracks that correspond to the labeled and unlabeled data that was used to train the dance classifier including 30 seconds before and after the data used for training. We grouped the exported data by date to make the handling easier, but to efficiently work with the data, we recommend importing it into an indexable database. The individual files contain the following columns: cam_id: Camera ID (0: left side of the hive, 1: right side of the hive). timestamp: Date and time of the detection. frame_id: Unique ID of the video frame of the recording from which the detection was extracted. track_id: Unique ID of an individual track (short motion path from one individual). For longer tracks, the detections can be linked based on the bee_id. bee_id: Unique ID of the individual bee. bee_id_confidence: Confidence between 0 and 1 that the bee_id is correct as output by our tracking system. x_pos_hive, y_pos_hive: Spatial position of the bee in the hive on the side indicated by cam_id. Given in millimeters after applying a homography on the video material. orientation_hive: Orientation of the bees’ thorax in the hive in radians (0: oriented to the right, PI / 4: oriented upwards). Berlin2019_feeder_experiment_log.csv Experiment log for our feeder experiments in 2019. date: Date given in the format year-month-day. feeder_cam_id: Numeric ID of the feeder. coordinates: Longitude and latitude of the feeder. For feeders 1 and 2 this is only given once and held constant. Feeder 3 had varying locations. time_opened, time_closed: Date and time when the feeder was set up or closed again. sucrose_solution: Concentration of the sucrose solution given as sugar:water (in terms of weight). On days where feeder 3 was open, the other two feeders offered water without sugar. Software used to acquire and analyze the data: bb_pipeline: Tag localization and decoding pipeline bb_pipeline_models: Pretrained localizer and decoder models for bb_pipeline bb_binary: Raw detection data storage format bb_irflash: IR flash system schematics and arduino code bb_imgacquisition: Recording and network storage bb_behavior: Database interaction and data (pre)processing, feature extraction bb_tracking: Tracking of bee detections over time bb_wdd2: Automatic detection and decoding of honey bee waggle dances bb_wdd_filter: Machine learning model to improve the accuracy of the waggle dance detector bb_dance_networks: Detection of dancing and following behavior from trajectories
创建时间:
2023-06-28
用户留言
有没有相关的论文或文献参考?
这个数据集是基于什么背景创建的?
数据集的作者是谁?
能帮我联系到这个数据集的作者吗?
这个数据集如何下载?
点击留言
数据主题
具身智能
数据集  4099个
机构  8个
大模型
数据集  439个
机构  10个
无人机
数据集  37个
机构  6个
指令微调
数据集  36个
机构  6个
蛋白质结构
数据集  50个
机构  8个
空间智能
数据集  21个
机构  5个
5,000+
优质数据集
54 个
任务类型
进入经典数据集
热门数据集

MeSH

MeSH(医学主题词表)是一个用于索引和检索生物医学文献的标准化词汇表。它包含了大量的医学术语和概念,用于描述医学文献中的主题和内容。MeSH数据集包括主题词、副主题词、树状结构、历史记录等信息,广泛应用于医学文献的分类和检索。

www.nlm.nih.gov 收录

中国交通事故深度调查(CIDAS)数据集

交通事故深度调查数据通过采用科学系统方法现场调查中国道路上实际发生交通事故相关的道路环境、道路交通行为、车辆损坏、人员损伤信息,以探究碰撞事故中车损和人伤机理。目前已积累深度调查事故10000余例,单个案例信息包含人、车 、路和环境多维信息组成的3000多个字段。该数据集可作为深入分析中国道路交通事故工况特征,探索事故预防和损伤防护措施的关键数据源,为制定汽车安全法规和标准、完善汽车测评试验规程、

北方大数据交易中心 收录

OpenSonarDatasets

OpenSonarDatasets是一个致力于整合开放源代码声纳数据集的仓库,旨在为水下研究和开发提供便利。该仓库鼓励研究人员扩展当前的数据集集合,以增加开放源代码声纳数据集的可见性,并提供一个更容易查找和比较数据集的方式。

github 收录

ABIDE Dataset

ABIDE(自闭症脑成像数据交换)数据集包含1112个数据集,包括539个来自ASD个体的数据和573个来自典型控制者的数据(年龄7-64岁,跨组中位数14.7岁)。数据集涉及17个国际站点,包括静息状态fMRI(R-fMRI)、解剖数据集和表型数据集。

github 收录

ADE20K

ADE20K 数据集包含 Scene Parsing Benchmark 场景数据和部分分割数据。图像和注释:每个文件夹包含按场景类别分类的图像,对象和部分分割分别存储在两个不同的 png 文件中。所有对象和零件实例均已单独注释。

OpenDataLab 收录