Data used in "Machine learning reveals the waggle drift's role in the honey bee dance communication system" | Honey bee behavior dataset | Machine learning dataset

Mendeley Data, updated 2024-05-10, indexed 2024-06-27

Honey bee behavior

Machine learning

Download link:

https://zenodo.org/records/7928121

Resource description:

Data and metadata used in "Machine learning reveals the waggle drift's role in the honey bee dance communication system". All timestamps are given in ISO 8601 format. The following files are included:

Berlin2019_waggle_phases.csv, Berlin2021_waggle_phases.csv
Automatic individual detections of waggle phases during our recording periods in 2019 and 2021.
timestamp: Date and time of the detection.
cam_id: Camera ID (0: left side of the hive, 1: right side of the hive).
x_median, y_median: Median position of the bee during the waggle phase (for 2019 given in millimeters after applying a homography, for 2021 in the original image coordinates).
waggle_angle: Body orientation of the bee during the waggle phase in radians (0: oriented to the right, PI / 4: oriented upwards).

Berlin2019_dances.csv
Automatic detections of dance behavior during our recording period in 2019.
dancer_id: Unique ID of the individual bee.
dance_id: Unique ID of the dance.
ts_from, ts_to: Date and time of the beginning and end of the dance.
cam_id: Camera ID (0: left side of the hive, 1: right side of the hive).
median_x, median_y: Median position of the individual during the dance.
feeder_cam_id: ID of the feeder that the bee was detected at prior to the dance.

Berlin2019_followers.csv
Automatic detections of attendance and following behavior, corresponding to the dances in Berlin2019_dances.csv.
dance_id: Unique ID of the dance being attended or followed.
follower_id: Unique ID of the individual attending or following the dance.
ts_from, ts_to: Date and time of the beginning and end of the interaction.
label: "attendance" or "follower".
cam_id: Camera ID (0: left side of the hive, 1: right side of the hive).

Berlin2019_dances_with_manually_verified_times.csv
A sample of dances from Berlin2019_dances.csv where the exact timestamps have been manually verified to correspond to the beginning of the first and last waggle phase, down to a precision of ca. 166 ms (video material was recorded at 6 FPS).
dance_id: Unique ID of the dance.
dancer_id: Unique ID of the dancing individual.
cam_id: Camera ID (0: left side of the hive, 1: right side of the hive).
feeder_cam_id: ID of the feeder that the bee was detected at prior to the dance.
dance_start, dance_end: Manually verified date and time of the beginning and end of the dance.

Berlin2019_dance_classifier_labels.csv
Manually annotated waggle phases or following behavior for our recording season in 2019 that were used to train the dancing and following classifier. Can be merged with the supplied individual detections.
timestamp: Timestamp of the individual frame the behavior was observed in.
frame_id: Unique ID of the video frame the behavior was observed in.
bee_id: Unique ID of the individual bee.
label: One of "nothing", "waggle", "follower".

Berlin2019_dance_classifier_unlabeled.csv
Additional unlabeled samples of timestamp and individual ID with the same format as Berlin2019_dance_classifier_labels.csv, but without a label. The data points have been sampled close to detections of our waggle phase classifier, so behaviors related to the waggle dance are likely overrepresented in that sample.

Berlin2021_waggle_phase_classifier_labels.csv
Manually annotated detections of our waggle phase detector (bb_wdd2) that were used to train the neural network filter (bb_wdd_filter) for the 2021 data.
detection_id: Unique ID of the waggle phase.
label: One of "waggle", "activating", "ventilating", "trembling", "other". Here "waggle" denotes a waggle phase, "activating" is the shaking signal, and "ventilating" is a bee fanning her wings. "trembling" denotes a tremble dance, but the distinction from the "other" class was often not clear, so "trembling" was merged into "other" for training.
orientation: The body orientation of the bee that triggered the detection in radians (0: facing to the right, PI / 4: facing up).
metadata_path: Path to the individual detection in the same directory structure as created by the waggle dance detector.

Berlin2021_waggle_phase_classifier_ground_truth.zip
The output of the waggle dance detector (bb_wdd2) that corresponds to Berlin2021_waggle_phase_classifier_labels.csv and is used for training. The archive includes a directory structure as output by bb_wdd2, and each directory includes the original image sequence that triggered the detection in an archive and the corresponding metadata. The training code supplied in bb_wdd_filter works directly with this directory structure.

Berlin2019_tracks.zip
Detections and tracks from the recording season in 2019 as produced by our tracking system. As the full data is several terabytes in size, we include here the subset that is relevant for our publication, which comprises over 46 million detections. We included tracks for all detected behaviors (dancing, following, attending), including one minute before and after the behavior. We also included all tracks that correspond to the labeled and unlabeled data that was used to train the dance classifier, including 30 seconds before and after the data used for training. We grouped the exported data by date to make handling easier, but to work with the data efficiently, we recommend importing it into an indexable database. The individual files contain the following columns:
cam_id: Camera ID (0: left side of the hive, 1: right side of the hive).
timestamp: Date and time of the detection.
frame_id: Unique ID of the video frame of the recording from which the detection was extracted.
track_id: Unique ID of an individual track (short motion path from one individual). For longer tracks, the detections can be linked based on the bee_id.
bee_id: Unique ID of the individual bee.
bee_id_confidence: Confidence between 0 and 1 that the bee_id is correct, as output by our tracking system.
x_pos_hive, y_pos_hive: Spatial position of the bee in the hive on the side indicated by cam_id. Given in millimeters after applying a homography on the video material.
orientation_hive: Orientation of the bee's thorax in the hive in radians (0: oriented to the right, PI / 4: oriented upwards).

Berlin2019_feeder_experiment_log.csv
Experiment log for our feeder experiments in 2019.
date: Date given in the format year-month-day.
feeder_cam_id: Numeric ID of the feeder.
coordinates: Longitude and latitude of the feeder. For feeders 1 and 2 this is only given once and held constant. Feeder 3 had varying locations.
time_opened, time_closed: Date and time when the feeder was set up or closed again.
sucrose_solution: Concentration of the sucrose solution given as sugar:water (in terms of weight). On days where feeder 3 was open, the other two feeders offered water without sugar.

Software used to acquire and analyze the data:
bb_pipeline: Tag localization and decoding pipeline
bb_pipeline_models: Pretrained localizer and decoder models for bb_pipeline
bb_binary: Raw detection data storage format
bb_irflash: IR flash system schematics and Arduino code
bb_imgacquisition: Recording and network storage
bb_behavior: Database interaction and data (pre)processing, feature extraction
bb_tracking: Tracking of bee detections over time
bb_wdd2: Automatic detection and decoding of honey bee waggle dances
bb_wdd_filter: Machine learning model to improve the accuracy of the waggle dance detector
bb_dance_networks: Detection of dancing and following behavior from trajectories
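
The description states that Berlin2019_followers.csv corresponds to the dances in Berlin2019_dances.csv through the shared dance_id column. The following is a minimal sketch of that join using only the Python standard library. It assumes that the CSV headers match the field names listed above and that timestamps carry a UTC offset; the sample rows, IDs, and the helper names read_rows and follower_summary are hypothetical and not taken from the dataset or its software.

```python
import csv
import io
from collections import defaultdict
from datetime import datetime

# Hypothetical sample rows. The real data would be read from
# Berlin2019_dances.csv and Berlin2019_followers.csv instead.
DANCES_CSV = """\
dancer_id,dance_id,ts_from,ts_to,cam_id,median_x,median_y,feeder_cam_id
101,1,2019-08-01T10:00:00+00:00,2019-08-01T10:00:30+00:00,0,120.5,80.2,1
102,2,2019-08-01T11:15:00+00:00,2019-08-01T11:15:12+00:00,1,95.0,60.1,2
"""

FOLLOWERS_CSV = """\
dance_id,follower_id,ts_from,ts_to,label,cam_id
1,201,2019-08-01T10:00:05+00:00,2019-08-01T10:00:20+00:00,follower,0
1,202,2019-08-01T10:00:10+00:00,2019-08-01T10:00:12+00:00,attendance,0
2,203,2019-08-01T11:15:01+00:00,2019-08-01T11:15:09+00:00,follower,1
"""


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by the header names."""
    return list(csv.DictReader(io.StringIO(text)))


def follower_summary(dances_text, followers_text):
    """Join followers to dances on dance_id.

    Returns, per dance, the dancer, the set of distinct followers, and the
    total following time in seconds. Rows labeled "attendance" are skipped.
    """
    dances = {row["dance_id"]: row for row in read_rows(dances_text)}
    summary = defaultdict(
        lambda: {"dancer_id": None, "followers": set(), "follow_seconds": 0.0}
    )
    for row in read_rows(followers_text):
        dance = dances.get(row["dance_id"])
        if dance is None or row["label"] != "follower":
            continue
        # ISO 8601 timestamps with an offset parse directly with fromisoformat.
        start = datetime.fromisoformat(row["ts_from"])
        end = datetime.fromisoformat(row["ts_to"])
        entry = summary[row["dance_id"]]
        entry["dancer_id"] = dance["dancer_id"]
        entry["followers"].add(row["follower_id"])
        entry["follow_seconds"] += (end - start).total_seconds()
    return dict(summary)


summary = follower_summary(DANCES_CSV, FOLLOWERS_CSV)
for dance_id, entry in sorted(summary.items()):
    print(dance_id, entry["dancer_id"], sorted(entry["followers"]), entry["follow_seconds"])
```

For the full Berlin2019_tracks.zip export, an in-memory dictionary like this is unlikely to scale; the description's own recommendation of an indexable database is the better fit there.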

Created:

2023-06-28
