Santa Anita track, opening day crowd shots, Arcadia, Calif., 1937
收藏Materials Project 在线材料数据库
Materials Project 是一个由伯克利加州大学和劳伦斯伯克利国家实验室于 2011 年共同发起的大型开放式在线材料数据库。这个项目的目标是利用高通量第一性原理计算,为超过百万种无机材料提供全面的性能数据、结构信息和计算模拟结果,以此加速新材料的发现和创新过程。数据库中的数据不仅包括晶体结构和能量特性,还涵盖了电子结构和热力学性质等详尽信息,为研究人员提供了丰富的材料数据资源。相关论文成果为「Commentary: The Materials Project: A materials genome approach to accelerating materials innovation」。
超神经 收录
yahoo-finance-data
该数据集包含从Yahoo! Finance、Nasdaq和U.S. Department of the Treasury获取的财务数据,旨在用于研究和教育目的。数据集包括公司详细信息、高管信息、财务指标、历史盈利、股票价格、股息事件、股票拆分、汇率和每日国债收益率等。每个数据集都有其来源、简要描述以及列出的列及其数据类型和描述。数据定期更新,并以Parquet格式提供,可通过DuckDB进行查询。
huggingface 收录
PCLT20K
PCLT20K数据集是由湖南大学等机构创建的一个大规模PET-CT肺癌肿瘤分割数据集,包含来自605名患者的21,930对PET-CT图像,所有图像都带有高质量的像素级肿瘤区域标注。该数据集旨在促进医学图像分割研究,特别是在PET-CT图像中肺癌肿瘤的分割任务。
arXiv 收录
HIT-UAV Dataset
The HIT-UAV: A High-Altitude Infrared Thermal Dataset for Unmanned Aerial Vehicle-Based Object Detection dataset consists of 2,898 infrared thermal images. These images were extracted from a larger pool of 43,470 frames sourced from numerous videos, all of which were publicly available and had undergone desensitization for privacy reasons. In order to enhance the dataset's utility for various tasks, the HIT-UAV10 dataset includes two types of annotated bounding boxes for each object within the images: oriented bounding boxes, designed to address the challenge of significant overlap between object instances in aerial images, and standard bounding boxes, aimed at facilitating efficient dataset utilization. This comprehensive dataset encompasses five distinct object categories: person, car, bicycle, other vehicle, and dontcare, totaling 24,899 annotated objects. The DontCare category encompasses objects that proved difficult for annotators to categorize accurately, with additional details provided in the Methods section.
datasetninja.com 收录
NASA Battery Dataset
用于预测电池健康状态的数据集,由NASA提供。
github 收录