汕头市工业和信息化局部门决算公开信息|财政管理数据集|政府效能数据集
收藏全国 1∶200 000 数字地质图(公开版)空间数据库
As the only one of its kind, China National Digital Geological Map (Public Version at 1∶200 000 scale) Spatial Database (CNDGM-PVSD) is based on China' s former nationwide measured results of regional geological survey at 1∶200 000 scale, and is also one of the nationwide basic geosciences spatial databases jointly accomplished by multiple organizations of China. Spatially, it embraces 1 163 geological map-sheets (at scale 1: 200 000) in both formats of MapGIS and ArcGIS, covering 72% of China's whole territory with a total data volume of 90 GB. Its main sources is from 1∶200 000 regional geological survey reports, geological maps, and mineral resources maps with an original time span from mid-1950s to early 1990s. Approved by the State's related agencies, it meets all the related technical qualification requirements and standards issued by China Geological Survey in data integrity, logic consistency, location acc racy, attribution fineness, and collation precision, and is hence of excellent and reliable quality. The CNDGM-PVSD is an important component of China' s national spatial database categories, serving as a spatial digital platform for the information construction of the State's national economy, and providing informationbackbones to the national and provincial economic planning, geohazard monitoring, geological survey, mineral resources exploration as well as macro decision-making.
DataCite Commons 收录
MAV-VID, Drone-vs-Bird, Anti-UAV
本研究涉及三个数据集:MAV-VID、Drone-vs-Bird和Anti-UAV,总计包含241个视频,共计331,486张图像。这些数据集由杜伦大学创建,用于无人机视觉检测和跟踪的研究。数据集内容丰富,包括从地面和无人机搭载的摄像头捕获的图像,涵盖了多种环境和条件。创建过程中,数据集经过精心标注和处理,以确保数据质量。这些数据集主要用于评估和改进无人机检测和跟踪技术,特别是在复杂环境和动态场景中的应用。
arXiv 收录
Traditional-Chinese-Medicine-Dataset-SFT
该数据集是一个高质量的中医数据集,主要由非网络来源的内部数据构成,包含约1GB的中医各个领域临床案例、名家典籍、医学百科、名词解释等优质内容。数据集99%为简体中文内容,质量优异,信息密度可观。数据集适用于预训练或继续预训练用途,未来将继续发布针对SFT/IFT的多轮对话和问答数据集。数据集可以独立使用,但建议先使用配套的预训练数据集对模型进行继续预训练后,再使用该数据集进行进一步的指令微调。数据集还包含一定比例的中文常识、中文多轮对话数据以及古文/文言文<->现代文翻译数据,以避免灾难性遗忘并加强模型表现。
huggingface 收录
Asteroids by the Minor Planet Center
包含所有已知小行星的轨道数据和观测数据。数据来源于Minor Planet Center,格式包括Fortran (.DAT)和JSON,数据集大小为81MB(压缩)和450MB(未压缩),记录数约750,000条,每日更新。
github 收录
DeepFashion
DeepFashion数据集是一个大规模的时尚识别和检索数据集,包含289,222张多样化的衣物图像,以及详细的边界框、时尚地标、类别和属性标注。该数据集由多媒体实验室,香港中文大学开发,用于支持非商业研究及教育目的。
github 收录