five

地质垂直大模型微调标注数据集

收藏
陕西省数据知识产权登记服务平台2025-10-17 更新2025-10-11 收录
下载链接:
http://data.snippc.com/registrationPublicity
下载链接
链接失效反馈
官方服务:
资源简介:
1.主要内容 本数据集的核心内容是从我单位历史地质调查报告、科研论文等文本中,精确抽取的结构化问答对。问题覆盖地质学相关的核心概念、原理、方法、资源勘查、工程应用、灾害防治及标准解读等多元主题,答案严格依据原文知识,确保专业性与准确性,构成用于微调地质大模型的问答数据集。 2.数据用途 本数据集用于微调地质垂直大模型,使其掌握地质领域专业知识,从而能够生成可靠、符合行业规范的专业回答,缓解模型在垂直领域的“幻觉”现象。它可以用于构建地质知识库与智能问答系统,并为地质科研、教学、矿产勘探、工程勘察及灾害评估等场景提供高效、精准的知识查询支持,能够提升大模型在地质报告撰写、文献解析、地质智能应用等专业任务中的实用价值与可信度。 3.涵盖范围 本数据集来源于我单位历史地质调查报告、科研论文等文本中,广泛涵盖地质学各分支学科,包括构造地质学、矿物岩石学、矿床学、水文地质学、工程地质学、环境地质学、古生物地层学及地球物理勘探应用等。应用场景包括理论研究、资源勘探、地质调查、灾害防治、环境评价等知识需求,形成支撑地质智能化所需的专业知识体系。

1. Core Content: The core of this dataset is structured question-answer pairs accurately extracted from historical geological survey reports, research papers and other texts of our institution. The questions cover multiple topics including core geological concepts, principles, methods, resource exploration, engineering applications, disaster prevention and control, standard interpretation and other diverse themes. The answers are strictly based on the original textual knowledge to ensure professionalism and accuracy, forming a question-answer dataset for fine-tuning geological large language models. 2. Data Application: This dataset is used for fine-tuning vertical geological large language models, enabling them to master professional geological knowledge, generate reliable and industry-standard professional responses, and mitigate the "hallucination" phenomenon of models in vertical domains. It can be used to construct geological knowledge bases and intelligent question-answering systems, providing efficient and accurate knowledge query support for scenarios such as geological scientific research, teaching, mineral exploration, engineering survey and disaster assessment, and enhancing the practical value and credibility of large language models in professional tasks including geological report writing, literature analysis and geological intelligent applications. 3. Coverage Scope: This dataset is sourced from historical geological survey reports, research papers and other texts of our institution, and extensively covers various sub-disciplines of geology, including structural geology, mineralogy and petrology, ore deposit geology, hydrogeology, engineering geology, environmental geology, paleontology and stratigraphy, and geophysical exploration applications. The application scenarios include knowledge demands for theoretical research, resource exploration, geological survey, disaster prevention and control, environmental assessment and other aspects, forming a professional knowledge system supporting geological intelligence.
提供机构:
西安煤科透明地质科技有限公司
创建时间:
2025-08-27
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集是'地质垂直大模型微调标注数据集',由西安煤科透明地质科技有限公司登记,属于企业数据,目前处于公示中状态。它主要用于地质领域的垂直大模型微调标注,但详情页未提供具体数据规模、格式或应用场景等细节。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作