five

政务热线服务AI问答对训练数据集

收藏
库帕思2025-12-10 更新2025-12-20 收录
下载链接:
https://www.kupasai.com/corpus/detail?id=541&type=1
下载链接
链接失效反馈
官方服务:
资源简介:
<p>本数据集是为增强语言大模型中文口语能力的AI公司,以及开发政务服务AI助手、智能客服机器人的研发团队提供的基础数据资源,旨在解决政务领域高质量、标准化问答数据稀缺,导致AI回复不准确、不规范的核心痛点。数据集基于真实的12345热线工单,通过大模型语义理解与人工精细核验,构建了2500条覆盖政策咨询、问题投诉、应急处置等多类民生场景的高质量问答对。其独特价值在于将原始工单内容转化为第一人称口语化提问,完整保留“市民自然表达-官方规范答复”的真实对话结构,并附有珍贵的市民满意度回访标签作为监督信号。该资源可直接用于监督学习,训练模型理解多样化口语表达,并学习针对各类市民诉求的标准回复范式,有效推进政务问答的自动化与智能化。基于本数据集,基座模型可增强对中文口语的语义理解能力,政务服务AI则可构建回复更准确、口径更规范、服务更高效的智能系统,从而显著提升政务服务的响应质量与市民满意度。</p>

This dataset serves as a foundational data resource for AI companies aiming to enhance the Chinese spoken language capabilities of large language models (LLMs), as well as R&D teams developing government service AI assistants and intelligent customer service robots. It is designed to address the core pain point of scarce high-quality, standardized question-answering data in the government affairs sector, which leads to inaccurate and non-standard AI responses. Constructed from real 12345 hotline work orders, this dataset includes 2,500 high-quality question-answering pairs covering multiple livelihood scenarios such as policy consultation, problem complaints, and emergency response, developed through large language model-based semantic understanding and manual fine-grained verification. Its unique value lies in converting original work order content into first-person spoken questions, fully preserving the authentic dialogue structure of "citizen's natural expression and official standardized reply", and comes with valuable citizen satisfaction follow-up survey tags as supervision signals. This resource can be directly applied to supervised learning, enabling models to comprehend diverse spoken expressions and master standardized reply paradigms for various citizen demands, effectively advancing the automation and intelligence of government affairs question-answering systems. Based on this dataset, base LLMs can improve their semantic understanding of Chinese spoken language, while government service AI can build intelligent systems with more accurate replies, more consistent response standards, and more efficient services, thereby significantly enhancing the response quality of government services and citizen satisfaction.
提供机构:
库帕思
创建时间:
2025-12-05
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集是为提升语言大模型中文口语能力和开发政务服务AI助手而设计,包含72505条基于真实12345热线工单的问答对,格式为json,总容量107.32MB。主要应用于城市治理领域,解决政务领域高质量问答数据稀缺的问题。
以上内容由遇见数据集搜集并总结生成
二维码
社区交流群
二维码
科研交流群
商业服务