文旅垂类大模型高质量训练数据集
收藏北京国际大数据交易所2025-06-04 收录
下载链接:
https://webs.bjidex.com/sys-bsc-home/#/bscConsole/tradingMarket/detail?id=4498
下载链接
链接失效反馈官方服务:
资源简介:
文旅垂类大模型高质量训练数据集包括指令微调数据集和测试数据集。指令微调数据集是一组特定于文旅垂类的文本数据,用于微调预训练模型,使其更好地适应文旅领域服务任务,包括各种类型的文本,如旅游攻略、景点介绍、历史文化、美食推荐等,以提供更全面的文旅信息覆盖,是用户查询问题和相应的高质量回复,以训练模型更好地理解用户意图和生成相关回答。测试数据集用于评估和验证训练后模型的性能、准确性和安全性,涵盖文旅领域认知、内容生成、专业知识、专业逻辑、安全性等维度的测试问答对,可用于测试模型的回答能力和领域适应性、模型的鲁棒性和应对复杂情况的能力,以全面评估模型的回答质量。通过文旅垂类大模型高质量训练数据集应用可以有效地提高模型的性能和适应性,使其能够更好地满足文旅垂类的需求,并提升用户体验。
The High-Quality Training Dataset for Cultural Tourism Vertical Large Language Models consists of an instruction fine-tuning dataset and a test dataset. The instruction fine-tuning dataset is a collection of text data tailored specifically for the cultural tourism vertical domain, used to fine-tune pre-trained models to better adapt them to cultural tourism-related service tasks. It covers various types of texts such as travel guides, attraction introductions, historical and cultural content, food recommendations, etc., to achieve comprehensive coverage of cultural tourism information. Composed of high-quality user query-response pairs, it trains the model to better comprehend user intentions and generate relevant responses. The test dataset is utilized to evaluate and validate the performance, accuracy, and safety of the post-training model. It includes test question-answer pairs covering multiple dimensions such as cultural tourism domain cognition, content generation, professional expertise, professional logic, and safety. This dataset can be used to assess the model's response capabilities, domain adaptability, robustness, and ability to handle complex scenarios, thereby enabling a comprehensive evaluation of the quality of the model's outputs. The application of this high-quality training dataset for cultural tourism vertical large language models can effectively improve the model's performance and adaptability, enabling it to better meet the demands of the cultural tourism vertical domain and elevate user experience.
提供机构:
中关村科学城城市大脑股份有限公司
搜集汇总
数据集介绍

背景与挑战
背景概述
该文旅垂类训练数据集包含指令微调数据和测试数据,涵盖旅游攻略、景点介绍等多元文本,用于优化模型在文旅领域的理解与生成能力。测试集通过多维度评估模型性能,旨在提升服务精准度和用户体验。
以上内容由遇见数据集搜集并总结生成



