five

WWT-QA: A Chinese question answering dataset for the wastewater treatment domain

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/WWT-QA_A_Chinese_question_answering_dataset_for_the_wastewater_treatment_domain/31229695
下载链接
链接失效反馈
官方服务:
资源简介:
Under WWT_QA, the policy_qa directory contains train.json, val.json, and test.json with 19,074, 2,119, and 2,355 policy QA samples, respectively. These samples cover key points and interpretations of policies, regulations, and standards. The knowledge_qa directory contains three duplicate files: 12,168, 1,513, and 1,578 knowledge QA samples, respectively. These samples cover high-frequency frontline topics, including process flows, equipment maintenance, and operational control. All samples follow an Alpaca-style instruction-tuning format. Each sample is a JSON object with three fields: instruction, input, and output. Here, instruction represents the question, output corresponds to the answer, and input provides optional supplementary context (set to the empty string "" for all samples in WWT_QA). The dataset can be used for fine-tuning and benchmarking LLMs for wastewater treatment.
创建时间:
2026-02-02
二维码
社区交流群
二维码
科研交流群
商业服务