five

Agricultural named entity recognition technology based on thought chain distillation and counterfactual reasoning

收藏
中国科学数据2025-12-23 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.7671/j.issn.1001-411X.202507003
下载链接
链接失效反馈
官方服务:
资源简介:
ObjectiveTo address the issues of hallucinations, contextual logical inconsistencies, and inability to run on low-resource devices when large language models perform named entity recognition (NER) in agriculture.MethodUsing DeepSeek with 671 billion parameters (DeepSeek-671B) as the teacher model, domain knowledge was transferred to student models with fewer parameters. The student models selected were low-parameter versions of DeepSeek, Qwen, and Llama (1.5 billion, 7.0 billion, and 14.0 billion parameters, abbreviated as 1.5B, 7.0B and 14B respectively), which underwent distillation and counterfactual reasoning training. Model performance was experimentally validated on the CropDiseaseNer dataset, a specialized agricultural disease dataset.ResultBy comparing the performance of a series of distilled student models, the results showed that DeepSeek-14B achieved an entity recognition F1 score of 89.60% while requiring only 2.08% of the parameters of the teacher model. Its performance significantly outperformed both the general-purpose large model GPT-mini-14B (F1 score: 57.64%) and the domain-adapted model GLiNER (F1 score: 82.96%) based on a general LLM. Further analysis revealed that the DeepSeek student model, sharing the same architecture, demonstrated superiority over models with different architectures in recognizing long-tail categories such as disease entities and pathogen genus names, owing to its parameter alignment advantage.ConclusionThis study validates the effectiveness of knowledge distillation in NER tasks within the agricultural domain, offering a novel solution for entity recognition technology in resource-constrained scenarios.
创建时间:
2025-12-23
二维码
社区交流群
二维码
科研交流群
商业服务