Automated Review of Academic Literature Generation and Assessment Dataset
Source: DataCite Commons. Updated: 2025-04-27. Indexed: 2025-04-16.
Download link:
https://www.scidb.cn/detail?dataSetId=20f98ebaac2947e5a32f68951be7579b
Description:
This dataset is the supporting data for the paper ‘Research on Intelligent Generation and Evidence Based of Literature Review Based on Large Language Model’. It contains the following four files:
1. A self-constructed step-level literature review dataset for academic literature. Abstracts from Chinese journals on the Peking University core journal list and from English SCI journals serve as the data source. Step-level abstracts under the same subject term are obtained through step recognition and vector search. A large language model then generates step-level literature reviews from multiple abstracts, and the output is manually reviewed and revised. The resulting dataset is used to fine-tune a local large language model.
2. Literature reviews generated on the validation set by the baseline models ChatGLM3-6B, GLM-3-Turbo, and GPT-3.5-turbo and by the fine-tuned model GLM-Lora.
3. Evaluation of the outputs of the different models with TF-IDF weighted cosine similarity, BLEU, and ROUGE, using the manually reviewed original reviews as the reference text.
4. Large-model assessment of the outputs of the different models using GEMINI-Pro and GPT-4, plus an authenticity assessment using GPT-4.
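As an illustration of the TF-IDF weighted cosine similarity mentioned among the evaluation metrics, the following is a minimal sketch, not the authors' evaluation code. The function names (`tfidf_vectors`, `cosine_similarity`, `tfidf_cosine`) are hypothetical, and the whitespace tokenization, the smoothed IDF formula, and the choice to compute IDF over only the two texts being compared are all assumptions; the dataset description does not state which variants were used. Chinese text would need a word segmenter rather than whitespace splitting.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (a dict) per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumption; the dataset does not state the variant).
    idf = {term: math.log((1 + n_docs) / (1 + count)) + 1
           for term, count in df.items()}
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Term frequency is normalized by document length.
        vectors.append({term: (freq / len(doc)) * idf[term]
                        for term, freq in tf.items()})
    return vectors


def cosine_similarity(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an empty text has no direction to compare
    return dot / (norm_u * norm_v)


def tfidf_cosine(reference, candidate):
    """Score a generated text against a reference text."""
    # Whitespace tokenization is an assumption made for this sketch.
    ref_tokens = reference.lower().split()
    cand_tokens = candidate.lower().split()
    ref_vec, cand_vec = tfidf_vectors([ref_tokens, cand_tokens])
    return cosine_similarity(ref_vec, cand_vec)
```

Calling `tfidf_cosine(reference_review, generated_review)` returns a score between 0 and 1, where higher values indicate greater lexical overlap with the reference. In practice the IDF weights would more likely be estimated over the whole validation set than over a single pair of texts.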
Provider:
Science Data Bank
Created:
2024-12-12



