Automated Review of Academic Literature Generation and Assessment Dataset

Name: Automated Review of Academic Literature Generation and Assessment Dataset
Creator: Science Data Bank
Published: 2025-04-27 18:20:16
License: 暂无描述

DataCite Commons2025-04-27 更新2025-04-16 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=20f98ebaac2947e5a32f68951be7579b

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset is the supporting data for the paper ‘Research on Intelligent Generation and Evidence Based of Literature Review Based on Large Language Model’, including the following four files:1. self-constructed academic literature language step-level literature review dataset. Literature abstracts in Chinese Peking University core journals and English SCI journals are selected as the data source, and step-level abstracts under the same subject term are obtained through step recognition and vector search, and the step-level literature review dataset is generated based on multiple abstracts using a big language model with manual review and revision for fine-tuning the local big language model.2. results were obtained using the baseline models ChatGLM3-6B and GLM-3-Turbo, GPT-3.5-turbo, and the fine-tuned model GLM-Lora for simultaneous literature review generation of the validation set.3. using the manually reviewed original review as the reference text, the generated results of different models were evaluated using TF-IDF weighted cosine similarity, BLEU, ROUGE metrics.4. large model assessment of the generated results of different models using GEMINI-Pro, GPT-4 and also GPT-4 for authenticity assessment.

提供机构：

Science Data Bank

创建时间：

2024-12-12

5,000+

优质数据集

54 个

任务类型

进入经典数据集