Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers
Source: arXiv | Updated: 2024-05-14 | Indexed: 2024-06-21
Download link:
https://github.com/iisresearch-team/summarization-dataset
Description:
This dataset, the "Russian-Language Multimodal Dataset for Automatic Summarization of Scientific Papers", was created by Novosibirsk State University as a resource for automatic summarization of Russian-language scientific papers. It contains 420 papers from 7 scientific fields and covers several modalities: text, tables, and images. The papers were curated with the aim of keeping the collection diverse and representative. The dataset is intended mainly for evaluating and improving automatic text summarization methods, particularly for scientific literature, with the goal of making research more efficient and information faster to access.
Provided by:
Novosibirsk State University
Created:
2024-05-14



