GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation

DataCite Commons. Updated 2025-05-08. Indexed 2024-07-13.
Download link:
https://edmond.mpg.de/citation?persistentId=doi:10.17617/3.YGO7EW
Description:
Paper: "GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation" (COLING 2020) by Zhijing Jin, Qipeng Guo, Xipeng Qiu, and Zheng Zhang (https://aclanthology.org/2020.coling-main.217/).

Abstract: Data collection for the knowledge graph-to-text generation is expensive. As a result, research on unsupervised models has emerged as an active field recently. However, most unsupervised models have to use non-parallel versions of existing small supervised datasets, which largely constrain their potential. In this paper, we propose a large-scale, general-domain dataset, GenWiki. Our unsupervised dataset has 1.3M text and graph examples, respectively. With a human-annotated test set, we provide this new benchmark dataset for future research on unsupervised text generation from knowledge graphs.
Provider:
Edmond
Created:
2024-01-02