WebNLG Dataset
收藏WebNLG Dataset Summary
Data Releases
-
release_v2
- Latest release.
- Includes release_v1 and test data (seen categories) from the WebNLG challenge.
- Split into train/dev/test with equal representation of DBpedia categories and tripleset sizes.
- Includes tree shapes and types (sibling, chain, mixed) for each input RDF tree.
-
release_v2_constrained
- Contains the same data as release_v2.
- Split into train/dev/test with a more challenging constraint: no triple occurring in train/dev is present in test.
-
release_v1
- Matches Final Release (Larger Dataset) on the challenge website.
- Does not include test data (seen categories) from the challenge.
- No split into train/dev/test provided.
- Covers 15 DBpedia categories.
-
webnlg_challenge_2017
- Contains data used in the WebNLG Challenge 2017.
- Covers 10 DBpedia categories (partially for the City category).
Data Formats
- Each folder contains the same data in two formats:
xmlandjson.
Dataset Coverage
- DBpedia Categories:
- release_v1: 15 categories
- webnlg_challenge_2017: 10 categories (partially for City)
Citation Information
-
For general use of the WebNLG corpus:
@InProceedings{gardent2017creating, author = "Gardent, Claire and Shimorina, Anastasia and Narayan, Shashi and Perez-Beltrachini, Laura", title = "Creating Training Corpora for NLG Micro-Planners", booktitle = "Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)", year = "2017", publisher = "Association for Computational Linguistics", pages = "179--188", location = "Vancouver, Canada", doi = "10.18653/v1/P17-1017", url = "http://www.aclweb.org/anthology/P17-1017" }
-
For use of release_v2_constrained:
@InProceedings{shimorina2018handling, author = "Shimorina, Anastasia and Gardent, Claire", title = "Handling Rare Items in Data-to-Text Generation", booktitle = "Proceedings of the 11th International Conference on Natural Language Generation", year = "2018", publisher = "Association for Computational Linguistics", location = "Tilburg, The Netherlands" }
License




