Darmstadt Fanfiction Corpus 1.0 (Fanfiktion.de, 2020-2023)
收藏DataCite Commons2024-06-11 更新2024-07-13 收录
下载链接:
https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4245
下载链接
链接失效反馈官方服务:
资源简介:
A corpus of mostly German-language fanfiction texts created or updated in 2020-2023 from the website Fanfiktion.de. The website was scraped every month, the monthly corpora were later merged into one. The corpus consists of four components: the texts (in .csv and .txt format), the reviews to texts updated in the selected period, the text metadata, and the user information for each author. There are 67538 texts by 22738 authors, and reviews were written on 51188 texts. <br> The corpus allows the creation of subcorpora based on fandom, fanfiction genre, author metadata, age restriction, word or chapter count, and review or endorsement numbers. Further, the corpus can be transformed into network data using reviews as relations. <br> ACCESS: To access to the corpus, please send a signed PDF of the Statement for the use of the Darmstadt Fanfiction Corpus in English or in German to anastasia.glawion@tu-darmstadt.de or thomas.weitin@tu-darmstadt.de.
提供机构:
Technical University of Darmstadt
创建时间:
2024-06-11



