
Darmstadt Fanfiction Corpus 1.0 (Fanfiktion.de, 2020-2023)

Source: DataCite Commons. Updated 2024-06-11; indexed 2024-07-13.
Download link:
https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4245
Description:
A corpus of mostly German-language fanfiction texts created or updated in 2020-2023 from the website Fanfiktion.de. The website was scraped every month, and the monthly corpora were later merged into one. The corpus consists of four components: the texts (in .csv and .txt format), the reviews of texts updated in the selected period, the text metadata, and the user information for each author. There are 67538 texts by 22738 authors, and reviews were written on 51188 texts.

The corpus allows the creation of subcorpora based on fandom, fanfiction genre, author metadata, age restriction, word or chapter count, and review or endorsement numbers. Further, the corpus can be transformed into network data using reviews as relations.

ACCESS: To access the corpus, please send a signed PDF of the Statement for the use of the Darmstadt Fanfiction Corpus, in English or in German, to anastasia.glawion@tu-darmstadt.de or thomas.weitin@tu-darmstadt.de.
Provider:
Technical University of Darmstadt
Created:
2024-06-11