five

Token-based data sets for the analysis of the academic language of literary studies and linguistics

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4306014
下载链接
链接失效反馈
官方服务:
资源简介:
These are the token-based data sets used for my PhD thesis ("Potentiale syntaktischer Annotationen für die datengeleitete Sprachbeschreibung am Beispiel der Wissenschaftssprachen der Germanistik", publication in progress). For python scripts and further data see https://github.com/melandresen/dissertation. Due to copyright law, the annotated texts of the corpus could only be published without the token layer. The files provided here include the token-based frequency data that have been derived from the origial texts and can be used as input to the analysis scripts in the GitHub-Repository.
创建时间:
2020-12-05
二维码
社区交流群
二维码
科研交流群
商业服务