The Hansard 19th-Century British Parliamentary Debates with Improved Speaker Names: Parsed Debates, N-Gram Counts, Special Vocabulary, Collocates, and Topics
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://doi.org/10.7910/DVN/ZCYJH8
Description:
The Hansard 19th-Century British Parliamentary Debates with Improved Speaker Names: Parsed Debates, N-Gram Counts, Special Vocabulary, Collocates, and Topics is an analysis-ready corpus of the 19th-century British Parliamentary Debates (1803-1909). The corpus identifies debates whose records are missing from the UK Parliament's corpus and adds a field for disambiguated speakers. We believe these improvements will enhance scholarship in digital history by enabling researchers to study the Hansard debates, including speaker discourse, in ways that were not previously accessible. To support further analysis of the debates, we also provide supplementary metadata such as tokens and their raw counts, bigrams and their raw counts, special vocabulary, speaker metadata, and topics from LDA topic modeling. This data set was produced during a project that received funding from the National Science Foundation, award number 182209.
Created:
2023-05-24



