five

Biber et al.'s (2016) set of 150 BNC items for the analysis of dispersion measures: Dataset for "Evaluation of text-level measures of lexical dispersion"

收藏
DataverseNO2025-01-01 更新2026-04-13 收录
下载链接:
https://dataverse.no/citation?persistentId=doi:10.18710/ATCQZW
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains frequencies for a set of 150 word forms in the BNC. The set of items was compiled by Biber et al. (2016) for the purpose of analyzing the behavior of dispersion measures in different distributional settings. It was therefore assembled to cover a broad range of frequency and dispersion levels. For each form, the dataset lists (i) the number occurrences in each of the 4049 text files in the BNC, including zero counts; and (ii) the length of each text file, i.e. the number of word tokens it contains.
提供机构:
University of Bamberg
创建时间:
2025-01-01
二维码
社区交流群
二维码
科研交流群
商业服务