Biber et al.'s (2016) set of 150 BNC items for the analysis of dispersion measures: Dataset for "Evaluation of text-level measures of lexical dispersion"
Source: DataverseNO | Updated: 2025-01-01 | Indexed: 2026-04-13
Download link:
https://dataverse.no/citation?persistentId=doi:10.18710/ATCQZW
Description:
This dataset contains frequencies for a set of 150 word forms in the British National Corpus (BNC). The set of items was compiled by Biber et al. (2016) to analyze how dispersion measures behave in different distributional settings, so it was assembled to cover a broad range of frequency and dispersion levels. For each form, the dataset lists (i) the number of occurrences in each of the 4049 text files in the BNC, including zero counts; and (ii) the length of each text file, i.e. the number of word tokens it contains.
Provider:
University of Bamberg
Created:
2025-01-01
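
Example (illustrative only):
The description above says each word form comes with per-text counts and per-text lengths. As a minimal sketch of how those two pieces fit together, the snippet below computes one commonly used dispersion measure, Gries's deviation of proportions (DP), which compares the share of a form's occurrences in each text with that text's share of the corpus. The function name and the three-text toy data are hypothetical; they are not taken from the dataset, and DP is not necessarily among the measures evaluated in the accompanying study.

```python
def deviation_of_proportions(counts, text_lengths):
    """Return Gries's DP for one word form.

    counts[i]       -- occurrences of the form in text i (zeros included)
    text_lengths[i] -- number of word tokens in text i
    DP is 0 when occurrences are spread exactly in proportion to text
    length and approaches 1 as they concentrate in a small part of the corpus.
    """
    if len(counts) != len(text_lengths):
        raise ValueError("counts and text_lengths must have the same length")
    total_count = sum(counts)
    total_length = sum(text_lengths)
    if total_count == 0 or total_length == 0:
        raise ValueError("DP is undefined for an unattested form or empty corpus")
    return 0.5 * sum(
        abs(c / total_count - n / total_length)
        for c, n in zip(counts, text_lengths)
    )


# Hypothetical toy corpus of three texts (not real BNC values).
text_lengths = [1000, 2000, 1000]
even_counts = [10, 20, 10]     # spread in proportion to text length
clumped_counts = [40, 0, 0]    # all occurrences in the first text

dp_even = deviation_of_proportions(even_counts, text_lengths)
dp_clumped = deviation_of_proportions(clumped_counts, text_lengths)
print(dp_even, dp_clumped)
```

With the real data, the same function would be applied to one form's vector of 4049 counts together with the vector of 4049 text lengths; how those vectors are loaded depends on the file layout of the download, which is not specified here.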



