Vocabulary richness in Main and Simple.
Source: NIAID Data Ecosystem (indexed 2026-03-07)
Download link:
https://figshare.com/articles/dataset/_Vocabulary_richness_in_Main_and_Simple_/217214
Description:
For the definitions of the conditions (character- or word-balanced, with or without punctuation, with or without Porter stemming), see the Methods section. SR is the size ratio (number of characters in the C conditions, number of words in the W conditions) for comparable Main and Simple corpora. CM and CS are Herdan’s C for Main and Simple, respectively. As the last column shows, the vocabulary richness of comparable Simple and Main corpora differs by at most 0.72%, depending on the condition.
Created:
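As a rough illustration of the measure named above, Herdan’s C is commonly defined as log V / log N, where V is the number of distinct word types and N is the total number of tokens. The sketch below computes it in Python. The `tokenize` helper is a hypothetical stand-in and does not reproduce the preprocessing described in the Methods section (no balancing and no Porter stemming), and `relative_difference_percent` assumes the percentage is taken relative to the Main value, which the description does not confirm.

```python
import math
import re


def tokenize(text):
    """Hypothetical tokenizer (an assumption, not the dataset's method):
    lowercase the text and keep runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def herdan_c(tokens):
    """Herdan's C = log(V) / log(N), with V distinct types and N tokens."""
    n = len(tokens)
    v = len(set(tokens))
    if n < 2:
        # log(1) is 0, so C is undefined for fewer than two tokens.
        raise ValueError("need at least two tokens")
    return math.log(v) / math.log(n)


def relative_difference_percent(c_main, c_simple):
    """Assumed definition: absolute difference as a percentage of c_main."""
    return abs(c_main - c_simple) / c_main * 100.0
```

Calling `herdan_c(tokenize(text))` once on a Main sample and once on a comparable Simple sample gives values that play the role of CM and CS, which can then be passed to `relative_difference_percent`.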
2012-11-07



