OSS Linguistic Behavior Data
Source: Figshare. Updated: 2022-09-02. Indexed: 2026-04-08.
Download link:
https://figshare.com/articles/dataset/20_projects_raw_data/20788189/3
Description:
<strong>1. Corpus_labelled_20_projects.rar contains all data for the 20 projects. All analyses are based on the data in this archive.</strong>
<strong>2. Raw Data.rar contains the raw data of the 20 projects, including pull requests, commits, issue comments, and issue events.</strong>
<strong>3. adj+adv data.xlsx contains the data supporting the comparisons with the Google 1-gram dataset.</strong>
Column 1, 6: Adjective/adverb word
Column 2: Number of occurrences of the word in the 20 projects
Column 3, 8: Total number of words in the 20 projects
Column 4, 9: Frequency of the word in the 20 projects
Column 5, 10: Frequency of the word in the Google 1-gram dataset
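The column layout of adj+adv data.xlsx suggests a simple per-word comparison, which the following minimal Python sketch illustrates. It assumes that the project frequency (Column 4, 9) is the occurrence count divided by the total word count; the description does not state this explicitly. The `WordRow` class, its field names, the helper functions, and the sample values are all hypothetical and are not taken from the dataset.

```python
from dataclasses import dataclass


@dataclass
class WordRow:
    """One word entry; field names are hypothetical, not the spreadsheet's headers."""
    word: str            # Column 1, 6: adjective/adverb word
    count: int           # Column 2: occurrences in the 20 projects
    total_words: int     # Column 3, 8: total words in the 20 projects
    google_freq: float   # Column 5, 10: frequency in the Google 1-gram dataset


def project_frequency(row: WordRow) -> float:
    # Assumption: Column 4, 9 equals count / total word count.
    return row.count / row.total_words


def frequency_ratio(row: WordRow) -> float:
    # A value above 1 means the word is relatively more frequent
    # in the 20 projects than in the Google 1-gram dataset.
    return project_frequency(row) / row.google_freq


# Made-up values for illustration only.
row = WordRow(word="good", count=50, total_words=100_000, google_freq=0.00025)
print(row.word, project_frequency(row), frequency_ratio(row))
```

A ratio is only one possible way to compare the two frequency columns; the description does not say which comparison the original analysis used.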
Provided by:
Han, Yis
Created:
2022-09-02



