
Words in UHCL Theses and Dissertations

Source: DataCite Commons. Updated 2020-09-02; indexed 2024-07-25.
Download link:
https://figshare.com/articles/dataset/Words_in_UHCL_Theses_and_Dissertations/4956782
Description:
This dataset includes only the words from thesis and dissertation titles; it is not the full thesis and dissertation dataset with advisors and colleges. To view that dataset, go to https://figshare.com/articles/UHCL_Theses_and_Dissertations/4959161.

A dataset including the titles of all graduate projects, theses, and dissertations ever submitted to UHCL was pulled from the library catalog system and read into R. The titles were collapsed into a single vector, and the vector was loaded as a corpus in the tm package. All punctuation was converted to spaces and the text was converted to lowercase; stopwords, extra white space, and single characters were then removed.

Words from titles in all colleges are in the titlewords.csv file. The other four CSV files cover titles in specific colleges.

The word cloud was created in Tableau and is hosted at http://libguides.uhcl.edu/thesesdissertations/titlewordcloud.

thesisWordCloud.R is the code, and titleWordCloud.pdf is a Markdown document describing the process in a bit more depth.
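The cleaning steps described above can be sketched in a few lines. This is not the code in thesisWordCloud.R, which uses R and the tm package; it is a rough Python approximation of the same sequence of steps. The function name `title_words`, the sample titles, and the very short stopword list are all assumptions made for illustration, so the resulting counts will not match the published CSV files exactly.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list. The tm package's English
# stopword list is much longer, so real results would differ.
STOPWORDS = {"a", "an", "and", "the", "of", "in", "on", "for", "to", "with"}


def title_words(titles, stopwords=STOPWORDS):
    """Count cleaned words across a list of title strings."""
    text = " ".join(titles)               # collapse all titles into one string
    text = re.sub(r"[^\w\s]", " ", text)  # replace punctuation with a space
    text = text.lower()                   # normalize case
    words = text.split()                  # splitting also drops extra whitespace
    # Drop stopwords and single-character tokens.
    return Counter(w for w in words if w not in stopwords and len(w) > 1)


if __name__ == "__main__":
    # Made-up titles, not taken from the UHCL catalog.
    sample = [
        "A Study of Water-Quality in Galveston Bay",
        "The Effects of Stress on Students: A Survey",
    ]
    for word, count in title_words(sample).most_common():
        print(word, count)
```

Replacing punctuation with a space rather than deleting it keeps hyphenated terms such as "Water-Quality" as two separate words instead of fusing them into one token.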
Provider:
figshare
Created:
2017-05-01