Hidden Room game in University of Cadiz data clustering by DeutschUCA
收藏Figshare2018-04-27 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/Hidden_Room_game_in_University_of_Cadiz_data_clustering_by_DeutschUCA/6194597
下载链接
链接失效反馈官方服务:
资源简介:
Histograms and results of k-means and Ward's clustering for Hidden Room game (Open Simulator) in University of Cadiz (Spain) by DeutschUCAThe fileset contains information from three sources:1. Histograms files:* Lexical_histogram.png (histogram of lexical error ratios)* Grammatical_histogram.png (histogram of grammatical error ratios)2. K-means clustering files:* elbow-lex kmeans.png (clustering by lexical aspects: error curves obtained for applying elbow method to determinate the optimal number of clusters)* cube-lex kmeans.png (clustering by lexical aspects: a three-dimensional representation of clusters obtained after applying k-means method)* Lexical_clusters (table) kmeans.xls (clustering by lexical aspects: centroids, standard deviations and number of instances assigned to each cluster)* elbow-gram kmeans.png (clustering by grammatical aspects: error curves obtained for applying elbow method to determinate the optimal number of clusters)* cube-gramm kmeans.png (clustering by grammatical aspects: a three-dimensional representation of clusters obtained after applying k-means method)* Grammatical_clusters (table) kmeans.xls (clustering by grammatical aspects: centroids, standard deviations and number of instances assigned to each cluster)* elbow-lexgram kmeans.png (clustering by lexical and grammatical aspects: error curves obtained for applying elbow method to determinate the optimal number of clusters)* Lexical_Grammatical_clusters (table) kmeans.xls (clustering by lexical and grammatical aspects: centroids, standard deviations and number of instances assigned to each cluster)* Grammatical_clusters_number_of_words (table) kmeans.xls : number of words (from column 2 to 4) and sizes (last column) obtained per each cluster by applying k-means clustering to grammatical error ratios.* Lexical_clusters_number_of_words (table) kmeans.xls : number of words (from column 2 to 4) and sizes (last column) obtained per each cluster by applying k-means clustering to lexical error ratios.* Lexical_Grammatical_clusters_number_of_words (table) kmeans.xls : number of words (from column 2 to 4) and sizes (last column) obtained per each cluster by applying k-means clustering to lexical and grammatical error ratios.3. Ward’s Agglomerative Hierarchical Clustering files:* Lexical_Cluster_Dendrogram_ward.png (clustering by lexical aspects: dendrogram obtained after applying Ward's clustering method).* Grammatical_Cluster_Dendrogram_ward.png (clustering by grammatical aspects: dendrogram obtained after applying Ward's clustering method)* Lexical_Grammatical_Cluster_Dendrogram_ward.png (clustering by lexical and grammatical aspects: dendrogram obtained after applying Ward's clustering method)* Lexical_Grammatical_clusters (table) ward.xls: Centroids (from column 2 to 7) and cluster sizes (last column) obtained by applying Ward's agglomerative hierarchical clustering to lexical and grammatical error ratios.* Grammatical_clusters (table) ward.xls: Centroids (from column 2 to 4) and cluster sizes (last column) obtained by applying Ward's agglomerative hierarchical clustering to grammatical error ratios.* Lexical_clusters (table) ward.xls: Centroids (from column 2 to 4) and cluster sizes (last column) obtained by applying Ward's agglomerative hierarchical clustering to lexical error ratios.* Lexical_clusters_number_of_words (table) ward.xls: number of words (from column 2 to 4) and sizes (last column) obtained per each cluster by applying Ward's agglomerative hierarchical clustering to lexical error ratios.* Grammatical_clusters_number_of_words (table) ward.xls: number of words (from column 2 to 4) and sizes (last column) obtained per each cluster by applying Ward's agglomerative hierarchical clustering to grammatical error ratios.* Lexical_Grammatical_clusters_number_of_words (table) ward.xls: number of words (from column 2 to 4) and sizes (last column) obtained per each cluster by applying Ward's agglomerative hierarchical clustering to lexical and grammatical error ratios.
本数据集为西班牙加的斯大学DeutschUCA团队针对《Hidden Room》(隐藏房间)开放模拟器(Open Simulator)版本游戏所产出的k均值聚类(k-means)与沃德聚类法(Ward's clustering)的直方图及实验结果。本文件集包含三类来源信息:
1. 直方图文件
* Lexical_histogram.png:词法错误率直方图
* Grammatical_histogram.png:语法错误率直方图
2. k均值聚类(k-means)文件
* elbow-lex kmeans.png:基于词法维度的聚类——应用肘部法则(elbow method)确定最优聚类数时得到的错误曲线
* cube-lex kmeans.png:基于词法维度的聚类——经k均值聚类后得到的三维聚类可视化结果
* Lexical_clusters (table) kmeans.xls:基于词法维度的聚类表——各聚类的质心、标准差及分配至每个聚类的样本数
* elbow-gram kmeans.png:基于语法维度的聚类——应用肘部法则确定最优聚类数时得到的错误曲线
* cube-gramm kmeans.png:基于语法维度的聚类——经k均值聚类后得到的三维聚类可视化结果
* Grammatical_clusters (table) kmeans.xls:基于语法维度的聚类表——各聚类的质心、标准差及分配至每个聚类的样本数
* elbow-lexgram kmeans.png:基于词法与语法双维度的聚类——应用肘部法则确定最优聚类数时得到的错误曲线
* Lexical_Grammatical_clusters (table) kmeans.xls:基于词法与语法双维度的聚类表——各聚类的质心、标准差及分配至每个聚类的样本数
* Grammatical_clusters_number_of_words (table) kmeans.xls:针对语法错误率执行k均值聚类后得到的各聚类结果表——含各聚类的单词数(第2至4列)与聚类规模(最后一列)
* Lexical_clusters_number_of_words (table) kmeans.xls:针对词法错误率执行k均值聚类后得到的各聚类结果表——含各聚类的单词数(第2至4列)与聚类规模(最后一列)
* Lexical_Grammatical_clusters_number_of_words (table) kmeans.xls:针对词法与语法双维度错误率执行k均值聚类后得到的各聚类结果表——含各聚类的单词数(第2至4列)与聚类规模(最后一列)
3. 沃德聚合层次聚类(Ward’s Agglomerative Hierarchical Clustering)文件
* Lexical_Cluster_Dendrogram_ward.png:基于词法维度的聚类——经沃德聚类法得到的聚类树状图(dendrogram)
* Grammatical_Cluster_Dendrogram_ward.png:基于语法维度的聚类——经沃德聚类法得到的聚类树状图
* Lexical_Grammatical_Cluster_Dendrogram_ward.png:基于词法与语法双维度的聚类——经沃德聚类法得到的聚类树状图
* Lexical_Grammatical_clusters (table) ward.xls:针对词法与语法双维度错误率执行沃德聚合层次聚类后得到的结果表——含各聚类的质心(第2至7列)与聚类规模(最后一列)
* Grammatical_clusters (table) ward.xls:针对语法错误率执行沃德聚合层次聚类后得到的结果表——含各聚类的质心(第2至4列)与聚类规模(最后一列)
* Lexical_clusters (table) ward.xls:针对词法错误率执行沃德聚合层次聚类后得到的结果表——含各聚类的质心(第2至4列)与聚类规模(最后一列)
* Lexical_clusters_number_of_words (table) ward.xls:针对词法错误率执行沃德聚合层次聚类后得到的各聚类结果表——含各聚类的单词数(第2至4列)与聚类规模(最后一列)
* Grammatical_clusters_number_of_words (table) ward.xls:针对语法错误率执行沃德聚合层次聚类后得到的各聚类结果表——含各聚类的单词数(第2至4列)与聚类规模(最后一列)
* Lexical_Grammatical_clusters_number_of_words (table) ward.xls:针对词法与语法双维度错误率执行沃德聚合层次聚类后得到的各聚类结果表——含各聚类的单词数(第2至4列)与聚类规模(最后一列)
创建时间:
2018-04-27



