five

Generalized possibilistic fuzzy c-means with novel cluster validity indices for clustering noisy data

收藏
NIAID Data Ecosystem2026-03-10 收录
下载链接:
https://data.mendeley.com/datasets/dgxfv4s5vt
下载链接
链接失效反馈
官方服务:
资源简介:
Dear Researcher, Thank you for using this code and datasets. I explain how GPFCM code related to my paper "Generalized possibilistic fuzzy c-means with novel cluster validity indices for clustering noisy data" published in Applied Soft Computing, works. The main datasets mentioned in the paper together with GPFCM code are included. If there is any question, feel free to contact me at: bas_salaraskari@yahoo.com s_askari@aut.ac.ir Regards, S. Askari Guidelines for GPFCM algorithm: 1. Open the file "GPFCM-Code" using MATLAB. 2. DATA1 to DATA6 are the data sets we used in the paper. Each data set contains the data "yd", optimal value of ρ "ruopt" and number of clusters "C". 3. In line 13 of the code, change the number in "DATA1" to the number of the desired data set. For example, to load DATA3, change "load DATA1" to "load DATA3". 4. Click somewhere on the file "GPFCM-Code" and then Press "Ctrl+Enter" to run the code. 5. VFCM, VPFCM, and VGPFCM which appear on the command window are cluster centers computed by each of the algorithms FCM, PFCM, and GPFCM, respectively. You can find all of them in the "Workspace" of MATLAB as well. 6. Sometimes, PFCM may yield two or more coincident clusters for DATA4 or any other data. Then GPFCM will also give two or more coincident clusters because it starts with PFCM. You may run the algorithm again to get probably all cluster centers accurately. Generally, if you use GFCM rather than GPFCM, you'll get better results with no coincident clusters. Settings of the code for GFCM are mentioned in item 14. 7. Since the algorithm starts randomly, order of the cluster centers may be different in various runs but numerical values of the cluster centers will not change. For example, if is obtained as the third cluster center in one run which is the third column of the matrix VGPFCM, it may move to the fifth row of the matrix in another run (if ). But its value would not considerably change and is very close to . This is just because of random initializations of the algorithm. Since FCM (by which GPFCM is initialized) is randomly initialized, sometimes it is sensitive to initialization (depending on the data) and there may be negligible differences between cluster centers obtained in different runs. For example, consider DATA3 with 6 clusters. In one run we get: VGPFCM1 = -4.9960 -1.0169 -4.9708 1.9575 1.0521 -2.0271 -1.9853 -5.0464 5.9470 0.0031 6.0183 1.9896 And in another run we have: VGPFCM2 = -4.9960 -1.0169 1.9575 1.0521 -2.0271 -4.9708 -1.9853 -5.0464 0.0031 6.0183 1.9896 5.9470 It is observed that cluster centers are the same as those of the first run but their positions in the matrix VGPFCM is changed. 8. Line 46 computes Covariance norm matrix. If you "uncomment" line 47, then the program uses Identity norm matrix (Euclidean distance).
创建时间:
2017-01-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作