five

Distributed Algorithm for Tolerance Relation Rough Sets Based on Graph Optimization

收藏
中国科学数据2026-03-16 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.19678/j.issn.1000-3428.0070081
下载链接
链接失效反馈
官方服务:
资源简介:
Researchers have introduced the theory of tolerance relation rough sets to address the data filtering problem in distributed Incomplete Information Systems (IIS). With the continuous growth in data volume, it is necessary to achieve scalable parallel computing through distributed computing. Consequently, distributed tolerance relation rough sets have emerged, among which the Block Set is the core method for computing approximate sets. However, the Block Set only uses set operations during computation, with no structure between the data, and the process involves a large number of repetitive calculations, resulting in low computational efficiency. This study proposes a Tolerance Relation rough sets Distributed (TRDG) algorithm, which is based on graph optimization, to address this issue. Using the existing concepts of reliable and disputed elements, a hierarchical directed acyclic graph is constructed with data in the IIS as nodes and asymmetric tolerance relationships as edges. The graph structure is used to organize the data. To improve the computational efficiency of the Block Set in distributed environments, the study proposes a strategy that uses the nearest tolerance relationship instead of the general asymmetric tolerance relationship to remove redundant edges, simplify the graph structure, and obtain a Block Set based on the path from reliable elements to zero-degree disputed elements. Distributed graph optimization and path search algorithms are then implemented on the Spark platform, which ultimately completes the design of the TRDG algorithm. Experimental results show that the TRDG algorithm exhibits good parallel acceleration performance. Compared to traditional general tolerance rough approximation set solving algorithms, TRDG can save computing resources, increase the average computing speed by approximately 40 times, and increase the amount of data that can be processed by more than 50 times.
创建时间:
2026-03-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作