|
|
Classification Model Based on Variable Multi-granulation Probabilistic Rough Set |
WANG Jiaqi, MIAO Duoqian, ZHANG Hongyun |
Department of Computer Science and Technology, Tongji University, Shanghai 201804 Key Laboratory of Embedded System and Service Computing, Ministry of Education, Tongji University, Shanghai 201804 |
|
|
Abstract Based on the multi-granulation rough set theory, a variable multi-granulation probabilistic rough set (VMGPRS) model combining the ideas of variable multi-granulation and misclassification rate is proposed. A granulation reduction algorithm is put forward grounded on the concept of attribute reduction in rough set theory, and the granulation redundancy caused by parameter setting in the variable multi-granulation rough set model is found and solved. The data before and after the reduction are applied to classical classification algorithms such as support vector machine, k-nearest neighbor, Naive Bayes, and it is verified that the classification ability of data is hardly influenced by the reduction. With the combination of the rule and the proposed algorithm, a rule-based classification algorithm is designed. Furthermore, two adjustment parameters, α and β, in the VMGPRS model are analyzed for classification effect of the classifier.
|
Received: 11 March 2017
|
|
About author:: (WANG Jiaqi, born in 1993, master student. Her research interests include rough set, natural language processing and pattern recognition.) (MIAO Duoqian(Corresponding author), born in 1964, Ph.D., professor. His research interests include rough set, granular computing, data mining and machine learning.) (ZHANG Hongyun, born in 1972, Ph.D., associate professor. Her research interests include principal curves, granular computing, rough set and data mining.) |
|
|
|
[1] ZADEH L A. Toward a Theory of Fuzzy Information Granulation and Its Centrality in Human Reasoning and Fuzzy Logic. Fuzzy Sets and Systems, 1997, 90(2): 111-127. [2] ZADEH L A. Fuzzy Logic=Computing with Words. IEEE Transactions on Fuzzy Systems, 1996, 4(2): 103-111. [3] ZADEH L A. From Computing with Numbers to Computing with Words-From Manipulation of Measurements to Manipulation of Perceptions // CANTONI V, DI GES V, SETTI A, et al., eds. Human and Machine Perception 3. Boston, USA: Springer, 2001: 1-25. [4] PAWLAK Z. Rough Sets. International Journal of Computer and Information Science, 1982, 11(5): 341-356. [5] 张 铃,张 钹.问题求解理论及应用:商空间粒度计算理论及应用.北京:清华大学出版社, 2007. (ZHANG L, ZHANG B. Theory and Application of Problem Solving: Theory and Application of Quotient Space Granularity Computing. Beijing, China: Tsinghua University Press, 2007.) [6] 苗夺谦,王国胤,刘 清.粒计算:过去、现在与展望.北京:科学出版社, 2007. (MIAO D Q, WANG G Y, LIU Q. Granular Computing: Past, Present and Future. Beijing, China: Science Press, 2007.) [7] QIAN Y H, LIANG J Y, YAO Y Y, et al. MGRS: A Multi-granulation Rough Set. Information Sciences, 2010, 180(6): 949-970. [8] QIAN Y H, LIANG J Y, DANG C Y. MGRS in Incomplete Information Systems // Proc of the IEEE International Conference on Granular Computing. Washington, USA: IEEE, 2007. DOI: 10.1109/GrC.2007.10. [9] QIAN Y H, LIANG J Y, DANG C Y. Incomplete Multigranulation Rough Set. IEEE Transactions on Systems, Man, and Cybernetics(Systems & Humans), 2010, 40(2): 420-431. [10] QIAN Y H, LIANG J Y, WEI W. Pessimistic Rough Decision. Journal of Zhejiang Ocean University(Natural Science), 2010, 29(5): 440-449. [11] ZIARKO W. Variable Precision Rough Set Model. Journal of Computer and System Sciences, 1993, 46(1): 39-59. [12] ZHANG M, TANG Z M, XU W Y, et al. A Variable Multi-granulation Rough Sets Approach // Proc of the 7th International Confe-rence on Intelligent Computing: Bio-inspired Computing and Application. Berlin, Germany: Springer, 2011: 315-322. [13] 苗夺谦,李道国.粗糙集理论、算法与应用.北京:清华大学出版社, 2008. (MIAO D G, LI D G. Rough Sets Theory, Algorithms and Applications. Beijing, China: Tsinghua University Press, 2008. ) [14] PAWLAK Z. Rough Sets: Theoretical Aspects of Reasoning about Data. London, UK: Kluwer Academic Publishers, 1991. [15] 张文修,吴伟志,梁吉业,等.粗糙集理论与方法.北京:科学出版社, 2001. (ZHANG W X, WU W Z, LIANG J Y, et al. Rough Set Theory and Method. Beijing, China: Science Press, 2001.) [16] 苗夺谦,胡桂荣.知识约简的一种启发式算法.计算机研究与发展, 1999, 36(6): 681-684. (MIAO D Q, HU G R. A Heuristic Algorithm for Reduction of Knowledge. Journal of Computer Research and Development, 1999, 36(6): 681-684.) [17] 任志刚,冯祖仁,柯良军.蚁群优化属性约简算法.西安交通大学学报, 2008, 42(4): 440-444. (REN Z G, FENG Z R, KE L J. Ant Colony Optimization Approach to Attribute Reduction Problem. Journal of Xi′an Jiaotong University, 2008, 42(4): 440-444.) [18] 郭春根.基于遗传算法的粗糙集属性约简研究.硕士学位论文.合肥:合肥工业大学, 2007. (GUO C G. Rough Set Attribute Reduction Algorithm Based on Genetic Algorithm. Master Dissertation. Hefei, China: Hefei University of Technology, 2007.) [19] LI M Z, WANG G Y. Approximate Concept Construction with Three-Way Decisions and Attribute Reduction in Incomplete Contexts. Knowledge-Based Systems, 2016, 91: 165-178. [20] QIAN J, MIAO D Q, ZHANG Z H, et al. Parallel Attribute Reduction Algorithms Using MapReduce. Information Sciences, 2014, 279: 671-690. [21] ZHANG J B, LI T R, PAN Y. Parallel Large-Scale Attribute Reduction on Cloud Systems[J/OL]. [2017-01-25]. https://www.arxiv.org/pdf/1610.01807v1.pdf. [22] 钱宇华.复杂数据的粒化机理与数据建模.博士学位论文.太原:山西大学, 2011. (QIAN Y H. Granulation Mechanism and Data Modeling for Complex Data. Ph.D Dissertation. Taiyuan, China: Shanxi University, 2011.) [23] LIN G P, QIAN Y H, LI J J. NMGRS: Neighborhood-Based Multigranulation Rough Sets. International Journal of Approximate Reasoning, 2012, 53(7): 1080-1093. [24] SHE Y H, HE X L, SHI H X, et al. A Multiple-valued Logic Approach for Multigranulation Rough Set Model. International Journal of Approximate Reasoning, 2017, 82: 270-284. [25] YAO Y Y, SHE Y H. Rough Set Models in Multigranulation Spaces. Information Sciences, 2016, 327: 40-56. [26] LIU C H, PEDRYCZ W. Covering-Based Multi-granulation Fuzzy Rough Sets. Journal of Intelligent and Fuzzy Systems, 2015, 30(1): 303-318.
[27] ZHU P F, HU Q H, ZUO W M, et al. Multi-granularity Distance Metric Learning via Neighborhood Granule Margin Maximization. Information Sciences, 2014, 282: 321-331. [28] ZHU P F, HU Q H, HAN Y H, et al. Combining Neighborhood Separable Subspaces for Classification via Sparsity Regularized Optimization. Information Sciences, 2016, 370/371: 270-287. |
|
|
|