Abstract:With the inspiration from selfprotection mechanism of biological immune system, an individual spam filtering algorithm based on immune principles is proposed. Firstly, the spam communities are defined according to the users’ interests and the email features. Then all spams are classified into different spam communities. Secondly, the community features are extracted and represented by a set of feature detectors. Finally, the identification of a spam depends on whether the email can be classified into any spam community. The proposed algorithm is an incremental learning algorithm and it can continuously filter spam without retraining. The immune learning and immune memory mechanisms adopted in this algorithm improve not only the detectable rate and the accuracy rate but also the filter speed. Experimental results show that the algorithm is better than the AISEC algorithm and the Nave Bayesian algorithm.
[1] IResearch Inc. China AntiSpam Market Research Report [EB/OL]. [20050701]. http://www.iresearch.com.cn/ html/free/Default.html [2] Pelletier L, Almhana J, Choulakian V. Adaptive Filtering of SPAM // Proc of the 2nd Annual Conference on Communication Networks and Services Research. Fredericton, Canada, 2004: 218224 [3] Androutsopoulos I, Koutsias J, Chandrinos K V, et al. An Experimental Comparison of Nave Bayesian and KeywordBased AntiSpam Filtering with Personal Email Message // Proc of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Athens, Greece, 2000: 160167 [4] Sahami M, Dumais S, Heckerman D, et al. A Bayesian Approach to Filtering Junk Email // Proc of AAAI Workshop on Learning for Text Categorization. Madison, USA, 1998: 5562 [5] Moon J, Shon T, Seo J, et al. An Approach for Spam Email Detection with Support Vector Machine and nGram Indexing. Lecture Notes in Computer Science, 2004, 3280: 351362 [6] Drucker H, Wu D, Vapnik V. Support Vector Machines for Spam Categorization. IEEE Trans on Neural Networks, 1999, 10(5): 10481054 [7] Secker A, Freitas A, Timmis J. AISEC: an Artificial Immune System for Email Classification // Proc of the Congress on Evolutionary Computation. Canberra, Australia, 2003, Ⅰ: 131138 [8] Oda T, White T. Increasing the Accuracy of a SpamDetecting Artificial Immune System // Proc of the Congress on Evolutionary Computation. Canberra, Australia, 2003, Ⅰ: 390396 [9] Oda T, White T. Developing an Immunity to Spam // Proc of the Genetic and Evolutionary Computation Conference. Chicago, USA, 2003: 231242 [10] Cao Yukun, Liao Xiaofeng, Li Yunfeng. An Email Filtering Approach Using Neural Network // Proc of the International Symposium on Neural Networks. Dalian, China, 2004: 688694 [11] Zhan Chuan, Lu Xianliang, Xing Qian. A Novel AntiSpam Email Approach Based on LVQ // Proc of the 5th International Conference on Parallel and Distributed Computing: Applications and Technologies. Singapore, Singapore, 2004: 180183 [12] Kim J, Bentley P J. Negative Selection and Niching by an Artificial Immune System for Network Intrusion Detection // Proc of the Conference on Genetic and Evolutionary Computation. Orlando, USA, 1999: 149158 [13] Mo Hongwei. The Principle and Application of Artificial Immune System. Harbin, China: Harbin Institute of Technology Press, 2003 (in Chinese) (莫宏伟. 人工免疫系统原理与应用. 哈尔滨: 哈尔滨工业大学出版社, 2003) [14] Luo Wenjian. Research on Artificial Immune Models and Algorithms for Intrusion Detection. Ph.D Dissertation. Hefei, China: University of Science and Technology of China. Department of Computer Science and Technology, 2003 (in Chinese) (罗文坚. 面向入侵检测的人工免疫模型和算法研究. 博士学位论文.合肥:中国科学技术大学.计算机科学技术系,2003) [15] de Castro L N, Timmis J. Artificial Immune Systems: A New Computational Intelligence Approach. London, UK: SpringerVerlag, 2002 [16] Deepak P, John J, Parameswaran S. A Community Based Approach for Spam Filtering // Proc of the International Conference on Information and Communication Technologies: From Theory to Applications. Damascus, Syria, 2004: 611612 [17] Deepak P, Parameswaran S. Spam Filtering Using Spam Mail Communities // Proc of the Symposium on Applications and the Internet. Trento, Italy, 2005: 377383 [18] Zhang Zeming, Luo Wenjian, Wang Xufa. A Multilevel Spam Filtering Algorithm Based on Artificial Immunity. Acta Electronica Sinica, 2006, 34(9): 16161620 (in Chinese) (张泽明,罗文坚,王煦法.一种基于人工免疫的多层垃圾邮件过滤算法.电子学报, 2006, 34(9): 16161620)