用于非平衡样本分类的近似支持向量机

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (353 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要针对标准的近似支持向量机(PSVM)没有考虑样本分布不平衡的问题，提出一种改进的PSVM算法(MPSVM).根据训练样本数量的不平衡对正负样本集分别分配不同的惩罚因子，并将原始优化问题中的惩罚因子由数值变更为一个对角阵.最后推导出线性和非线性MPSVM的决策函数，并将其与PSVM、非平衡的SVM的运算机理和性能进行比较.实验结果表明，MPSVM的性能优于PSVM，与非平衡SVM方法相比效率更高.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	陶晓燕
	姬红兵
	董淑福

关键词 ：近似支持向量机(PSVM), 非平衡分布, 改进的近似支持向量机(MPSVM)

Abstract：Aiming at the problem that unbalanced data classification is disregarded in the standard Proximal Support Vector Machines (PSVM), a modified PSVM algorithm is presented, namely MPSVM. The different penalty factors are assigned to the positive and negative training sets according to the unbalanced population. The penalty values are transformed into a diagonal matrix. Then the decision functions for the linear and nonlinear MPSVM are achieved. Finally, the comparisons of algorithmic principle and performance are drawn. The experimental results show that MPSVM has a better generalization performance than PSVM and higher efficiency than the unbalanced SVM.

Key words： Proximal Support Vector Machine (PSVM) Unbalanced Distribution Modified Proximal Support Vector Machine (MPSVM)

收稿日期: 2006-04-17

ZTFLH:

TP391.4

作者简介: 陶晓燕，女，1971年生，博士研究生，主要研究方向为机器学习、模式识别等.Email:taoxiaoyan@lab202.xidian.edu.cn.姬红兵，男，1963年生，教授，博士生导师，主要研究方向为雷达目标识别、智能信息处理等.董淑福，男，1970年生，副教授，主要研究方向为光通信.

引用本文:

陶晓燕，姬红兵，董淑福. 用于非平衡样本分类的近似支持向量机[J]. 模式识别与人工智能, 2007, 20(4): 552-557. TAO XiaoYan , JI HongBing , Dong ShuFu. Proximal Support Vector Machines for Samples with Unbalanced Classification. , 2007, 20(4): 552-557.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2007/V20/I4/552

[1] Vapnik V N. The Nature of Statistical Learning Theory. New York, USA: SpringerVerlag, 2000
[2] Cortes C, Vapnik V. Support Vector Networks. Machine Learning, 1995, 20(3): 273297
[3] Drucker H, Burges C J C, Kaufman L, et al. Support Vector Regression Machines // Mozer M C, Jordan M I, Petsche T, eds. Advances in Neural Information Processing Systems. Cambridge, UK: MIT Press, 1997: 155161
[4] Platt J C. Fast Training of Support Vector Machines Using Sequential Minimal Optimization // Schlkopf B, Burges C, Smola A, eds. Advances in Kernel MethodsSupport Vector Learning. Cambridge, UK: MIT Press, 1999: 185208
[5] Osuna E, Freund R, Girosi F. An Improved Training Algorithm for Support Vector Machines // Proc of the International Workshop on Neural Networks for Signal Processing. Amelia Island, USA, 1997: 276285
[6] Keerthi S, Shevade S, Bhattcharyya C, et al. Improvements to Platt’s SMO Algorithm for SVM Classifier Design. Neural Computation, 2001, 13(3): 637649
[7] Suykens J A K, Vandewalle J. Least Squares Support Vector Machines. Neural Network Letters, 1999, 9(3): 293300
[8] Mangasarian O L. Generalized Support Vector Machines // Smola A, Bartlett P, Schlkopf B, et al, eds. Advances in Large Margin Classifier. Cambridge, UK: MIT Press, 2000: 135146
[9] Fung G, Mangasarian O L. Proximal Support Vector Machine Classifiers // Proc of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, USA, 2001: 7786
[10] Agarwal D K, DuMouchel W. Shrinkage Estimator Generalizations of Proximal Support Vector Machines // Proc of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Edmonton, Canada, 2002: 173182
[11] Chew H G, Crisp D J, Bogner R E, et al. Target Detection in Radar Imagery Using Support Vector Machines with Training Size Biasing [EB/OL]. [20010101]. http://users.on.net/^~hgchew/SVM/ChewCrispBognerLimICARCV2000.pdf
[12] Chew H G, Bogner R E, Lim C C. Dual vSupport Vector Machines with Error Rate and Training Size Biasing // Proc of the International Conference on Acoustics, Speech and Signal Processing. Salt Lake City, USA, 2001: 12691272
[13] Lin C F, Wang S D. Fuzzy Support Vector Machines. IEEE Trans on Neural Networks, 2002, 13(2): 464471
[14] Tao Qin, Wu Gaowei, Wang Feiyue, et al. Posterior Probability Support Vector Machines for Unbalanced Data. IEEE Trans on Neural Networks, 2005, 16(6): 15611573
[15] Golub G H, van Loan C C. Matrix Computations. Baltimore, USA: The John Hopkins University Press, 1996
[16] Lee Y J, Mangasarian O L. RSVM: Reduced Support Vector Machines. Technical Report, 0007, Madison, USA: University of Wisconsin. Data Mining Institute, 2000
[17] Murphy M. UCIBenchmark Repository of Artificial and Real Data Sets [DB/OL]. [20060401]. http://www.ics.uci.edu/~mlearn
[18] Mitchell T M. Machine Learning. Boston, USA: McGrawHill, 1997