面向增量更新数据库的隐私保护<sup>*</sup>

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (567 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要敏感模式隐藏是隐私保护数据挖掘中的一个重要课题.传统算法大多适用于处理静态数据，因此难以处理增量更新数据中的隐私保护问题.为解决以上问题，设计基于敏感模式图满足最小边际效应的牺牲项选择策略，构建面向增量数据库的隐私保护算法.实例和实验分析验证了算法的正确性、高效率及可扩展性.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	陈文

关键词 ：数据挖掘, 数据清洗, 增量数据库, 隐私保护, 敏感模式隐藏

Abstract：The research of sensitive pattern hiding is important in privacy preserving data mining. Existing sensitive pattern hiding algorithms are originally designed for static database which cannot handle incremental datasets effectively and efficiently. To hide sensitive patterns in the incremental environment, a selection strategy for optimal victim items with minimal edge effect is designed based on sensitive pattern graph and a privacy preservation algorithm in the incremental updating database is proposed. The instance analysis and experimental results validate the correctness, efficiency and scalability of the proposed method.

Key words： Data Mining Data Cleaning Incremental Database Privacy Preservation Sensitive Pattern Hiding

收稿日期: 2012-09-13

ZTFLH:

TP311

基金资助:安徽省高校省级优秀青年人才基金项目(No.2012SQRL191)、安徽省高等学校省级自然科学研究重点项目(No.KJ2014A256)资助

作者简介: 陈文，男，1979年生，硕士，副教授，主要研究方向为数据挖掘、隐私保护.E-mail:tlxychenwen@163.com.

引用本文:

陈文. 面向增量更新数据库的隐私保护^*[J]. 模式识别与人工智能, 2014, 27(7): 638-645. CHEN Wen. Privacy Preservation for the Incremental Updating Database. , 2014, 27(7): 638-645.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/ 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2014/V27/I7/638

[1] Verykios V S, Bertino E, Fovino I N, et al. State-of-the-Art in Privacy Preserving Data Mining. SIGMOD Record, 2004, 33(1): 50-57
[2] Oliveira S R M, Zaane O R. Privacy Preserving Frequent Itemset Mining // Proc of the IEEE International Conference on Privacy, Security and Data Mining. Honolulu, USA, 2002: 43-54
[3] Lee G, Chen Y C, Peng S L, et al. Solving the Sensitive Itemset Hiding Problem Whilst Minimizing Side Effects on a Sanitized Database // Proc of the 2nd International Conference on Security-Enriched Urban Computing and Smart Grids. Hualien, China, 2011: 104-113
[4] Byun J W, Li T C, Bertino E, et al. Privacy-Preserving Incremental Data Dissemination. Journal of Computer Security, 2009, 17(1): 43-68
[5] Xiao X K, Tao Y F. M-Invariance: Towards Privacy Preserving Re-publication of Dynamic Datasets // Proc of the ACM SIGMOD International Conference on Management of Data. Beijing, China, 2007: 689-700
[6] He Y Y, Barman S, Naughton J F. Preventing Equivalence Attacks in Updated, Anonymized Data // Proc of the 27th IEEE International Conference on Data Engineering. Hannover, Germany, 2011: 529-540
[7] di Vimercati S D C, Foresti S, Livraga G, et al. Protecting Privacy in Data Release // Aldini A, Gorrieri R, eds. Foundations of Security Analysis and Design VI. Berlin, Germany: Springer-Verlag, 2011: 1-34
[8] Wang J L, Xu C F, Pan Y H. An Incremental Algorithm for Mining Privacy-Preserving Frequent Itemsets // Proc of the International Conference on Machine Learning and Cybernetics. Dalian, China, 2006: 1132-1137
[9] Dai B R, Chiang L H. Hiding Frequent Patterns in the Updated Database // Proc of the International Conference on Information Science and Applications. Seoul, Republic of Korea, 2010: 1-8
[10] Mhatre A, Toshniwal D. Hiding Co-occurring Sensitive Patterns in Progressive Databases // Proc of the EDBT/ICDT Workshops. Lausanne, Switzerland, 2010: 35
[11] Zhou B, Han Y, Pei J, et al. Continuous Privacy Preserving Publishing of Data Streams // Proc of the 12th International Conference on Extending Database Technology: Advances in Database Technology. Saint-Petersburg, Russia, 2009: 648-659
[12] Oliveira S R M, Zaane O R, Saygin Y. Secure Association Rule Sharing // Proc of the 8th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining. Sydney, Australia, 2004: 74-85
[13] Geurts K, Wets G, Brijs T, et al. Profiling of High Frequency Accident Locations Using Association Rules. Transportation Research Record: Journal of the Transportation Research Board, 2003, 1840(1): 123-130
[14] Kuo Y P, Lin P Y, Dai B R. Hiding Frequent Patterns under Multiple Sensitive Thresholds // Proc of the 19th International Conference on Database and Expert Systems Applications. Turin, Italy, 2008: 5-18