模式识别与人工智能
Friday, Apr. 11, 2025 Home      About Journal      Editorial Board      Instructions      Ethics Statement      Contact Us                   中文
Pattern Recognition and Artificial Intelligence  2023, Vol. 36 Issue (1): 60-69    DOI: 10.16451/j.cnki.issn1003-6059.202301005
Current Issue| Next Issue| Archive| Adv Search |
Binary Acceleration and Compression for Dense Vector Entity Retrieval Models
WANG Yuanzheng1,2, FAN Yixing1,2, CHEN Wei1,2, ZHANG Ruqing1,2, GUO Jiafeng1,2
1. Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190;
2. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 101408

Download: PDF (873 KB)   HTML (1 KB) 
Export: BibTeX | EndNote (RIS)      
Abstract  In entity retrieval tasks, dense vector entity retrieval models are utilized to efficiently filter candidate entities related to a query from a large-scale entity base.However, the existing dense vector retrieval models engender low real-time computation efficiency and large required storage space due to the high dimension of entity vectors. In this paper,it is found that these entity vectors contain a large amount of redundant information through experiments. Most entity vectors are distributed in non-overlapping quadrants and quadrants containing entities with similar semantics are also closer to each other.Thus, a binary entity retrieval method is proposed to compress entity vectors and accelerate similarity calculations.Specifically, the sign function is employed to binary-compress high-dimensional dense floating-point vectors, and Hamming distance is exploited to speed up the retrieval.The reason that the proposed method can guarantee the retrieval performance is theoretically analyzed.The correctness of the theory is verified through qualitative and quantitative analysis experiments, and a method for improving binary retrieval performance based on random dimension increase and rotation is provided.
Key wordsEntity Retrieval      Embedding Representation      Dense Vector Retrieval      Approximate Nearest Neighbor Retrieval      Product Quantization     
Received: 20 September 2022     
ZTFLH: TP391  
Fund:National Natural Science Foundation of China(No.61902381,62006218), Youth Innovation Promotion Association Member Project of Chinese Academy of Sciences(No.20144310,2021100), Young Elite Scientist Sponsorship Program by China Association for Science and Technology(No.YESS20200121), Innovation Project of Institute of Computing Technology of Chinese Academy of Sciences(No.E261090)
Corresponding Authors: FAN Yixing, Ph.D., associate professor. His research interests include data mining and information retrieval.   
About author:: WANG Yuanzheng, Ph.D. candidate. His research interests include information retrieval and natural language processing.CHEN Wei, Ph.D., professor. Her research interests include machine learning.ZHANG Ruqing, Ph.D., assistant profe-ssor. Her research interests include natural language processing.GUO Jiafeng, Ph.D., professor. His research interests include data mining and information retrieval.
Service
E-mail this article
Add to my bookshelf
Add to citation manager
E-mail Alert
RSS
Articles by authors
WANG Yuanzheng
FAN Yixing
CHEN Wei
ZHANG Ruqing
GUO Jiafeng
Cite this article:   
WANG Yuanzheng,FAN Yixing,CHEN Wei等. Binary Acceleration and Compression for Dense Vector Entity Retrieval Models[J]. Pattern Recognition and Artificial Intelligence, 2023, 36(1): 60-69.
URL:  
http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202301005      OR     http://manu46.magtech.com.cn/Jweb_prai/EN/Y2023/V36/I1/60
Copyright © 2010 Editorial Office of Pattern Recognition and Artificial Intelligence
Address: No.350 Shushanhu Road, Hefei, Anhui Province, P.R. China Tel: 0551-65591176 Fax:0551-65591176 Email: bjb@iim.ac.cn
Supported by Beijing Magtech  Email:support@magtech.com.cn