稠密向量实体检索模型的二值化提速压缩

doi:10.16451/j.cnki.issn1003-6059.202301005

Abstract
Figure/Table
References
Related Citation (15)

Download: PDF (873 KB) HTML (1 KB)
Export: BibTeX | EndNote (RIS)

Abstract In entity retrieval tasks, dense vector entity retrieval models are utilized to efficiently filter candidate entities related to a query from a large-scale entity base.However, the existing dense vector retrieval models engender low real-time computation efficiency and large required storage space due to the high dimension of entity vectors. In this paper,it is found that these entity vectors contain a large amount of redundant information through experiments. Most entity vectors are distributed in non-overlapping quadrants and quadrants containing entities with similar semantics are also closer to each other.Thus, a binary entity retrieval method is proposed to compress entity vectors and accelerate similarity calculations.Specifically, the sign function is employed to binary-compress high-dimensional dense floating-point vectors, and Hamming distance is exploited to speed up the retrieval.The reason that the proposed method can guarantee the retrieval performance is theoretically analyzed.The correctness of the theory is verified through qualitative and quantitative analysis experiments, and a method for improving binary retrieval performance based on random dimension increase and rotation is provided.

Key words： Entity Retrieval Embedding Representation Dense Vector Retrieval Approximate Nearest Neighbor Retrieval Product Quantization

Received: 20 September 2022

ZTFLH:

TP391

Fund:National Natural Science Foundation of China(No.61902381,62006218), Youth Innovation Promotion Association Member Project of Chinese Academy of Sciences(No.20144310,2021100), Young Elite Scientist Sponsorship Program by China Association for Science and Technology(No.YESS20200121), Innovation Project of Institute of Computing Technology of Chinese Academy of Sciences(No.E261090)

Corresponding Authors: FAN Yixing, Ph.D., associate professor. His research interests include data mining and information retrieval.

About author:: WANG Yuanzheng, Ph.D. candidate. His research interests include information retrieval and natural language processing.CHEN Wei, Ph.D., professor. Her research interests include machine learning.ZHANG Ruqing, Ph.D., assistant profe-ssor. Her research interests include natural language processing.GUO Jiafeng, Ph.D., professor. His research interests include data mining and information retrieval.

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors
	WANG Yuanzheng
	FAN Yixing
	CHEN Wei
	ZHANG Ruqing
	GUO Jiafeng

Cite this article:

WANG Yuanzheng,FAN Yixing,CHEN Wei等. Binary Acceleration and Compression for Dense Vector Entity Retrieval Models[J]. Pattern Recognition and Artificial Intelligence, 2023, 36(1): 60-69.

URL:

http://manu46.magtech.com.cn/Jweb_prai/EN/10.16451/j.cnki.issn1003-6059.202301005 OR http://manu46.magtech.com.cn/Jweb_prai/EN/Y2023/V36/I1/60