|
|
Cross-Domain Person Re-identification Method Based on Point-by-Point Feature Matching |
YANG Ping1, WU Xiaohong1, HE Xiaohai1, CHEN Honggang1, LIU Qiang1, LI Bo1 |
1. College of Electronics and Information Engineering, Sichuan University, Chengdu 610065 |
|
|
Abstract To improve the poor generalization and cross-domain capability of the existing direct cross-dataset person re-identification methods, a cross-domain person re-identification method based on point-by-point feature matching is proposed. By the proposed method, the model only needs to be trained on the source domain and tested on the target domain to achieve better results. Firstly, to improve the poor robustness of the network for style and color of cross-domain pedestrian images, instance normalization layer(IN) is introduced into the ResNet50 basic network to extract image features. Secondly, the multi-head self-attention module of Transformer is combined with convolution to enhance the representation ability of features. Finally, by establishing a point-by-point feature mapping relationship in the deep features, image matching is regarded as a point-by-point process of finding the local optimum to improve the ability of the proposed model to resist perspective changes in unknown scenes and enhance its generalization. The experimental results show that the advantages of the proposed method in improving the generalization ability.
|
Received: 25 February 2022
|
|
Fund:National Natural Science Foundation of China(No.61871278), Natural Science Foundation of Sichuan Province(No.2022NSFSC0922), Sichuan Science and Technology Plan Project(No.2021YFS0239) |
Corresponding Authors:
HE Xiaohai, Ph.D., professor. His research interests include image processing, pattern recognition, computer vision and image compression.
|
About author:: YANG Ping, master student. Her research interests include deep learning and person re-identification. WU Xiaohong, Ph.D., associate profe-ssor. Her research interests include image processing, pattern recognition and computer vision. CHEN Honggang, Ph.D., associate professor. His research interests include image/video processing, computer vision and artificial intelligence. LIU Qiang, Ph.D. candidate. His research interests include image processing, person re-identification and computer vision. LI Bo, master student. His research inte-rests include computer vision and pattern re-cognition. |
|
|
|
[1] GHEISSARI N, SEBASTIAN T B, HARTLEY R. Person Reidentification Using Spatiotemporal Appearance // Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2006: 1528-1535. [2] YU H X, WU A C, ZHENG W S. Cross-View Asymmetric Metric Learning for Unsupervised Person Re-identification // Proc of the IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2017: 994-1002. [3] GE Y X, CHEN D P, LI H S.Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification[C/OL]. [2022-01-25].https://arxiv.org/pdf/2001.01526.pdf. [4] ZHAI Y P, LU S J, YE Q X, et al. AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pa-ttern Recognition. Washington, USA: IEEE, 2020: 9018-9027. [5] DENG W J, ZHENG L, YE Q X, et al. Image-Image Domain Ada-ptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 994-1003. [6] ZHONG Z, ZHENG L, LI S Z, et al. Generalizing a Person Retrieval Model Hetero-and Homogeneously // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2018: 176-192. [7] CHOI Y, CHOI M, KIM M, et al. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation // Proc of the IEEE/CVF Conference on Computer Vision and Pa-ttern Recognition. Washington, USA: IEEE, 2018: 8789-8797. [8] LIU Q, HE X H, ZHANG M Z, et al. Feature Separation and Double Causal Comparison Loss for Visible and Infrared Person Re-identification. Knowledge-Based Systems, 2022, 239. DOI: 10.1016/j.knosys.2021.108042. [9] MUANDET K, BALDUZZI D, SCHÖLKOPF B. Domain Generalization via Invariant Feature Representation // Proc of the 30th International Conference on Machine Learning. San Diego, USA: JMLR, 2013: 10-18. [10] SHANKAR S, PIRATLA V, CHAKRABARTI S, et al. Generalizing Across Domains via Cross-Gradient Training[C/OL].[2022-01-25]. https://arxiv.org/pdf/1804.10745.pdf. [11] JIN X, LAN C L, ZENG W J, et al. Style Normalization and Restitution for Generalizable Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2020: 3140-3149. [12] SARFRAZ M S, SCHUMANN A, EBERLE A, et al. A Pose-Sensitive Embedding for Person Re-identification with Expanded Cross Neighborhood Re-ranking // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 420-429. [13] ZHENG Z D, YANG X D, YU Z D, et al. Joint Discriminative and Generative Learning for Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2019: 2133-2142. [14] XU J, ZHAO R, ZHU F, et al. Attention-Aware Compositional Network for Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 2119-2128. [15] LI W, ZHU X T, GONG S G. Harmonious Attention Network for Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 2285-2294. [16] CHEN G Y, LIN C Z, REN L L, et al. Self-Critical Attention Learning for Person Re-identification // Proc of the IEEE/CVF International Conference on Computer Vision. Washington, USA: IEEE, 2019: 9636-9645. [17] ZHENG M, KARANAM S, WU Z Y, et al. Re-identification with Consistent Attentive Siamese Networks // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2019: 5728-5737. [18] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An Image Is Worth 16×16 Words: Transformers for Image Recognition at Scale[C/OL].[2022-01-25]. https://arxiv.org/pdf/2010.11929.pdf. [19] CARION N, MASSA F, SYNNAEVE G, et al. End-to-End Object Detection with Transformers // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2020: 213-229. [20] HE S T, LUO H, WANG P C, et al. TransReID: Transformer-Based Object Re-identification // Proc of the IEEE/CVF International Conference on Computer Vision. Washington, USA: IEEE, 2021: 14993-15002. [21] HUANG X, BELONGIE S. Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization // Proc of the IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2017: 1510-1519. [22] PAN X G, LUO P, SHI J P, et al. Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2018: 484-500. [23] WANG H, WANG Y T, ZHOU Z, et al. CosFace: Large Margin Cosine Loss for Deep Face Recognition // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 5265-5274. [24] LIN T Y, GOYAL P, GIRSHICK R, et al. Focal Loss for Dense Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(2): 318-327. [25] ZHONG Z, ZHENG L, CAO D L, et al. Re-ranking Person Re-identification with k-Reciprocal Encoding // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2017: 3652-3661. [26] LIAO S C, SHAO L. Interpretable and Generalizable Person Re-identification with Query-Adaptive Convolution and Temporal Lifting // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2020: 456-474. [27] WANG J Y, ZHU X T, GONG S G, et al. Transferable Joint Attri-bute-Identity Deep Learning for Unsupervised Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 2275-2284. [28] YANG Q Z, YU H X, WU A C, et al. Patch-Based Discriminative Feature Learning for Unsupervised Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Re-cognition. Washington, USA: IEEE, 2019: 3628-3637. [29] ZHONG Z, ZHENG L, LUO Z M, et al. Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-identification // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2019: 598-607. [30] MEKHAZNI D, BHUIYAN A, EKLADIOUS G, et al. Unsupervised Domain Adaptation in the Dissimilarity Space for Person Re-identification // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2020: 159-174. [31] CHEN X D, LIU X C, LIU W, et al. Explainable Person Re-identification with Attribute-Guided Metric Distillation // Proc of the IEEE/CVF International Conference on Computer Vision. Wa-shington, USA: IEEE, 2021: 11793-11802. [32] ZHOU K Y, YANG Y X, CAVALLARO A, et al. Learning Generalisable Omni-Scale Representations for Person Re-identification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021. DOI: 10.1109/TPAMI.2021.3069237. |
|
|
|