Education, Science, Technology, Innovation and Life
Open Access
Sign In

Image registration with semantic information-guided focus on co-pixel points

Download as PDF

DOI: 10.23977/jipta.2025.080109 | Downloads: 4 | Views: 383

Author(s)

Xuedong Liu 1,2, Yang Yang 1,2

Affiliation(s)

1 School of Information Science and Technology, Yunnan Normal University, Kunming, China
2 Laboratory of Pattern Recognition and Artificial Intelligence, Yunnan Normal University, Kunming, China

Corresponding Author

Yang Yang

ABSTRACT

The goal of image registration is to align pixel points from images captured by different sensors, serving as a reliable foundation for subsequent information fusion and enhancing the performance of advanced vision tasks in degraded scenarios. However, current methods overlook the viewpoint variations caused by internal sensor differences, which introduce information of irrelevant interference, challenging the reliability of registration. Therefore, this paper proposes a semantic information-guided image registration algorithm that directs the network to focus on relevant pixel regions by leveraging the semantic information of corresponding pixels. Through comparative experiments, the proposed method demonstrates superior registration performance compared to existing methods.

KEYWORDS

Image registration, Viewpoint variations, Semantic information-guided, corresponding pixels, Irrelevant interference

CITE THIS PAPER

Xuedong Liu, Yang Yang, Image registration with semantic information-guided focus on co-pixel points. Journal of Image Processing Theory and Applications (2025) Vol. 8: 71-77. DOI: http://dx.doi.org/10.23977/jipta.2025.080109.

REFERENCES

[1] Tang L, Yuan J, Ma J. Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network[J]. Information Fusion, 2022, 82: 28-42.
[2] Aharon N, Orfaig R, Bobrovsky B Z. Bot-sort: Robust associations multi-pedestrian tracking[J]. arXiv preprint arXiv:2206.14651, 2022.
[3] Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2023: 7464-7475.
[4] Zhang H, Dana K, Shi J, et al. Context encoding for semantic segmentation[C]. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 2018: 7151-7160.
[5] Sarlin P E, DeTone D, Malisiewicz T, et al. Superglue: Learning feature matching with graph neural networks[C]. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 4938-4947.
[6] Li J, Hu Q, Ai M. RIFT: Multi-modal image matching based on radiation-variation insensitive feature transform[J]. IEEE Transactions on Image Processing, 2019, 29: 3296-3310.
[7] DeTone D, Malisiewicz T, Rabinovich A. Superpoint: Self-supervised interest point detection and description[C]. Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 2018: 224-236.
[8] Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International journal of computer vision, 2004, 60: 91-110.
[9] Wang Q, Zhang J, Yang K, et al. Matchformer: Interleaving attention in transformers for feature matching[C]. Proceedings of the Asian Conference on Computer Vision. 2022: 2746-2762.
[10] Sun J, Shen Z, Wang Y, et al. LoFTR: Detector-free local feature matching with transformers[C]. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021: 8922-8931.
[11] Bay H, Tuytelaars T, Van Gool L. Surf: Speeded up robust features[C]. Computer Vision–ECCV 2006: 9th European Conference on Computer Vision, Graz, Austria, May 7-13, 2006. Proceedings, Part I 9. Springer Berlin Heidelberg, 2006: 404-417.
[12] Liu X, Ai Y, Zhang J, et al. A novel affine and contrast invariant descriptor for infrared and visible image registration[J]. Remote Sensing, 2018, 10(4): 658.
[13] Rosten E, Drummond T. Machine learning for high-speed corner detection[C]. Computer Vision–ECCV 2006: 9th European Conference on Computer Vision, Graz, Austria, May 7-13, 2006. Proceedings, Part I 9. Springer Berlin Heidelberg, 2006: 430-443.
[14] Cui S, Xu M, Ma A, et al. Modality-free feature detector and descriptor for multimodal remote sensing image registration[J]. Remote Sensing, 2020, 12(18): 2937.
[15] Dusmanu M, Rocco I, Pajdla T, et al. D2-net: A trainable cnn for joint description and detection of local features[C]. Proceedings of the ieee/cvf conference on computer vision and pattern recognition. 2019: 8092-8101.
[16] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. Advances in neural information processing systems, 2017, 30.

Downloads: 2241
Visits: 161889

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.