YOLOv1 to YOLOv10: A Comprehensive Review of YOLO Variants and Their Application in Medical Image Detection

Ahealisi Yeerjiang; Zongyu Wang; Xiangtong Huang; Jing Zhang; Qi Chen; Yucheng Qin; Jia He

doi:10.23977/jaip.2024.070314

Author(s)

Ahealisi Yeerjiang ¹,², Zongyu Wang ², Xiangtong Huang ², Jing Zhang ², Qi Chen ², Yucheng Qin ², Jia He ¹,²

Affiliation(s)

¹ School of Health Sciences and Engineering, University of Shanghai for Science and Technology, Shanghai, China
² Department of Health Statistics, Naval Medical University, Shanghai, China

Corresponding Author

Jia He

ABSTRACT

The rapid evolution of computer vision has elevated object detection to a central task within the field. In medicine, automated lesion detection has the potential to greatly improve diagnostic efficiency for clinicians. The extraordinary success of deep learning in computer vision has motivated researchers globally to apply these advancements to medical image analysis. Deep learning techniques have demonstrated superior performance in medical image classification, detection, segmentation, registration, and retrieval compared to traditional methods. Among these techniques, the YOLO (You Only Look Once) series of algorithms stands out for its exceptional speed and accuracy, making it a popular choice for medical image detection. This paper presents the underlying principles and structure of the classic YOLO algorithms, reviews their current applications in medical image detection, addresses existing challenges, and explores future directions for the application of YOLO in this domain.

KEYWORDS

Deep learning, YOLO, Computer vision

CITE THIS PAPER

Ahealisi Yeerjiang, Zongyu Wang, Xiangtong Huang, Jing Zhang, Qi Chen, Yucheng Qin, Jia He, YOLOv1 to YOLOv10: A Comprehensive Review of YOLO Variants and Their Application in Medical Image Detection. Journal of Artificial Intelligence Practice (2024) Vol. 7: 112-122. DOI: http://dx.doi.org/10.23977/jaip.2024.070314.

