YOLOv1 to YOLOv10: A Comprehensive Review of YOLO Variants and Their Application in Medical Image Detection
DOI: 10.23977/jaip.2024.070314
Author(s)
Ahealisi Yeerjiang 1,2, Zongyu Wang 2, Xiangtong Huang 2, Jing Zhang 2, Qi Chen 2, Yucheng Qin 2, Jia He 1,2
Affiliation(s)
1 School of Health Sciences and Engineering, University of Shanghai for Science and Technology, Shanghai, China
2 Department of Health Statistics, Naval Medical University, Shanghai, China
Corresponding Author
Jia He
ABSTRACT
The rapid evolution of computer vision has made object detection a central task within the field. In medicine, automated lesion detection has the potential to greatly improve diagnostic efficiency for clinicians. The success of deep learning in computer vision has motivated researchers worldwide to apply these advances to medical image analysis, where deep learning techniques have outperformed traditional methods in classification, detection, segmentation, registration, and retrieval. Among these techniques, the YOLO (You Only Look Once) series of algorithms stands out for its speed and accuracy, making it a popular choice for medical image detection. This paper presents the underlying principles and structure of the classic YOLO algorithms, reviews their current applications in medical image detection, addresses existing challenges, and explores future directions for the application of YOLO in this domain.
KEYWORDS
Deep learning, YOLO, Computer vision
CITE THIS PAPER
Ahealisi Yeerjiang, Zongyu Wang, Xiangtong Huang, Jing Zhang, Qi Chen, Yucheng Qin, Jia He, YOLOv1 to YOLOv10: A Comprehensive Review of YOLO Variants and Their Application in Medical Image Detection. Journal of Artificial Intelligence Practice (2024) Vol. 7: 112-122. DOI: http://dx.doi.org/10.23977/jaip.2024.070314.