Next Article in Journal
Design, Calibration and Morphological Characterization of a Flexible Sensor with Adjustable Chemical Sensitivity and Possible Applications to Sports Medicine
Previous Article in Journal
Reflective Distributed Denial of Service Detection: A Novel Model Utilizing Binary Particle Swarm Optimization—Simulated Annealing for Feature Selection and Gray Wolf Optimization-Optimized LightGBM Algorithm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

IV-YOLO: A Lightweight Dual-Branch Object Detection Network

by
Dan Tian
,
Xin Yan
,
Dong Zhou
*,
Chen Wang
and
Wenshuai Zhang
Institute of Electronic Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
*
Author to whom correspondence should be addressed.
Sensors 2024, 24(19), 6181; https://doi.org/10.3390/s24196181
Submission received: 27 August 2024 / Revised: 19 September 2024 / Accepted: 23 September 2024 / Published: 24 September 2024
(This article belongs to the Section Sensor Networks)

Abstract

With the rapid growth in demand for security surveillance, assisted driving, and remote sensing, object detection networks with robust environmental perception and high detection accuracy have become a research focus. However, single-modality image detection technologies face limitations in environmental adaptability, often affected by factors such as lighting conditions, fog, rain, and obstacles like vegetation, leading to information loss and reduced detection accuracy. We propose an object detection network that integrates features from visible light and infrared images—IV-YOLO—to address these challenges. This network is based on YOLOv8 (You Only Look Once v8) and employs a dual-branch fusion structure that leverages the complementary features of infrared and visible light images for target detection. We designed a Bidirectional Pyramid Feature Fusion structure (Bi-Fusion) to effectively integrate multimodal features, reducing errors from feature redundancy and extracting fine-grained features for small object detection. Additionally, we developed a Shuffle-SPP structure that combines channel and spatial attention to enhance the focus on deep features and extract richer information through upsampling. Regarding model optimization, we designed a loss function tailored for multi-scale object detection, accelerating the convergence speed of the network during training. Compared with the current state-of-the-art Dual-YOLO model, IV-YOLO achieves mAP improvements of 2.8%, 1.1%, and 2.2% on the Drone Vehicle, FLIR, and KAIST datasets, respectively. On the Drone Vehicle and FLIR datasets, IV-YOLO has a parameter count of 4.31 M and achieves a frame rate of 203.2 fps, significantly outperforming YOLOv8n (5.92 M parameters, 188.6 fps on the Drone Vehicle dataset) and YOLO-FIR (7.1 M parameters, 83.3 fps on the FLIR dataset), which had previously achieved the best performance on these datasets. This demonstrates that IV-YOLO achieves higher real-time detection performance while maintaining lower parameter complexity, making it highly promising for applications in autonomous driving, public safety, and beyond.
Keywords: dual-branch image object detection; IV-YOLO; bi-directional pyramid feature fusion; attention mechanism; small target detection dual-branch image object detection; IV-YOLO; bi-directional pyramid feature fusion; attention mechanism; small target detection

Share and Cite

MDPI and ACS Style

Tian, D.; Yan, X.; Zhou, D.; Wang, C.; Zhang, W. IV-YOLO: A Lightweight Dual-Branch Object Detection Network. Sensors 2024, 24, 6181. https://doi.org/10.3390/s24196181

AMA Style

Tian D, Yan X, Zhou D, Wang C, Zhang W. IV-YOLO: A Lightweight Dual-Branch Object Detection Network. Sensors. 2024; 24(19):6181. https://doi.org/10.3390/s24196181

Chicago/Turabian Style

Tian, Dan, Xin Yan, Dong Zhou, Chen Wang, and Wenshuai Zhang. 2024. "IV-YOLO: A Lightweight Dual-Branch Object Detection Network" Sensors 24, no. 19: 6181. https://doi.org/10.3390/s24196181

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop