China Mechanical Engineering ›› 2021, Vol. 32 ›› Issue (09): 1114-1125. DOI: 10.3969/j.issn.1004-132X.2021.09.013


Low-observable Target Detection Method for Autonomous Vehicles Based on Multi-modal Feature Fusion

ZOU Wei; YIN Guodong; LIU Haoji; GENG Keke; HUANG Wenhan; WU Yuan; XUE Hongwei

  1. School of Mechanical Engineering, Southeast University, Nanjing, 211189
  • Online: 2021-05-10  Published: 2021-05-28

  • Corresponding author: YIN Guodong, male, born in 1976, professor and doctoral supervisor. His research interests include intelligent and connected vehicles, autonomous driving and intelligent driver-assistance systems, vehicle-infrastructure cooperation, control systems for new energy vehicles, and vehicle dynamics and control. E-mail: ygd@seu.edu.cn.
  • About the first author: ZOU Wei, male, born in 1994, master's candidate. His research interests include computer vision, deep learning, and visual perception.
  • Supported by:
    National Natural Science Foundation of China (51975118);
    Key Research and Development Program of Jiangsu Province (BE2019004-2);
    Jiangsu Province Achievement Transformation Project (BA2018023)

Abstract: Aiming at the problem of low-observable target detection for autonomous vehicles under real driving conditions, a target detection method based on multi-modal feature fusion was proposed. To improve detection of low-observable targets, a multi-modal deep convolutional neural network was designed based on Faster R-CNN to fuse the features of RGB, polarized and infrared images; a real-time low-observable target detection system using the three image modalities was developed; and the applications of multi-modal image feature fusion in the intelligent perception systems of autonomous vehicles were explored. A manually labeled multi-modal image dataset of low-observable targets was built, and the deep neural network was trained on it to optimize its internal parameters so that the system could detect and recognize both pedestrians and vehicles in complex environments. The experimental results indicate that the deep convolutional neural network based on multi-modal feature fusion achieves better detection and recognition performance on low-observable targets in complex environments than traditional single-modal methods.
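The following is a minimal sketch, assuming a PyTorch environment, of the feature-level fusion scheme described above: each modality (RGB, polarized, infrared) passes through its own convolutional branch, the branch features are concatenated and fused by a 1x1 convolution, and the fused feature map would then feed a Faster R-CNN region proposal network and detection head. It is not the authors' released implementation; the branch depth, the single-channel treatment of the polarized and infrared images, and fusion by concatenation are illustrative assumptions, since the abstract does not give the exact network details.

import torch
import torch.nn as nn


def conv_branch(in_channels, feat_channels=256):
    # A small convolutional feature extractor for one image modality.
    return nn.Sequential(
        nn.Conv2d(in_channels, 64, kernel_size=7, stride=2, padding=3),
        nn.BatchNorm2d(64),
        nn.ReLU(inplace=True),
        nn.MaxPool2d(kernel_size=3, stride=2, padding=1),
        nn.Conv2d(64, feat_channels, kernel_size=3, stride=2, padding=1),
        nn.BatchNorm2d(feat_channels),
        nn.ReLU(inplace=True),
    )


class MultiModalFusionBackbone(nn.Module):
    # Per-modality branches whose outputs are concatenated and fused by a
    # 1x1 convolution; the fused map would be handed to a Faster R-CNN
    # region proposal network and detection head.
    def __init__(self, feat_channels=256):
        super().__init__()
        self.rgb_branch = conv_branch(3, feat_channels)  # RGB image
        self.pol_branch = conv_branch(1, feat_channels)  # polarized image (assumed 1 channel)
        self.ir_branch = conv_branch(1, feat_channels)   # infrared image (assumed 1 channel)
        self.fuse = nn.Conv2d(3 * feat_channels, feat_channels, kernel_size=1)
        self.out_channels = feat_channels  # channel width seen by the detection head

    def forward(self, rgb, pol, ir):
        fused = torch.cat(
            [self.rgb_branch(rgb), self.pol_branch(pol), self.ir_branch(ir)],
            dim=1,
        )
        return self.fuse(fused)


if __name__ == "__main__":
    backbone = MultiModalFusionBackbone()
    rgb = torch.randn(1, 3, 384, 512)  # color image
    pol = torch.randn(1, 1, 384, 512)  # degree-of-polarization image
    ir = torch.randn(1, 1, 384, 512)   # thermal infrared image
    print(backbone(rgb, pol, ir).shape)  # torch.Size([1, 256, 48, 64])

Fusing by concatenation plus a 1x1 convolution keeps the fused feature map at a fixed channel width, so a standard Faster R-CNN RPN and ROI head can consume it unchanged; other fusion strategies such as element-wise addition or attention weighting would slot in at the same point.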

Key words: autonomous driving, multi-modal feature fusion, deep convolutional neural network, low-observable target, intelligent perception

