China Mechanical Engineering ›› 2026, Vol. 37 ›› Issue (2): 442-451.DOI: 10.3969/j.issn.1004-132X.2026.02.019
LUO Hang, YANG Ye(
), CHEN Benyong
Received:2025-01-15
Online:2026-02-25
Published:2026-03-13
Contact:
YANG Ye
通讯作者:
杨晔
作者简介:罗 杭,女,2000 年生,硕士研究生。研究方向为机器人控制基金资助:CLC Number:
LUO Hang, YANG Ye, CHEN Benyong. Intelligent Part Identification and Grabbing Method Based on SGV-YOLOv8 Model[J]. China Mechanical Engineering, 2026, 37(2): 442-451.
罗杭, 杨晔, 陈本永. 基于SGV-YOLOv8模型的机械零件智能识别与抓取方法[J]. 中国机械工程, 2026, 37(2): 442-451.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.cmemo.org.cn/EN/10.3969/j.issn.1004-132X.2026.02.019
| 网络配置项 | 结构参数 |
|---|---|
| Epochs | 150 |
| Ir0 | 0.01 |
| Optimizer | SGD |
| Momentum | 0.937 |
| Weight decay | 0.0005 |
| Batch size | 64 |
Tab.1 Model super parameter setting
| 网络配置项 | 结构参数 |
|---|---|
| Epochs | 150 |
| Ir0 | 0.01 |
| Optimizer | SGD |
| Momentum | 0.937 |
| Weight decay | 0.0005 |
| Batch size | 64 |
| 模型 | 参数规模/MB | GFLOPs/G | 推理速度/ (帧·s | mAP@0.5/% |
|---|---|---|---|---|
| Faster R-CNN | 140.8 | 406.6 | 10 | 83.1 |
| SSD | 50.2 | 360.9 | 107 | 75.7 |
| YOLOv5 | 13.8 | 15.9 | 11 | 98.8 |
| YOLOv6s | 8.3 | 11.8 | 263 | 98.0 |
| YOLOv8m | 49.6 | 79.1 | 208 | 99.2 |
| YOLOv8n | 6.0 | 8.9 | 164 | 99.0 |
| YOLOv8s | 21.4 | 28.8 | 303 | 98.9 |
Tab.2 Performance comparison of part datasets on different algorithms
| 模型 | 参数规模/MB | GFLOPs/G | 推理速度/ (帧·s | mAP@0.5/% |
|---|---|---|---|---|
| Faster R-CNN | 140.8 | 406.6 | 10 | 83.1 |
| SSD | 50.2 | 360.9 | 107 | 75.7 |
| YOLOv5 | 13.8 | 15.9 | 11 | 98.8 |
| YOLOv6s | 8.3 | 11.8 | 263 | 98.0 |
| YOLOv8m | 49.6 | 79.1 | 208 | 99.2 |
| YOLOv8n | 6.0 | 8.9 | 164 | 99.0 |
| YOLOv8s | 21.4 | 28.8 | 303 | 98.9 |
| 模型 | 参数规模/MB | GFLOPs/G | mAP@0.5/% |
|---|---|---|---|
| MobileNet | 11.2 | 22.6 | 98.5 |
| ShuffleNet | 12.4 | 17.4 | 98.7 |
| GhostNet | 12.4 | 17.3 | 98.6 |
| FasterNet | 16.7 | 21.7 | 99.0 |
| StarNet | 11.1 | 17.3 | 98.7 |
Tab.3 Comparison of different backbone networks
| 模型 | 参数规模/MB | GFLOPs/G | mAP@0.5/% |
|---|---|---|---|
| MobileNet | 11.2 | 22.6 | 98.5 |
| ShuffleNet | 12.4 | 17.4 | 98.7 |
| GhostNet | 12.4 | 17.3 | 98.6 |
| FasterNet | 16.7 | 21.7 | 99.0 |
| StarNet | 11.1 | 17.3 | 98.7 |
原始 YOLOv8 网络 | YOLOv8 网络+ StarNet | YOLOv8 网络+ GSConv | YOLOv8 网络+ VoV-GSCSP | YOLOv8 网络+ StarNet+ GSConv | YOLOv8 网络+ StarNet+ VoV-GSCSP | YOLOv8 网络+ GSConv+ VoV-GSCSP | 原始YOLOv8 网络+ StarNet+ GSConv+ VoV-GSCSP | |
|---|---|---|---|---|---|---|---|---|
| YOLOv8 | √ | √ | √ | √ | √ | √ | √ | √ |
| StarNet | √ | √ | √ | √ | ||||
| GSConv | √ | √ | √ | √ | ||||
| VoV-GSCSP | √ | √ | √ | √ | ||||
| 参数规模/MB | 21.4 | 11.1 | 5.81 | 19.3 | 12.0 | 12.7 | 19.9 | 11.1 |
| GFLOPs/G | 28.8 | 17.3 | 26.2 | 21.3 | 16.9 | 17.3 | 25.1 | 14.1 |
推理速度/ (帧·s | 303.4 | 384.6 | 277.5 | 286 | 323.3 | 344.9 | 293.7 | 417.2 |
| mAP@0.5/% | 98.9 | 98.7 | 99.0 | 98.9 | 98.5 | 98.7 | 99.2 | 98.9 |
Tab.4 Ablation test results of YOLOv8
原始 YOLOv8 网络 | YOLOv8 网络+ StarNet | YOLOv8 网络+ GSConv | YOLOv8 网络+ VoV-GSCSP | YOLOv8 网络+ StarNet+ GSConv | YOLOv8 网络+ StarNet+ VoV-GSCSP | YOLOv8 网络+ GSConv+ VoV-GSCSP | 原始YOLOv8 网络+ StarNet+ GSConv+ VoV-GSCSP | |
|---|---|---|---|---|---|---|---|---|
| YOLOv8 | √ | √ | √ | √ | √ | √ | √ | √ |
| StarNet | √ | √ | √ | √ | ||||
| GSConv | √ | √ | √ | √ | ||||
| VoV-GSCSP | √ | √ | √ | √ | ||||
| 参数规模/MB | 21.4 | 11.1 | 5.81 | 19.3 | 12.0 | 12.7 | 19.9 | 11.1 |
| GFLOPs/G | 28.8 | 17.3 | 26.2 | 21.3 | 16.9 | 17.3 | 25.1 | 14.1 |
推理速度/ (帧·s | 303.4 | 384.6 | 277.5 | 286 | 323.3 | 344.9 | 293.7 | 417.2 |
| mAP@0.5/% | 98.9 | 98.7 | 99.0 | 98.9 | 98.5 | 98.7 | 99.2 | 98.9 |
| 模型 | 参数 规模/MB | GFLOPs/ G | 推理速度 FPS/(帧·s | mAP@0.5/% | mAP@0.5:0.95/% |
|---|---|---|---|---|---|
| YOLOv5 | 13.8 | 15.9 | 21.2 | 99.2 | 76.9 |
| YOLOv6s | 8.3 | 11.8 | 266.0 | 98.7 | 74.4 |
| YOLOv8n | 6.0 | 8.9 | 128.2 | 99.0 | 77.9 |
| YOLOv8s | 21.4 | 28.8 | 312.5 | 98.9 | 76.8 |
| YOLOv8m | 49.6 | 79.1 | 288.9 | 99.1 | 77.4 |
| SGV-YOLOv8 | 11.1 | 14.1 | 344.8 | 99.2 | 78.6 |
Tab.5 Generalization experiments on industrial tool datasets
| 模型 | 参数 规模/MB | GFLOPs/ G | 推理速度 FPS/(帧·s | mAP@0.5/% | mAP@0.5:0.95/% |
|---|---|---|---|---|---|
| YOLOv5 | 13.8 | 15.9 | 21.2 | 99.2 | 76.9 |
| YOLOv6s | 8.3 | 11.8 | 266.0 | 98.7 | 74.4 |
| YOLOv8n | 6.0 | 8.9 | 128.2 | 99.0 | 77.9 |
| YOLOv8s | 21.4 | 28.8 | 312.5 | 98.9 | 76.8 |
| YOLOv8m | 49.6 | 79.1 | 288.9 | 99.1 | 77.4 |
| SGV-YOLOv8 | 11.1 | 14.1 | 344.8 | 99.2 | 78.6 |
| 模型 | 试验次数 | 定位失败的零件数量 | 识别错误的零件数量 | 成功抓取次数 | 成功率/% |
|---|---|---|---|---|---|
| YOLOv8 | 30 | 7 | 2 | 21 | 70 |
| SGV-YOLOv8 | 30 | 6 | 0 | 24 | 80 |
Tab.6 Robot arm part grab results based on improved YOLO8
| 模型 | 试验次数 | 定位失败的零件数量 | 识别错误的零件数量 | 成功抓取次数 | 成功率/% |
|---|---|---|---|---|---|
| YOLOv8 | 30 | 7 | 2 | 21 | 70 |
| SGV-YOLOv8 | 30 | 6 | 0 | 24 | 80 |
| [1] | 谢丰隆, 韩建海, 李向攀. 一种快速的机器人固定视觉标定方法[J]. 机械设计与制造,2018(11): 237-240. |
| XIE Fenglong, HAN Jianhai, LI Xiangpan. A Fast Way of Stable Camera Calibration with Robot[J]. Machinery Design & Manufacture, 2018(11):237-240. | |
| [2] | 那一鸣, 胡超, 邱业余, 等. 基于机器视觉的汽车车门三维定位引导[J]. 中国机械工程, 2024, 35(9): 1677-1687. |
| NA Yiming, HU Chao, QIU Yeyu, et al. Three-dimensional Positioning Guidance of Automobile Doors Based on Machine Vision [J]. China Mechanical Engineering, 2024, 35(9): 1677-1687. | |
| [3] | NAKAGUCHI V M, LIU Zifu, et al. 3D Camera and Single-point Laser Sensor Integration for Apple Localization in Spindle-type Orchard Systems[J]. Sensors, 2024, 24(12): 3753. |
| [4] | LUHMANN T, FRASER C, MAAS H G. Sensor Modelling and Camera Calibration for Close-range Photogrammetry[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2016, 115: 37-46. |
| [5] | LIU Zewei, LU Dongming, QIAN Weixian, et al. Calibration of a Single-point Laser Range Finder and a Camera[J]. Optical and Quantum Electronics, 2018, 50(12): 447. |
| [6] | PATEL S N, REKIMOTO J, ABOWD G D. ICam: Precise At-a-distance Interaction in the Physical Environment[C]∥Pervasive Computing. Berlin, 2006: 272-287. |
| [7] | WITHER J, COFFIN C, VENTURA J, et al. Fast Annotation and Modeling with a Single-point Laser Range Finder[C]∥2008 7th IEEE/ACM International Symposium on Mixed and Augmented Reality. Cambridge, 2008: 65-68. |
| [8] | 吕张成, 张建业, 陈哲钥, 等. 基于深度学习的工业零件识别与抓取实时检测算法[J]. 机床与液压, 2023, 51(24): 33-38. |
| Zhangcheng LYU, ZHANG Jianye, CHEN Zheyao, et al. Real-time Detection Algorithm for Industrial Parts Recognition and Grabbing Based on Deep Learning [J]. Machine Tool & Hydraulics, 2023, 51(24): 33-38. | |
| [9] | HINTON G E, SALAKHUTDINOV R R. Reducing the Dimensionality of Data with Neural Networks[J]. Science, 2006, 313(5786): 504-507. |
| [10] | GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C]∥2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, 2014: 580-587. |
| [11] | HE Kaiming, ZHANG Xiangyu, REN Shaoqing, et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[C]∥Computer Vision – ECCV 2014. Cham, 2014: 346-361. |
| [12] | GIRSHICK R. Fast R-CNN[C]∥2015 IEEE International Conference on Computer Vision (ICCV). Santiago, 2015: 1440-1448. |
| [13] | REN Shaoqing, HE Kaiming, GIRSHICK R, et al. Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. |
| [14] | DAI J, LI Y, HE K, et al. R⁃FCN: Object Detection via Region⁃based Fully Convolutional Network[C]∥30th Conference on Neural Information Processing Systems. Barcelona, 2016:379-387. |
| [15] | 黎洲, 黄妙华. 基于YOLO_v2模型的车辆实时检测[J].中国机械工程, 2018, 29(15): 1869-1874. |
| LI Zhou, HUANG Miaohua. Vehicle Detections Based on YOLO_v2 in Real-time [J]. China Mechanical Engineering, 2018, 29(15): 1869-1874. | |
| [16] | REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once: Unified, Real-time Object Detection[C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, 2016: 779-788. |
| [17] | LIU Wei, ANGUELOV D, ERHAN D, et al. SSD: Single Shot MultiBox Detector[C]∥Computer Vision–ECCV 2016. Cham, 2016: 21-37. |
| [18] | MA Xu, DAI Xiyang, BAI Yue, et al. Rewrite the Stars[C]∥2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle, 2024: 5694-5703. |
| [19] | LI H, LI J, WEI H, et al. Slim-neck by GSConv: a Better Design Paradigm of Detector Architectures for Autonomous Vehicles[J]. arXiv Preprint arXiv:, 2022. |
| [1] | Song ZHANG, Chaoyong ZHANG, Chuanjun ZHU, Saixiyalatu BAO. In-situ Monitoring Method of Milling Cutter Wear Based on Spatial Attention Mechanism U-Net [J]. China Mechanical Engineering, 2025, 36(11): 2720-2727. |
| [2] | NA Yiming1, HU Chao1, QIU Yeyu2, LU Libing2, SONG Kai1. Three-dimensional Positioning Guidance of Automobile Doors Based on Machine Vision [J]. China Mechanical Engineering, 2024, 35(09): 1677-1687. |
| [3] | FANG Yuntao, WANG Xiaodong, XU Song, WANG Huibin, LUO Yi, . Miniature Thread Pairs Automatic Assembly System for Gimbals [J]. China Mechanical Engineering, 2022, 33(06): 698-706. |
| [4] | YI Huaian, ZHAO Xinjia, TANG Le, CHEN Yonglun. Vision Measurement Method for Ground Surface Roughness Based on Color Image Singular Value Entropy Index#br# [J]. China Mechanical Engineering, 2021, 32(13): 1577-1583. |
| [5] |
LI Maoyue;LYU Hongyu;WANG Fei;JIA Dongkai.
An Intelligent Vehicle Robust Lane Line Identification Method Based on Machine Vision
[J]. China Mechanical Engineering, 2021, 32(02): 242-251.
|
| [6] | Chen Tianfan, Gao Chenghui, He Bingwei. View Planning in Line Laser Measurement for Self-occlusion Objects [J]. China Mechanical Engineering, 2016, 27(10): 1370-1376. |
| [7] | Tian Mingrui, Hu Yongbiao, Jin Shoufeng. Experimental Study on Recognation of Bulk Materials Level by SVM Posterior Probability [J]. China Mechanical Engineering, 2016, 27(05): 646-651. |
| [8] | ZHENG Jin-Ju, LI Wen-Long, WANG Yu-Hui, LUO Meng-Cheng. QFP Chip Visual Inspection System and Its Inspection Method [J]. China Mechanical Engineering, 2013, 24(3): 290-294,301. |
| [9] | TIAN Meng-Dui-1, HU Yong-Biao-1, JIN Shou-Feng-2. Experimental Study on Level Detection of Bulk Material Entrucking Based on Image Texture [J]. China Mechanical Engineering, 2013, 24(07): 910-914. |
| [10] | YANG Qiang-Hua-1, CHEN Liang-1, XUN Yi-1, CHEN Wen-Biao-2. Automatic Defect Inspection of PCB Bare Board Based on Machine Vision [J]. China Mechanical Engineering, 2012, 23(22): 2661-2666. |
| [11] | CHEN Tian-Fan-1, 2, GAO Cheng-Hui-1, HE Bing-Wei-1. Study on View Planning of Eliminating Occlusion and Holes in 3D Digitization of Objects [J]. China Mechanical Engineering, 2012, 23(21): 2585-2590. |
| [12] |
ZHOU Bo-Wen, WANG Yao-Na, ZHANG Hui, GE Ji.
Research and Development on Intelligent Inspection System for Wine Based on Machine Vision
[J]. China Mechanical Engineering, 2010, 21(7): 766-772,821.
|
| [13] | Zhang Hui;Wang Yaonan;Ge Ji;Zhou Bowen. Design for System of Intelligent Robot Detecting Liquid Pharmaceutical Foreign Bodies #br# [J]. China Mechanical Engineering, 2009, 20(20): 0-2519. |
| [14] | Wu Jigang;Bin Hongzan. Subpixel Edge Detection of Machine Vision Image for Thin Sheet Part [J]. J4, 2009, 20(03): 0-266. |
| [15] | He Boxia;Zhang Zhisheng;Xu Sunhao;Shi Jinfei. Research on High-precision Machine Vision Measurement Method for Large Scale Parts [J]. J4, 2009, 20(01): 0-14,1. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||