1. 基于深度学习的目标检测及机械臂抓取.
- Author
-
张 蕾, 张森晖, 严 松, and 袁 媛
- Subjects
- *
OBJECT recognition (Computer vision) , *FEATURE extraction , *RUNNING speed , *DEEP learning , *ROBOTICS - Abstract
Addressing the issues of slow speed and poor performance in multi-object grasping detection in unstructured environments, a method of performing object detection before grasping detection is pro- posed. In object detection, to accelerate the network's running speed, this paper improved the YOLOv5 network by employing depth wise separable convolutions and coordinate attention mechanisms. For the grasping task.a single-stage grasping pose detection algorithm was designed. Firstly, considering the in- terference present in unstructured environments, RGB-D images were selected as the input data for the grasping network, and GG-CNN was chosen as the backbone network. Secondly, to enhance the feature extraction capabilities of the grasping network, the parallel use of different size convolutional kernels in the Inception-ResNet module was utilized to broaden the network's receptive field. Additionally, the inte gration of a parameter-free three-dimensional attention mechanism enabled the network to focus more on grasping information features and suppress background noise. Finally, a grasping quality evaluation was employed to refine the grasping boxes, and the grasping box with the highest confidence score was out- put. The experimental results indicate that the improved object detection network has a parameter count of 2 776 708 and achieves 102 frames per second (FPS). On the public Cornell dataset, the improved grasping detection network achieves an accuracy of 96. 57% with a FPS of 54, 17. The combination of the two improved networks can be deployed on robotic arms and effectively accomplish grasping tasks in multi-object scenarios, making them suitable for practical industrial applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF