InstaBoost++: Visual Coherence Principles for Unified 2D/3D Instance Level Data Augmentation.

Authors :
Sun, Jianhua
Fang, Hao-Shu
Li, Yuxuan
Wang, Runzhong
Gou, Minghao
Lu, Cewu
Source :
International Journal of Computer Vision. Oct 2023, Vol. 131 Issue 10, p2665-2681. 17p.
Publication Year :
2023

Abstract

Instance-level perception tasks like object detection, instance segmentation, and 3D detection require many training samples to achieve satisfactory performance. The meticulous labels for these tasks are usually expensive to obtain and data augmentation is a natural choice to tackle such a problem. However, instance-level augmentation is less studied in previous research. In this paper, we present an effective, efficient and unified crop-paste mechanism to augment the training set utilizing existing instance-level annotations. Our design is derived from visual coherence and mines three inherent principles that widely exist in real-world data: (i) background coherence in local neighbor area, (ii) appearance coherence for instance placement, and (iii) instance coherence within the same category. Such methodologies are unified for various tasks including object detection, instance segmentation, and 3D detection. Extensive experiments demonstrate that our proposed approaches can successfully boost the performance of diverse frameworks on various datasets across multiple tasks, without modifying the network structure. Remarkable improvements are obtained: 5.1 mAP for object detection and 3.2 mAP for instance segmentation on COCO dataset, and 6.9 mAP for 3D detection on ScanNetV2 dataset. Our method can be easily integrated into different frameworks without affecting the training and inference efficiency. [ABSTRACT FROM AUTHOR]
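The crop-paste idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `crop_paste`, the fixed `(dy, dx)` offset, and the use of NumPy arrays are assumptions made for illustration only. The sketch copies a masked instance to a shifted location and returns the updated mask; it does not choose the location from the coherence principles described above, and it leaves the original instance in place.

```python
import numpy as np


def crop_paste(image, mask, dy, dx):
    """Hypothetical helper: copy the masked instance to a spot offset by (dy, dx).

    image: H x W x C array; mask: H x W binary array marking one instance.
    Returns the augmented image and the mask of the pasted copy.
    """
    h, w = mask.shape
    ys, xs = np.nonzero(mask)              # pixel coordinates of the instance
    new_ys, new_xs = ys + dy, xs + dx      # shifted coordinates

    # Keep only the pixels that still fall inside the image after the shift.
    keep = (new_ys >= 0) & (new_ys < h) & (new_xs >= 0) & (new_xs < w)

    out = image.copy()
    out[new_ys[keep], new_xs[keep]] = image[ys[keep], xs[keep]]

    new_mask = np.zeros_like(mask)
    new_mask[new_ys[keep], new_xs[keep]] = 1
    return out, new_mask
```

In this sketch the annotation for the pasted copy comes for free from the shifted mask, which is the sense in which existing instance-level annotations are reused.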

Subjects

Subjects :
*DATA augmentation

Details

Language :
English
ISSN :
0920-5691
Volume :
131
Issue :
10
Database :
Academic Search Index
Journal :
International Journal of Computer Vision
Publication Type :
Academic Journal
Accession number :
170028632
Full Text :
https://doi.org/10.1007/s11263-023-01807-9