Chenfeng Liu, Jingkai Zhou, Dongdong Li, Panagiotis Giannakeris, Nitin Bansal, Yunchao Wei, Chase Brown, Bo Ke, Han Deng, Zhipeng Deng, Lin Cheng, Lianjie Wang, Qingming Huang, Zexin Wang, Qinqin Nie, Guanghui Wang, Wenrui He, Xintao Lian, Shubo Wei, Lars Sommer, Jian Cheng, Lei Zhang, Wei Li, Xiaoyu Liu, Haipeng Zhang, Emmanouil Michail, Ioannis Kompatsiaris, Zhaoyue Xia, Yuanwei Wu, Qinghua Hu, Liyu Lu, Haoran Wang, Haibin Ling, Kaiwen Duan, Qiuchen Sun, Siwei Wang, Hongyu Xu, Qishang Cheng, Yong Wang, Hao Liu, Xiao Bian, Nehal Mamgain, Lin Ma, Shengjin Wang, Yue Fan, K J Joseph, Minyu Huang, Honggang Qi, Robert Laganiere, Honghui Shi, Yali Li, Chen Qian, Lu Ding, Juanping Zhao, Xiufang Li, Zichen Song, Ke Wang, Heqian Qiu, Oliver Acatay, Zhen Cui, Wei Zhang, Stefanos Vrochidis, Arne Schumann, Xinbin Luo, Usman Sajid, Yifan Zhang, Sujuan Wang, Ying Li, Qijie Zhao, Feng Ni, Tiaojio Lee, Zhenwei He, Weikun Wu, Yongtao Wang, Fan Zhang, Yangliu Kuai, Qiong Liu, Wenzhe Yang, Hao Cheng, Vineeth N Balasubramanian, Yuqin Zhang, Jianqiang Wang, Jianxiu Yang, Zhiyao Guo, Dawei Du, Li Yang, Chengzheng Li, Xiaoyu Chen, Longyin Wen, Hongliang Li, Sheng Jiang, Yi Luo, Naveen Kumar Vedurupaka, Karthik Suresh, Zhangyang Wang, Qian Wang, Pengfei Zhu, Yiling Liu, Wenya Ma, Hu Lin, Wenchi Ma, Feng Zhu, Konstantinos Avgerinakis, Xin Sun, and Haotian Wu
Object detection is a hot topic with various applications in computer vision, e.g., image understanding, autonomous driving, and video surveillance. Much of the progresses have been driven by the availability of object detection benchmark datasets, including PASCAL VOC, ImageNet, and MS COCO. However, object detection on the drone platform is still a challenging task, due to various factors such as view point change, occlusion, and scales. To narrow the gap between current object detection performance and the real-world requirements, we organized the Vision Meets Drone (VisDrone2018) Object Detection in Image challenge in conjunction with the 15th European Conference on Computer Vision (ECCV 2018). Specifically, we release a large-scale drone-based dataset, including 8, 599 images (6, 471 for training, 548 for validation, and 1, 580 for testing) with rich annotations, including object bounding boxes, object categories, occlusion, truncation ratios, etc. Featuring a diverse real-world scenarios, the dataset was collected using various drone models, in different scenarios (across 14 different cities spanned over thousands of kilometres), and under various weather and lighting conditions. We mainly focus on ten object categories in object detection, i.e., pedestrian, person, car, van, bus, truck, motor, bicycle, awning-tricycle, and tricycle. Some rarely occurring special vehicles (e.g., machineshop truck, forklift truck, and tanker) are ignored in evaluation. The dataset is extremely challenging due to various factors, including large scale and pose variations, occlusion, and clutter background. We present the evaluation protocol of the VisDrone-DET2018 challenge and the comparison results of 38 detectors on the released dataset, which are publicly available on the challenge website: http://www.aiskyeye.com/. We expect the challenge to largely boost the research and development in object detection in images on drone platforms.