ICCV 2021 Workshop SSLAD Track 3B - Continual Object Detection.

DAVAR LAB

Official Website

[Technical Report]

Introduction

Autonomous driving technology has been significantly accelerated in recent years because of its great potential in reducing accidents, saving human lives and improving efficiency. In this competition, we focus on continul learning for object detection on four different scenes sequentially.

SODA10M is a 2D object detection dataset, which contains images captured from four different scenarios. These scenarios are set as four tasks for continual learning. The continual learner is trained on each task sequentially. The tasks are:

  • Task 1: Daytime, citystreet and clear weather. There are 4470 images in the training set.
  • Task 2: Daytime, highway and clear/overcast weather. There are 1329 images in the training set.
  • Tast 3: Night. There are 1479 images in the training set.
  • Tast 4: Daytime, rain. There are 524 images in the training set.

We have won the 1st place out of 53 teams in this challenge using semi-supervised learning solution.

Solution Description

In the SSLAD-Track 3B challenge on continual learning, we propose the method of COntinual Learning with Transformer (COLT). We find that transformers suffer less from catastrophic forgetting compared to convolutional neural network. The major principle of our method is to equip the transformer based feature extractor with old knowledge distillation and head expanding strategies to compete catastrophic forgetting. In this report, we first introduce the overall framework of continual learning for object detection. Then, we analyse the key elements’ effect on withstanding catastrophic forgetting in our solution. Our method achieves 70.78 mAP on the SSLAD-Track 3B challenge test set.

For more details, please refer to our Technical Report