Abstract
Though Faster R-CNN based two-stage detectors have witnessed significant boost in pedestrian detection accuracy, they are still slow for practical applications. One solution is to simplify this working flow as a single-stage detector. However, current single-stage detectors (e.g. SSD) have not presented competitive accuracy on common pedestrian detection benchmarks. Accordingly, a structurally simple but effective module called Asymptotic Localization Fitting (ALF) is proposed, which stacks a series of predictors to directly evolve the default anchor boxes of SSD step by step to improve detection results. Additionally, combining the advantages from residual learning and multi-scale context encoding, a bottleneck block is proposed to enhance the predictors' discriminative power. On top of the above designs, an efficient single-stage detection architecture is designed, resulting in an attractive pedestrian detector in both accuracy and speed. A comprehensive set of experiments on two of the largest pedestrian detection datasets (i.e. CityPersons and Caltech) demonstrate the superiority of the proposed method, comparing to the state of the arts on both the benchmarks.
Original language | English |
---|---|
Article number | 8839736 |
Pages (from-to) | 1413-1425 |
Number of pages | 13 |
Journal | IEEE Transactions on Image Processing |
Volume | 29 |
DOIs | |
Publication status | Published - 2020 |
Externally published | Yes |
Keywords
- asymptotic localization fitting
- convolutional neural networks
- Pedestrian detection
ASJC Scopus subject areas
- Software
- Computer Graphics and Computer-Aided Design