Abstract
Recently, pedestrian attributes like gender, age, clothing etc., have been used as soft biometric traits for recognizing people. Unlike existing methods that assume the independence of attributes during their prediction, we propose a multi-label convolutional neural network (MLCNN) to predict multiple attributes together in a unified framework. Firstly, a pedestrian image is roughly divided into multiple overlapping body parts, which are simultaneously integrated in the multi-label convolutional neural network. Secondly, these parts are filtered independently and aggregated in the cost layer. The cost function is a combination of multiple binary attribute classification cost functions. Experiments show that the proposed method significantly outperforms the SVM based method on the PETA database.
| Original language | English |
|---|---|
| Pages (from-to) | 224-229 |
| Number of pages | 6 |
| Journal | Image and Vision Computing |
| Volume | 58 |
| DOIs | |
| Publication status | Published - Feb 1 2017 |
| Externally published | Yes |
Keywords
- Convolutional neural network
- Multi-label classification
- Pedestrian attribute classification
ASJC Scopus subject areas
- Signal Processing
- Computer Vision and Pattern Recognition