New publication added!
Posted on Tue 14 March 2017 in research
Our work on Deep Refinement Convolutional Networks for Human Pose Estimation was accepted at The 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017) which will be held from May 30 to June 3, 2017 in Washington, DC.
Abstract
Deep Refinement Convolutional Networks for Human Pose Estimation
Ioannis Marras, Petar Palasek, Ioannis Patras
This work introduces a novel Convolutional Network architecture (ConvNet) for the task of human pose estimation, that is the localization of body joints in a single static image. The proposed coarse to fine architecture addresses shortcomings of the baseline architecture that stem from the fact that large inaccuracies of its coarse ConvNet cannot be corrected by the refinement ConvNet that refines the estimation within small windows of the coarse prediction. This is achieved by a) changes in architectural parameters that both increase the accuracy of the coarse model and make the refinement model more capable of correcting the errors of the coarse model, b) the introduction of a Markov Random Field (MRF)-based spatial model network between the coarse and the refinement model that introduces geometric constraints and c) a training scheme that adapts the data augmentation and the learning rate according to the difficulty of the data examples. The proposed architecture is trained in an end-to-end fashion. Experimental results show that the proposed method improves the baseline model and provides state of the art results on the FashionPose [8] and MPII benchmarks [1].