The prediction model adapts the VGG16 architecture. Starting from the pre-trained model that the VGG team used in the ILSVRC-2014 competition, we keep the pre-trained weights for all the convolutional layers to extract deep features from the images, and fine-tune the last two dense layers on the FLIC dataset.
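The setup described above can be sketched in Keras roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the function name `build_pose_model`, the `num_joints` and `hidden_units` parameters, the regression head that outputs one (x, y) pair per joint, the optimizer, and the mean-squared-error loss are all hypothetical choices made for the example. It also replaces VGG16's original dense layers with two freshly initialised ones, which is only one possible reading of "the last two dense layers".

```python
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16


def build_pose_model(num_joints, weights="imagenet",
                     input_shape=(224, 224, 3), hidden_units=1024):
    """Frozen VGG16 convolutional base plus two trainable dense layers.

    num_joints and hidden_units are hypothetical parameters for this sketch.
    """
    # include_top=False keeps only the convolutional blocks of VGG16.
    base = VGG16(weights=weights, include_top=False, input_shape=input_shape)

    # Freeze every convolutional layer so it acts as a fixed feature extractor.
    for layer in base.layers:
        layer.trainable = False

    # Two dense layers trained on the pose data (assumed head design):
    # one hidden layer and one linear output with (x, y) per joint.
    x = layers.Flatten()(base.output)
    x = layers.Dense(hidden_units, activation="relu")(x)
    outputs = layers.Dense(2 * num_joints)(x)

    model = models.Model(inputs=base.input, outputs=outputs)
    # Assumed training setup: plain coordinate regression with MSE.
    model.compile(optimizer="adam", loss="mse")
    return model
```

Calling, for example, `build_pose_model(num_joints=7)` would download the ImageNet weights on first use and return a model in which only the two dense layers are updated during `model.fit`; the value 7 is an illustrative placeholder, not a number taken from this project.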
Keras: Deep Learning library for Theano and TensorFlow
Some sample outputs on the FLIC dataset. The green skeleton is the ground truth, and the red skeleton is the model's prediction.
We are actively working on adapting the model to perform 3D pose estimation.