Towards human pose estimation in video sequences