Discriminative Video Representation Learning