Methods for design of efficient on-device natural language processing architectures