Decoupling efficiency-performance optimization for modern neural networks