Improving cache replacement policy using deep reinforcement learning