Learning Variable Selection Rules For The Branch-And-Bound Algorithm Using Reinforcement Learning