The paper proposes a Reinforcement Learning based agent that controls three KPIs of the mobile network to reach a maximized sum throughput of the newtork, such that the number of uncovered users is kept minimum and the energy consumed due to the MIMO technology is kept minimum as well. The environment is a simulated mobile network using NS3.