Proximal policy optimization with an integral compensator for quadrotor control