Model gradient: unified model and policy learning in model-based reinforcement learning