Model-free RL does not do this planning, and so has a much harder job
The difference is that Tassa et al. use model predictive control, which gets to plan against a ground-truth world model (the physics simulator). On the other hand, if planning against a model helps this much, why…
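To make "planning against a ground-truth model" concrete, here is a minimal sketch of model predictive control on a toy problem. Everything in it is an assumption made for illustration: the 1-D point mass, the cost, and the names `step`, `cost`, `plan`, and `run_mpc` are all hypothetical, and the planner is plain random shooting, swapped in because it is short. It is not a claim about the optimizer or simulator Tassa et al. use. The point is only that the controller calls the true dynamics function while it plans, which is exactly what a model-free method never gets to do.

```python
import random

def step(state, action, dt=0.1):
    """Ground-truth dynamics of a toy 1-D point mass (hypothetical example).

    state is (position, velocity); action is an acceleration.
    """
    pos, vel = state
    vel = vel + action * dt
    pos = pos + vel * dt
    return (pos, vel)

def cost(state, action):
    """Penalize distance from the origin, speed, and control effort."""
    pos, vel = state
    return pos ** 2 + 0.1 * vel ** 2 + 0.001 * action ** 2

def plan(state, rng, horizon=15, n_candidates=200):
    """Random-shooting MPC: sample action sequences, roll each one out
    in the true model, and return the first action of the cheapest one."""
    best_cost, best_action = float("inf"), 0.0
    for _ in range(n_candidates):
        actions = [rng.uniform(-1.0, 1.0) for _ in range(horizon)]
        s, total = state, 0.0
        for a in actions:
            s = step(s, a)  # the planner queries the ground-truth model
            total += cost(s, a)
        if total < best_cost:
            best_cost, best_action = total, actions[0]
    return best_action

def run_mpc(start=(1.0, 0.0), n_steps=100, seed=0):
    """Replan at every step, apply only the first planned action."""
    rng = random.Random(seed)
    state = start
    for _ in range(n_steps):
        state = step(state, plan(state, rng))
    return state

if __name__ == "__main__":
    print(run_mpc())
```

No learning happens anywhere in this loop. All of the work is done by rolling candidate actions through `step`, so the sketch only applies when an accurate model is available to query.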