Policy Caches with Successor Features

Mark Nemecek¹ Ronald Parr¹

Abstract

tasks which vary in their reward functions, but where the dynamics remain the same. Although limited in scope, this

Transfer in reinforcement learning is bas...