PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning. Angelos Filos¹, Clare Lyle¹, Yarin Gal¹, Sergey Levine², Natasha Jaques²,³, Gregory Farquhar⁴. Abstract...
Policy Caches with Successor Features. Mark Nemecek¹, Ronald Parr¹. Abstract tasks which vary in their reward functions, but where the dynamics remain the same. Although limited in scope, this Transfer in reinforcement learning is bas...
APS: Active Pretraining with Successor Features. Hao Liu¹, Pieter Abbeel¹. Abstract 2019; Vinyals et al., 2019; Badia et al., 2020a) and solving complex robotic control tasks (Andrychowicz et al., 2017; We introduce a new unsupervised...
A New Representation of Successor Features for Transfer across Dissimilar Environments. Majid Abdolshah¹, Hung Le¹, Thommen Karimpanal George¹, Sunil Gupta¹, Santu Rana¹, Svetha Venkatesh¹. Abstract into independent sub-domains. How...
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement. André Barreto¹, Diana Borsa¹, John Quan¹, Tom Schaul¹, David Silver¹, Matteo Hessel¹, Daniel Mankowitz¹, Augustin Žídek¹, Ré...