Decoupling Value and Policy for Generalization in Reinforcement Learning. Roberta Raileanu, Rob Fergus. Abstract: ...ization (Farebrother et al., 2018; Zhang et al., 2018a; Cobbe et al., 2018; Igl et al., 2019), data augmentation (Cobb...
Decoupling Representation Learning from Reinforcement Learning. Adam Stooke, Kimin Lee, Pieter Abbeel, Michael Laskin. Abstract: ...Haarnoja et al., 2018) and have been successfully applied to domains ranging from real-world (Levin...
Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices. Evan Zheran Liu, Aditi Raghunathan, Percy Liang, Chelsea Finn. Abstract: ...a new kitchen (the environment) after it has learned to cook oth...
GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms. Cédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer. Abstract: Deep RL algorithms generally consist in applying Stochastic Gradient...
Decoupling Gradient-Like Learning Rules from Representations. Philip S. Thomas, Christoph Dann, Emma Brunskill. Abstract: ...work poorly with, or be incompatible with, others. Although this intertwining is unavoidable and is likely...