Reinforcement Learning in Configurable Continuous Environments. Alberto Maria Metelli¹, Emanuele Ghelfi¹, Marcello Restelli¹. Abstract ... as a Configurable Markov Decision Process (Conf-MDP, Metelli et al., 2018). As in traditional...
Configurable Markov Decision Processes. Alberto Maria Metelli¹, Mirco Mutti¹, Marcello Restelli¹. Abstract ...ified (Givan et al., 1997; Ni & Liu, 2008). A common approach is to solve a minimax problem to find a robust ... In many real-world pro...