ValueIterationinContinuousActions,StatesandTimeMichaelLutter12ShieMannor13JanPeters2DieterFox14AnimeshGarg15AbstractValueIterationFittedValueIterationContinuousFittedValueIterationClassicalvalueite...
IdentifyingRewardFunctionsusingAnchorActionsSinongGeng1HoussamNassif2CarlosA.Manzanares2A.MaxReppen3RonnieSircar3Abstractwithfirmprofitfunctions(Abbring,2010;AguirregabiriaandNevo,2013).Weproposear...
GeneralizationtoNewActionsinReinforcementLearningAyushJain1AndrewSzot1JosephJ.Lim1AbstractActionAfundamentaltraitofintelligenceistheabil-GoalGoalitytoachievegoalsinthefaceofnovelcircum-stances,such...