Multi-StepGreedyReinforcementLearningAlgorithmsMananTomar1YonathanEfroni2MohammadGhavamzadeh3Abstractestimations(Greensmithetal.,2004)andtohavedifficultiesinhandlingfunctionapproximation(e.g.,Thrun...