AnInference-BasedPolicyGradientMethodforLearningOptionsMatthewJ.A.Smith1HerkeVanHoof2JoellePineau1Abstractatvariouslevelsofabstraction,itispossibletoinfer,learnandplanmuchmoreefficiently.Further,ab...