PC-MLP:Model-basedReinforcementLearningwithPolicyCoverGuidedExplorationYudaSong1WenSun2Abstractsuccessrate0.5HandEgg0.4Model-basedReinforcementLearning(RL)isa0.3DeepPC-MPL200000popularlearningparad...