Self-Imitation Learning

Junhyuk Oh 1, Yijie Guo 1, Satinder Singh 1, Honglak Lee 2 1

Abstract

This paper proposes Self-Imitation Learning (SIL), a simple off-policy actor-critic algorithm that learns to reprodu...