Dead-endsandSecureExplorationinReinforcementLearningMehdiFatemi1ShikharSharma1HarmvanSeijen1SamiraEbrahimiKahou2Abstracthastointeractwiththeenvironmentandlearnfromitsexpe-rience.Therealwaysexistsa(...