CompetitiveMulti-agentInverseReinforcementLearningwithSub-optimalDemonstrationsXingyuWang1DiegoKlabjan1Abstractoftherewardfunction,oratleastobservationsofimmediatereward.Somelearningtasks,however,p...