POLITEX:RegretBoundsforPolicyIterationUsingExpertPredictionYasinAbbasi-Yadkori1PeterL.Bartlett2KushBhatia2NevenaLazic´3CsabaSzepesvári4GellértWeisz4Abstractmodel-basedalgorithms,andtheoreticalev...