Finding Options that Minimize Planning Time. Yuu Jinnai¹, David Abel¹, D Ellis Hershkowitz², Michael L. Littman¹, George Konidaris¹. Abstract Barto, 2009; Bacon, 2013; Moradi et al., 2012), finding repeated policy fragments (Pickett &...
Correlated bandits or: How to minimize mean-squared error online. Vinay Praneeth Boda¹, Prashanth L.A.². Abstract bandit problem, this objective involves an estimation of the correlation structure among the various arms. This is mo-...