Principled Exploration via Optimistic Bootstrapping and Backward Induction. Chenjia Bai¹, Lingxiao Wang², Lei Han³, Jianye Hao⁴, Animesh Garg⁵, Peng Liu¹, Zhaoran Wang². Abstract: 2007; Jin et al., 2018) is a principled approach for effici...
Dichotomous Optimistic Search to Quantify Human Perception. Julien Audiffren¹. Abstract: the observer. In particular, many experiments are interested in measuring the sensitivity threshold, where the stimulus is In this paper we ad...
Bayesian Optimistic Optimisation with Exponentially Decaying Regret. Hung Tran-The¹, Sunil Gupta¹, Santu Rana¹, Svetha Venkatesh¹. Abstract: transform a global optimisation problem into a sequence of auxiliary optimisation problem...
Robust Bayesian Classification Using an Optimistic Score Ratio. Viet Anh Nguyen¹, Nian Si¹, Jose Blanchet¹. Abstract: loss function can be easily reached, the choice of a class prior and a class-conditional distribution (i.e., the likel...
Optimistic Policy Optimization with Bandit Feedback. Yonathan Efroni¹, Lior Shani¹, Aviv Rosenberg², Shie Mannor¹. Abstract: Due to their popularity, there is a rich literature that provides different types of theoretical guarantees...
Optimistic Bounds for Multi-output Prediction. Henry W. J. Reeve¹, Ata Kabán¹. Abstract: of the output space. Whilst modern applications of multi-output prediction deal with increasingly large datasets, they We investigate the cha...
Improved Optimistic Algorithms for Logistic Bandits. Louis Faury¹,², Marc Abeille¹, Clément Calauzènes¹, Olivier Fercoq². Abstract: et al. (2017) and references therein), its practical interest is limited by the linear structure of...
Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation. Marc Abeille¹, Alessandro Lazaric². Abstract: Confidence-based exploration. Bittanti et al. (2006) introduced an adaptive control sys...
An Optimistic Perspective on Offline Reinforcement Learning. Rishabh Agarwal¹, Dale Schuurmans¹,², Mohammad Norouzi¹. Abstract: unsafe, or require a high-fidelity simulator that is often difficult to build (Dulac-Arnold et al., 201...
Stable-Predictive Optimistic Counterfactual Regret Minimization. Gabriele Farina¹, Christian Kroer², Noam Brown¹, Tuomas Sandholm¹,³,⁴,⁵. Abstract: were used as an essential ingredient for all recent milestones in the benchmark domai...
Optimistic Policy Optimization via Multiple Importance Sampling. Matteo Papini¹, Alberto Maria Metelli¹, Lorenzo Lupo¹, Marcello Restelli¹. Abstract: peholt et al., 2018). This is well motivated, as interacting with some environment...