PreferentialTemporalDifferenceLearningNishanthAnand12DoinaPrecup123AbstractTD-learningcanbeviewedasawaytoapproximatedy-namicprogrammingalgorithmsinMarkovianenviron-Temporal-Difference(TD)learningis...
TheBuckley-OsthusmodelandtheblockPreferentialattachmentmodel:statisticalanalysisandapplicationXinGuo1FengminTang2WenpinTang1Abstractsons.Thispaperisconcernedwithstatisticalestima-•TheErdo¨s-Re´n...
ProjectivePreferentialBayesianOptimizationPetrusMikkola1MilicaTodorovic´2JariJa¨rvi2PatrickRinke2SamuelKaski13AbstractFigure1.AnillustrationofaprojectivePreferentialqueryonmolecularproperties:inw...
PreferentialBayesianOptimizationJavierGonza´lez1ZhenwenDai1AndreasDamianou1NeilD.Lawrence12AbstractX×Xfromwhichweobtainbinaryfeedback{0,1}thatrepresentswhetherornotxispreferredoverx(haslowerBayes...