Towards Open-World Recommendation: An Inductive Model-Based Collaborative Filtering Approach. Qitian Wu 1 2, Hengrui Zhang 1, Xiaofeng Gao 1 2, Junchi Yan 1 2, Hongyuan Zha 3. Abstract (MC) where one has a user-item rating matrix whose entr...
Temporal Predictive Coding For Model-Based Planning In Latent Space. Tung Nguyen 1, Rui Shu 2, Tuan Pham 1, Hung Bui 1, Stefano Ermon 2. Abstract ever, it is well known that performing RL directly in the high-dimensional observation space is s...
RNNRepair: Automatic RNN Repair via Model-Based Analysis. Xiaofei Xie 1 2, Wenbo Guo 3, Lei Ma 4 5 2, Wei Le 6, Jian Wang 1, Lingjun Zhou 7, Xinyu Xing 3, Yang Liu 1. Abstract the wrong prediction (Koh & Liang, 2017). Once the root cause is identified, u...
PC-MLP: Model-Based Reinforcement Learning with Policy Cover Guided Exploration. Yuda Song 1, Wen Sun 2. Abstract [Figure residue: axis "success rate", panel "Hand Egg", legend "Deep PC-MPL"] Model-Based Reinforcement Learning (RL) is a popular learning parad...
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain. David Bruns-Smith 1. Abstract unobserved shocks are often assumed to be drawn iid every period. Consider the Federal Reserve Board adjusting When decision-...
Model-Based Reinforcement Learning via Latent-Space Collocation. Oleh Rybkin 1, Chuning Zhu 1, Anusha Nagabandi 2, Kostas Daniilidis 1, Igor Mordatch 3, Sergey Levine 4. Abstract [Figure label: LatCo optimization over latent states] The ability to plan i...
Model-Based Reinforcement Learning for Continuous Control with Posterior Sampling. Ying Fan 1, Yifei Ming 1. Abstract in RL has been one of the main challenges: the agent is expected to balance between exploring unseen state-action Bal...
Continuous-Time Model-Based Reinforcement Learning. Çağatay Yıldız 1, Markus Heinonen 1, Harri Lähdesmäki 1. Abstract Figure 1: A comparison of true solution of the CartPole system against discrete and continuous-time traje...
Conservative Objective Models for Effective Offline Model-Based Optimization. Brandon Trabucco 1, Aviral Kumar 1, Xinyang Geng 1, Sergey Levine 1. Abstract [Figure residue: axis label "Loss", legend "Learned"] In this paper, we aim to solve data-driven model-base...
A Sharp Analysis of Model-Based Reinforcement Learning with Self-Play. Qinghua Liu 1, Tiancheng Yu 2, Yu Bai 3, Chi Jin 1. Abstract 1. Introduction Model-Based algorithms - algorithms that explore This paper is concerned with the problem...
Provably Efficient Model-Based Policy Adaptation. Yuda Song 1, Aditi Mavalankar 1, Wen Sun 2, Sicun Gao 1. Abstract Mordatch et al., 2015), or meta-learn policies or models that can be quickly adapted to in-distribution environments (Fin...
On Breaking Deep Generative Model-Based Defenses and Beyond. Yanzhi Chen 1, Renjie Xie 2, Zhanxing Zhu 3. Abstract 2018), feature denoising (Liao et al., 2018; Xie et al., 2019), randomized smoothing (Salman et al., 2019; Cohen et al., Dee...
Model-Based Reinforcement Learning with Value-Targeted Regression. Alex Ayoub 1, Zeyu Jia 2, Csaba Szepesvári 1 3, Mengdi Wang 4 3, Lin F. Yang 5. Abstract mains, such as games, robotics and science, has witnessed phenomenal empirical ad...
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning. Kimin Lee 1, Younggyo Seo 2, Seunghyun Lee 2, Honglak Lee 3 4, Jinwoo Shin 2. Abstract et al., 2019; Hafner et al., 2019; 2020; Kaiser et al., 2020). Mode...
Bidirectional Model-Based Policy Optimization. Hang Lai 1, Jian Shen 1, Weinan Zhang 1, Yong Yu 1. Abstract behind their model-free counterparts due to model error, which is especially severe for multi-step rollout because of Model-base...
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning. Marvin Zhang 1, Sharad Vikram 2, Laura Smith 1, Pieter Abbeel 1, Matthew J. Johnson 3, Sergey Levine 1. Abstract Figure 1. Our method can learn policies for comp...
Model-Based Active Exploration. Pranav Shyam 1, Wojciech Jaśkowski 1, Faustino Gomez 1. Abstract This approach is inherently more powerful than reactive exploration, but requires a method to predict the consequences Efficient exp...
Calibrated Model-Based Deep Reinforcement Learning. Ali Malik 1, Volodymyr Kuleshov 1 2, Jiaming Song 1, Danny Nemer 2, Harlan Seymour 2, Stefano Ermon 1. Abstract Figure 1. Modern Model-Based planning algorithms with probabilistic mode...
PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos. Paavo Parmas 1, Carl Edward Rasmussen 2, Jan Peters 3 4, Kenji Doya 1. Abstract [Figure residue: axis labels "Velocity", "Position"] Previously, the exploding gradient problem has been explain...
Lipschitz Continuity in Model-Based Reinforcement Learning. Kavosh Asadi 1, Dipendra Misra 2, Michael L. Littman 1. Abstract introduce a novel characterization of models, referred to as a Lipschitz model class, that represents stocha...