"Reinforcement"的相关文档 - 文库宝

Documents tagged “Reinforcement”: 211 results

An Optimistic Perspective on Offline Deep Reinforcement Learning
An Optimistic Perspective on Offline Reinforcement Learning Rishabh Agarwal 1 Dale Schuurmans 1 2 Mohammad Norouzi 1 Abstract unsafe, or require a high-fidelity simulator that is often difficult to build (Dulac-Arnold et al., 201...
An Reinforcement on Deep Perspective
2023-11-14 21:43:0610131.06 MB10
Adaptive Reward-Poisoning Attacks against Reinforcement Learning
Adaptive Reward-Poisoning Attacks against Reinforcement Learning Xuezhou Zhang 1 Yuzhe Ma 1 Adish Singla 2 Xiaojin Zhu 1 Abstract group of Twitter users who deliberately taught it misogynistic and racist remarks shortly after its...
Adaptive Learning Reinforcement Attacks against
2023-11-14 21:42:59774703.1 KB7
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation Shani Gamrian 1 Yoav Goldberg 1 2 Abstract process of a new task has to be performed from scratch even for a related one. Recent works have trie...
Learning for Reinforcement via Transfer
2023-11-13 14:48:518451.15 MB15
Trajectory-Based Off-Policy Deep Reinforcement Learning
Trajectory-Based Off-Policy Deep Reinforcement Learning Andreas Doerr 1 2 3 Michael Volpp 1 Marc Toussaint 3 Sebastian Trimpe 2 Christian Daniel 1 Abstract standard algorithms are vastly data-inefficient and rely on millions of da...
Learning Reinforcement Deep Off-Policy Trajectory-Based
2023-11-13 14:48:511404580.76 KB27
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds Andrea Zanette 1 Emma Brunskill 2 Abstract Fortunately in practice reinforcement learning algorithms oft...
Learning Reinforcement in Regret bounds
2023-11-13 14:48:48577493.86 KB20
The Value Function Polytope in Reinforcement Learning
The Value Function Polytope in Reinforcement Learning Robert Dadashi 1 Adrien Ali Taïga 1 2 Nicolas Le Roux 1 Dale Schuurmans 1 3 Marc G. Bellemare 1 Abstract Line theorem. We show that policies that agree on all but one state generate...
Learning Reinforcement the in Value
2023-11-13 14:48:466535.41 MB14
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning Yilun Du 1 Karthik Narasimhan 2 Abstract t t+1 While model-based deep reinforcement learning Figure 1. Two different environments with object dynamics that (RL) holds...
Learning for Reinforcement Deep Dynamics
2023-11-13 14:48:43646757.21 KB23
Statistics and Samples in Distributional Reinforcement Learning
Statistics and Samples in Distributional Reinforcement Learning Mark Rowland 1 Robert Dadashi 2 Saurabh Kumar 2 Rémi Munos 1 Marc G. Bellemare 2 Will Dabney 1 Abstract that DRL algorithms can be viewed as combining a statistical es...
Learning and Reinforcement in Samples
2023-11-13 14:48:3716412.3 MB25
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning Marvin Zhang 1 Sharad Vikram 2 Laura Smith 1 Pieter Abbeel 1 Matthew J. Johnson 3 Sergey Levine 1 Abstract Figure 1. Our method can learn policies for comp...
for Representations Reinforcement Deep Model-Based
2023-11-13 14:48:358663.5 MB2
Reinforcement Learning in Configurable Continuous Environments
Reinforcement Learning in Configurable Continuous Environments Alberto Maria Metelli 1 Emanuele Ghelfi 1 Marcello Restelli 1 Abstract as a Configurable Markov Decision Process (Conf-MDP, Metelli et al., 2018). As in traditional...
Learning Reinforcement in Continuous Configurable
2023-11-13 14:48:231562840.93 KB21
Quantifying Generalization in Reinforcement Learning
Quantifying Generalization in Reinforcement Learning Karl Cobbe 1 Oleg Klimov 1 Chris Hesse 1 Taehoon Kim 1 John Schulman 1 Abstract (Nichol et al., 2018), we seek to better quantify an agent’s ability to generalize. In this paper, we...
Learning Reinforcement in generalization Quantifying
2023-11-13 14:48:1916953.4 MB16
Policy Consolidation for Continual Reinforcement Learning
Policy Consolidation for Continual Reinforcement Learning Christos Kaplanis 1 2 Murray Shanahan 1 3 Claudia Clopath 2 Abstract way that cannot be discretised easily into separate tasks. In reinforcement learning (RL), for example...
Learning for Reinforcement Policy Continual
2023-11-13 14:48:1618589.18 MB10
Policy Certificates: Towards Accountable Reinforcement Learning
Policy Certificates: Towards Accountable Reinforcement Learning Christoph Dann 1 Lihong Li 2 Wei Wei 2 Emma Brunskill 3 Abstract ploration. Even sharp drops in policy performance during learning are common, e.g., when the agent sta...
Learning Reinforcement Policy Towards Certificates
2023-11-13 14:48:151190423.51 KB10
On the Generalization Gap in Reparameterizable Reinforcement Learning
On the Generalization Gap in Reparameterizable Reinforcement Learning Huan Wang 1 Stephan Zheng 1 Caiming Xiong 1 Richard Socher 1 Abstract 2018a). A model that performs well in the training environment, may or may not perform well w...
gap Reinforcement on the in
2023-11-13 14:48:05711341.43 KB20
Off-Policy Deep Reinforcement Learning without Exploration
Off-Policy Deep Reinforcement Learning without Exploration Scott Fujimoto 1 2 David Meger 1 2 Doina Precup 1 2 Abstract require further interactions with the environment to compensate (Hester et al., 2017; Sun et al., 2018; Cheng et...
Learning Reinforcement Deep Off-Policy without
2023-11-13 14:48:01954900.82 KB19
Neural Logic Reinforcement Learning
Neural Logic Reinforcement Learning Zhengyao Jiang 1 Shan Luo 1 Abstract (Doshi-Velez & Kim, 2017) defines interpretability as the ability to explain or to present the decision in understand- Deep reinforcement learning (DRL) has a...
Learning Neural Reinforcement Logic
2023-11-13 14:47:571986314.41 KB27
Multi-Agent Adversarial Inverse Reinforcement Learning
Multi-Agent Adversarial Inverse Reinforcement Learning Lantao Yu 1 Jiaming Song 1 Stefano Ermon 1 Abstract ever, the success of RL crucially depends on careful reward design (Amodei et al., 2016). As Reinforcement learning Reinfor...
Learning Adversarial Reinforcement Multi-Agent Inverse
2023-11-13 14:47:541134523.81 KB28
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning Rui Zhao 1 2 Xudong Sun 1 Volker Tresp 1 2 Abstract One of the biggest challenges in RL is to make the agent learn efficiently in applications with sparse rewards. To In Mul...
Learning Reinforcement Maximum Multi-Goal Entropy-Regularized
2023-11-13 14:47:496143.42 MB12
Learning Action Representations for Reinforcement Learning
Learning Action Representations for Reinforcement Learning Yash Chandak 1 Georgios Theocharous 2 James E. Kostas 1 Scott M. Jordan 1 Philip S. Thomas 1 Abstract Figure 1. The structure of the proposed overall policy, πo, consistin...
Learning for Representations Reinforcement Action
2023-11-13 14:47:361281815.65 KB20
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning Kelvin Xu 1 Ellis Ratner 1 Anca Dragan 1 Sergey Levine 1 Chelsea Finn 1 Abstract Figure 1. A diagram of our meta-inverse RL approach. Our approach attempts to remedy...
Learning Reinforcement via Prior over
2023-11-13 14:47:3516294.77 MB21
