Documents related to "Reinforcement" - 文库宝

Documents tagged "Reinforcement": 211 results

Hierarchical Imitation and Reinforcement Learning
Hierarchical Imitation and Reinforcement Learning. Hoang M. Le, Nan Jiang, Alekh Agarwal, Miroslav Dudík, Yisong Yue, Hal Daumé III. Abstract. ficiency in RL over long time horizons is to exploit hierarchical structure of the...
Learning and Reinforcement Hierarchical Imitation
2023-11-13 11:59:43915622.9 KB5
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents. Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, Tamer Başar. Abstract. agent. In addition, the agents are allowed to observe only its own reward, whi...
Learning with Reinforcement Multi-Agent Decentralized
2023-11-13 11:59:371503556.78 KB18
Feedback-Based Tree Search for Reinforcement Learning
Feedback-Based Tree Search for Reinforcement Learning. Daniel R. Jiang, Emmanuel Ekwedike, Han Liu. Abstract. leaf-node evaluators (either a policy function (Chaslot et al., 2006) rollout, a value function evaluation (Campbell...
Learning for Reinforcement Tree Search
2023-11-13 11:59:3511752.63 MB8
End-to-end Active Object Tracking via Reinforcement Learning
End-to-end Active Object Tracking via Reinforcement Learning. Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang. Abstract. We study active object tracking, where a trac...
Learning Active Reinforcement via End-to-End
2023-11-13 11:59:3112711.89 MB7
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation. Dane Corneil, Wulfram Gerstner, Johanni Brea. Abstract. states (e.g. Mnih et al. (2015; 2016)) and learning approximate dynamics to perform p...
Learning Efficient Variational with Reinforcement
2023-11-13 11:59:3014603.11 MB15
Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning. Ronan Fruit, Matteo Pirotta, Alessandro Lazaric, Ronald Ortner. Abstract. and, at each step, it executes the policy with highest optimistic...
Learning Efficient Reinforcement in Exploration-Exploitation
2023-11-13 11:59:301711517.31 KB27
Deep Variational Reinforcement Learning for POMDPs
Deep Variational Reinforcement Learning for POMDPs. Maximilian Igl, Luisa Zintgraf, Tuan Anh Le, Frank Wood, Shimon Whiteson. Abstract. (a) RNN-based approach. The RNN acts as an encoder for the action-observation history, on which...
Learning for Variational Reinforcement Deep
2023-11-13 11:59:2513192 MB2
Deep Reinforcement Learning in Continuous Action Spaces a Case Study in the Game of Simulated Curling
Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling. Kyowoon Lee, Sol-A Kim, Jaesik Choi, Seong-Whan Lee. Abstract. 1992), and othello (Buro, 1999). Recently, deep convolutional ne...
Learning Reinforcement Deep in Spaces
2023-11-13 11:59:258151.25 MB15
Coordinated Exploration in Concurrent Reinforcement Learning
Coordinated Exploration in Concurrent Reinforcement Learning. Maria Dimakopoulou, Benjamin Van Roy. Abstract. and refines estimates as data is gathered. At the start of each episode, the agent samples an MDP from its current poste-W...
Learning Reinforcement in Coordinated Exploration
2023-11-13 11:59:201213850.74 KB7
Continual Reinforcement Learning with Complex Synapses
Continual Reinforcement Learning with Complex Synapses. Christos Kaplanis, Murray Shanahan, Claudia Clopath. Abstract. old memories - a paradox often referred to as the stability-plasticity dilemma (Carpenter & Grossberg, 198...
Learning with Reinforcement Continual Complex
2023-11-13 11:59:1915062.8 MB13
Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations
Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations. Xingyu Wang, Diego Klabjan. Abstract. of the reward function, or at least observations of immediate reward. Some learning tasks, however, p...
Learning with Reinforcement Multi-Agent Inverse
2023-11-13 11:59:16947382.25 KB28
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? Maithra Raghu, Alex Irpan, Jacob Andreas, Robert Kleinberg, Quoc Le, Jon Kleinberg. Abstract. behavior is difficult. Optimal behavior in these environment...
Learning Reinforcement Deep Can Games
2023-11-13 11:59:1215633.21 MB24
Beyond the One-Step Greedy Approach in Reinforcement Learning
Beyond the One-Step Greedy Approach in Reinforcement Learning. Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor. Abstract. suggested that greedy approaches w.r.t. multiple steps perform better than w.r.t. 1-step. Notable...
Reinforcement the in Beyond Approach
2023-11-13 11:59:0911662.4 MB18
Automatic Goal Generation for Reinforcement Learning Agents
Automatic Goal Generation for Reinforcement Learning Agents. Carlos Florensa, David Held, Xinyang Geng, Pieter Abbeel. Abstract. to defeat a champion Go player (Silver et al., 2016), to outperform humans in 49 Atari games (Guo et al....
Learning for Generation Reinforcement Automatic
2023-11-13 11:59:0614896.2 MB1
A Laplacian Framework for Option Discovery in Reinforcement Learning
A Laplacian Framework for Option Discovery in Reinforcement Learning. Marlos C. Machado, Marc G. Bellemare, Michael Bowling. Abstract. the optimal policy for that reward function. In this paper we introduce an algorithm for option di...
for Reinforcement Discovery in Laplacian
2023-11-12 20:45:3317832.5 MB28
A Distributional Perspective on Reinforcement Learning
A Distributional Perspective on Reinforcement Learning. Marc G. Bellemare, Will Dabney, Rémi Munos. Abstract. ment learning. Specifically, the main object of our study is the random return Z whose expectation is the value Q. This In...
Learning Reinforcement on Perspective Distributional
2023-11-12 20:45:339621.13 MB23
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning. Junhyuk Oh, Satinder Singh, Honglak Lee, Pushmeet Kohli. Abstract. Figure 1: Example of 3D world and instructions. The agent is tasked to execute longer se...
with Reinforcement Deep Zero-Shot Multi-task
2023-11-12 20:45:318251.37 MB26
Unifying Task Specification in Reinforcement Learning
Unifying Task Specification in Reinforcement Learning. Martha White. Abstract. jectives, including options (Sutton et al., 1999), state-based discounting (Sutton, 1995; Sutton et al., 2011) and inter- Reinforcement learning tas...
Learning Unifying Reinforcement in Task
2023-11-12 20:45:281532539.43 KB10
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning. Jakob Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson. Abstract. multi-agents...
for Reinforcement Deep Multi-Agent Stabilising
2023-11-12 20:45:1715741.02 MB13
Robust Adversarial Reinforcement Learning
Robust Adversarial Reinforcement Learning. Lerrel Pinto, James Davidson, Rahul Sukthankar, Abhinav Gupta. Abstract. policy-learning methods is their reliance on data: training high-capacity models requires huge amounts of tr...
Learning Adversarial Reinforcement Robust
2023-11-12 20:45:088952.62 MB1
