TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL. Clément Romac¹, Rémy Portelas¹, Katja Hofmann², Pierre-Yves Oudeyer¹. Abstract: agent is unlikely to be optimal for complex learning problems. Building upon...
WILDS: A Benchmark of in-the-Wild Distribution Shifts. Pang Wei Koh¹, Shiori Sagawa¹, Henrik Marklund¹, Sang Michael Xie¹, Marvin Zhang², Akshay Balsubramani¹, Weihua Hu¹, Michihiro Yasunaga¹, Richard Lanas Phillips³, Irena Gao¹, Tony Lee¹...
CURI: A Benchmark for Productive Concept Learning under Uncertainty. Ramakrishna Vedantam¹, Arthur Szlam¹, Maximilian Nickel¹, Ari Morcos¹, Brenden Lake¹,². Abstract: to take from one’s apartment in a fire” (children, dogs, keep-sake...
AGENT: A Benchmark for Core Psychological Reasoning. Tianmin Shu¹, Abhishek Bhandwaldar², Chuang Gan², Kevin A. Smith¹, Shari Liu¹, Dan Gutfreund², Elizabeth Spelke³, Joshua B. Tenenbaum¹, Tomer D. Ullman³. Abstract: Goal objects, Obstacles...
A large-scale Benchmark for few-shot program induction and synthesis. Ferran Alet¹, Javier Lopez-Contreras¹, James Koppel¹, Maxwell Nye¹, Armando Solar-Lezama¹, Tomás Lozano-Pérez¹, Leslie Pack Kaelbling¹, Joshua B. Tenenbaum¹. Abs...
Leveraging Procedural Generation to Benchmark Reinforcement Learning. Karl Cobbe¹, Christopher Hesse¹, Jacob Hilton¹, John Schulman¹. Abstract: or are they approximately memorizing specific trajectories? We introduce Procgen Benc...
Agent57: Outperforming the Atari Human Benchmark. Adrià Puigdomènech Badia¹, Bilal Piot¹, Steven Kapturowski¹, Pablo Sprechmann¹, Alex Vitvitskyi¹, Daniel Guo¹, Charles Blundell¹. Abstract: Figure 1. Number of games where algorithms...