UnsupervisedLearningofVisual3DKeypointsforControlBoyuanChen1PieterAbbeel1DeepakPathak2AbstractFigure1.Weproposeanend-to-endframeworkforunsupervisedlearningof3Dkeypointsfrommulti-viewimages.Thesekey...
TowardsDistraction-RobustActiveVisualTrackingFangweiZhong1PengSun2WenhanLuo3TingyunYan14YizhouWang1AbstractWhichoneistherealtarget?InactiveVisualtracking,itisnotoriouslydiffi-TemplateVisualObservat...
ScalingUpVisualandVision-LanguageRepresentationLearningWithNoisyTextSupervisionChaoJia1YinfeiYang1YeXia1Yi-TingChen1ZaranaParekh1HieuPham1QuocV.Le1YunhsuanSung1ZhenLi1TomDuerig1Abstract1.Introducti...
LearningTransferableVisualModelsFromNaturalLanguageSupervisionAlecRadford1JongWookKim1ChrisHallacy1AdityaRamesh1GabrielGoh1SandhiniAgarwal1GirishSastry1AmandaAskell1PamelaMishkin1JackClark1Gretchen...
Keyframe-FocusedVisualImitationLearningChuanWen1JieruiLin2JianingQian3YangGao14DineshJayaraman3Abstractthedemonstrationdata.WhileBChaswell-documenteddistributionalshiftissuesduetocompoundingimitati...
ExploreVisualConceptFormationforImageClassificationShengzhouXiong1YihuaTan1GuoyouWang1Abstractorderthatunseensamplesalsofitthismapping.Butforhu-mans,theabilityofclassificationisgainedthroughconcept...
VisualGroundingofLearnedPhysicalModelsYunzhuLi1ToruLin1KexinYi2DanielM.Bear3DanielL.K.Yamins3JiajunWu4JoshuaB.Tenenbaum5AntonioTorralba5Abstractdynamicsofphysicalsystemsappliestonotonlyrigidbod-ies...
Neuro-SymbolicVisualReasoning:Disentangling“Visual”from“Reasoning”SaeedAmizadeh1HamidPalangi2OleksandrPolozov2YichenHuang2KazuhitoKoishida1AbstractRecentadvancesincomputervision,representationl...
HallucinativeTopologicalMemoryforZero-ShotVisualPlanningKaraLiu1ThanardKurutach1ChristineTung1PieterAbbeel1AvivTamar12Abstracttemdynamicsarenotknown,andonlyadatasetofstatetransitionsisavailable.Inp...
DeepIsometricLearningforVisualRecognitionHaozhiQi1ChongYou1XiaolongWang12YiMa1JitendraMalik1Abstract(optional)b=1?(?)skipconnectionb=-1Initialization,normalization,andskipconnectionsb=-3arebeliev...
AngularVisualHardnessBeidiChen1WeiyangLiu2ZhidingYu3JanKautz3AnshumaliShrivastava1AnimeshGarg345AnimaAnandkumar36AbstractPlateRackSharpnessContrastBlurRecentconvolutionalneuralnetworks(CNNs)Dishwas...
ASimpleFrameworkforContrastiveLearningofVisualRepresentationsTingChen1SimonKornblith1MohammadNorouzi1GeoffreyHinton1AbstractFigure1.ImageNetTop-1accuracyoflinearclassifierstrainedonrepresentationsl...
ProbabilisticNeural-symbolicModelsforInterpretableVisualQuestionAnsweringRamakrishnaVedantam1KaranDesai2StefanLee2MarcusRohrbach1DhruvBatra12DeviParikh12AbstractSymbolmanipulation(Newell&Simon,1976...
LatentGNN:LearningEfficientNon-localRelationsforVisualRecognitionSongyangZhang1ShipengYan1XumingHe1Abstractvisioncommunitybylearninghierarchicalfeaturerepresen-tationsofimageswithdeepconvolutionaln...
CounterfactualVisualExplanationsYashGoyal1ZiyanWu2JanErnst2DhruvBatra1DeviParikh1StefanLee1AbstractFigure1.OurapproachgeneratescounterfactualVisualexplana-tionsforaqueryimageI(left)–explainingwhyt...