ScalingPropertiesofDeepResidualNetworksAlain–SamCohen1RamaCont2AlainRossier21RenyuanXu2Abstractwhereh(kL)isthehiddenstateatlayerk=0,...,L,h(0L)=x∈Rdtheinput,h(LL)∈Rdtheoutput,σ:R→Risanon-Resid...
LARNet:LieAlgebraResidualNetworkforFaceRecognitionXiaolongYang123XiaohongJia12DihongGong3Dong-MingYan42ZhifengLi3WeiLiu3Abstractmodelsarestrongandrobusttofacerecognitionconductedinunconstrainedenvi...
EvolvingAttentionwithResidualConvolutionsYujingWang1YamingYang2JiangangBai12MingliangZhang12JingBai2JingYu3CeZhang4GaoHuang5YunhaiTong1Abstract8079.6379.1Transformerisaubiquitousmodelfornaturallan-...
TowardsAdaptiveResidualNetworkTraining:ANeural-ODEPerspectiveChengyuDong1LiyuanLiu2ZichaoLi1JingboShang1Abstractisnotnew(Changetal.,2017;Wenetal.,2019),butthedynamicsofgrowingispoorlyunderstood.Inp...
StochasticLatentResidualVideoPredictionJean-YvesFranceschi1EdouardDelasalles1MickaëlChen1SylvainLamprier1PatrickGallinari12AbstractRecurrentNeuralNetworks(RNNs),whereeachgeneratedframeisfedbacktot...
InvertibleResidualNetworksJensBehrmann12WillGrathwohl2RickyT.Q.Chen2DavidDuvenaud2Jo¨rn-HenrikJacobsen2AbstractStandardResNetInvertibleResNetWeshowthatstandardResNetarchitecturescanOutputOutputbem...
DeepResidualOutputLayersforNeuralLanguageGenerationNikolaosPappas1JamesHenderson1Abstractbeddingstocapturethesimilaritystructureoftheoutputlabelspace,sothatdataforsimilarlabelscanhelpclassi-Manytas...
ResidualUnfairnessinFairMachineLearningfromPrejudicedDataNathanKallus1AngelaZhou2Abstractnewquestionsaboutthepossibleharmsoflearningfromdatawhichissubjecttohistoricalbias.Unlikeclean-cutpre-Recentw...
FunctionalGradientBoostingbasedonResidualNetworkPerceptionAtsushiNitanda12TaijiSuzuki12Abstract&Wolf(2016).TheypresentedthatResNetsareensembleofshallowermodelsusinganunraveledviewofResNets.Residual...