Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer. Seungwon Lee 1, Sima Behpour 2, Eric Eaton 1. Abstract & Hospedales, 2017; Lee et al., 2019; Liu et al., 2019b; Bulat et al., 2020), such as sharing the lo...
When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC. Zhiyong Yang 1 2, Qianqian Xu 3, Shilong Bao 1 2, Yuan He 4, Xiaochun Cao 1 2, Qingming Huang 3 5 6 7. Abstract framework. The Area Under the ROC Curve (AUC) i...
Poisson-Randomised DirBN: Large mutation is needed in Dirichlet belief networks. Xuhui Fan 1, Bin Li 2, Yaqiong Li 3, Scott A. Sisson 1. Abstract 1. Introduction The Dirichlet Belief Network (DirBN) was re- The Dirichlet Belief Network (Di...
Newton Method over Networks is Fast up to the Statistical Precision. Amir Daneshmand 1, Gesualdo Scutari 1, Pavel Dvurechensky 2 3, Alexander Gasnikov 4 3. Abstract where : R^d × Z → R is the loss function, assumed to be (strongly) convex in x,...
Maximum Mean Discrepancy Test is Aware of Adversarial Attacks. Ruize Gao 1 2, Feng Liu 3, Jingfeng Zhang 4, Bo Han 1, Tongliang Liu 5, Gang Niu 4, Masashi Sugiyama 4 6. Abstract sup in Eq. (1), Gretton et al. (2012b) restricted F to be a unit ball in the...
Lottery Ticket Preserves Weight Correlation: Is it Desirable or Not? Ning Liu 1, Geng Yuan 2, Zhengping Che 3, Xuan Shen 2, Xiaolong Ma 2, Qing Jin 2, Jian Ren 4, Jian Tang 1 †, Sijia Liu 5, Yanzhi Wang 2 †. Abstract typical pruning pipeline has three m...
Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks. Avi Schwarzschild 1, Micah Goldblum 2, Arjun Gupta 3, John P. Dickerson 2, Tom Goldstein 2. Abstract At this scale, it is often infeasible to proper...
Is Space-Time Attention All You Need for Video Understanding? Gedas Bertasius 1, Heng Wang 1, Lorenzo Torresani 1 2. Abstract Video understanding shares several high-level similarities with NLP. First of all, videos and sentences are b...
Is Pessimism Provably Efficient for Offline RL? Ying Jin 1, Zhuoran Yang 2, Zhaoran Wang 3. Abstract Vinyals et al., 2017) relies on two ingredients: (i) expressive function approximators, e.g., deep neural networks (LeCun We study offl...
How Important is the Train-Validation Split in Meta-Learning? Yu Bai 1, Minshuo Chen 2, Pan Zhou 1, Tuo Zhao 2, Jason D. Lee 3, Sham Kakade 4, Huan Wang 1, Caiming Xiong 1. Abstract 1. Introduction Meta-learning aims to perform fast adaptation Met...
Efficient Lottery Ticket Finding: Less Data is More. Zhenyu Zhang 1, Xuxi Chen 2, Tianlong Chen 2, Zhangyang Wang 2. [Figure: panels CIFAR-10, CIFAR-100] Abstract The lottery ticket hypothesis (LTH) (Frankle & ...
Attention is not all you need: pure attention loses rank doubly exponentially with depth. Yihe Dong 1, Jean-Baptiste Cordonnier 2, Andreas Loukas 3. Abstract attention layers. Surprisingly, we find that pure self-attention networks (S...
Strategic Classification is Causal Modeling in Disguise. John Miller 1, Smitha Milli 1, Moritz Hardt 1. Abstract that explicitly incentive improvement on some target measure (Kleinberg & Raghavan, 2019; Alon et al., 2020; Kha- Consequ...
Regularized Optimal Transport is Ground Cost Adversarial. François-Pierre Paty 1, Marco Cuturi 2 1. Abstract and allowing for faster solvers, but also add some stability with respect to the input measures, improving numerical Regula...
Proving the Lottery Ticket Hypothesis: Pruning is All You Need. Eran Malach 1, Gilad Yehudai 2, Shai Shalev-Shwartz 1, Ohad Shamir 2. Abstract without any training. (Ramanujan et al., 2019) stated the following conjecture: a sufficientl...
Optimal Sequential Maximization: One Interview is Enough! Moein Falahatgar 1, Alon Orlitsky 2, Venkatadheeraj Pichapati 1. Abstract returns of all stocks on a given day; and partial knowledge, where the learner observes the outcomes o...
Naive Exploration is Optimal for Online LQR. Max Simchowitz 1, Dylan J. Foster 2. Abstract develop a non-asymptotic theory of data-driven continuous control, with an emphasis on understanding key algorithmic We consider the problem of...
Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing. Sanghamitra Dutta 1 2, Dennis Wei 1, Hazar Yueksel 1, Pin-Yu Chen 1, Sijia Liu 1, Kush R. Varshney 1. Abstract 2012; Agarwal et al., 2018; Ha...
How Good is the Bayes Posterior in Deep Neural Networks Really? Florian Wenzel 1, Kevin Roth +2, Bastiaan S. Veeling +3 1, Jakub Świątkowski 4+, Linh Tran 5+, Stephan Mandt 6+, Jasper Snoek 1, Tim Salimans 1, Rodolphe Jenatton 1, Sebastian Now...