Not All Memories are Created Equal: Learning to Forget by Expiring. Sainbayar Sukhbaatar 1, Da Ju 1, Spencer Poff 1, Stephen Roller 1, Arthur Szlam 1, Jason Weston 1, Angela Fan 1 2. Abstract: Sukhbaatar et al., 2019a). However, a critical compone...
Finite mixture models do not reliably learn the number of components. Diana Cai 1, Trevor Campbell 2, Tamara Broderick 3. Abstract: hakaran et al., 2016), microscopy groups (Rubin-Delanchy et al., 2015; Griffié et al., 2016), haplotype...
Attention is not all you need: pure attention loses rank doubly exponentially with depth. Yihe Dong 1, Jean-Baptiste Cordonnier 2, Andreas Loukas 3. Abstract: attention layers. Surprisingly, we find that pure self-attention networks (S...
“Hey, that’s not an ODE”: Faster ODE Adjoints via Seminorms. Patrick Kidger 1 2, Ricky T. Q. Chen 3 4, Terry Lyons 1 2. Abstract: are composed such that Neural differential equations may be trained z(τ) = 1(x, φ), by backpropagating gradi...
Knowing The What But Not The Where in Bayesian Optimization. Vu Nguyen 1, Michael A Osborne 1. Abstract: Bayesian optimization finds the global maximizer $x^* = \arg\max_{x \in X} f(x)$ of the black-box function $f$ by incorporat- Bayesian optimizat...
It’s Not What Machines Can Learn, It’s What We Cannot Teach. Gal Yehuda 1, Moshe Gabel 2, Assaf Schuster 1. Abstract: dataset generation process complexity of resulting task Can deep neural networks learn to solve any task, harder and in pa...
Attacks Which Do Not Kill Training Make Adversarial Learning Stronger. Jingfeng Zhang 1 †, Xilie Xu 2, Bo Han 3 4, Gang Niu 4, Lizhen Cui 5, Masashi Sugiyama 4 6, Mohan Kankanhalli 1. Abstract: sitates the need for deep neural networks (DNNs) to be a...
Tighter Variational Bounds are Not Necessarily Better. Tom Rainforth 1, Adam R. Kosiorek 1 2, Tuan Anh Le 2, Chris J. Maddison 1, Maximilian Igl 2, Frank Wood 3, Yee Whye Teh 1. Abstract: & Kamp, 1988; Hinton & Zemel, 1994; Gregor et al., 2016; Chen et...
Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care. Patrick Schwab 1, Emanuela Keller 2, Carl Muroi 2, David J. Mack 2, Christian Strässle 2, Walter Karlen 1. Abstract: Intracranial Pressure Alarm Relevant Arterial B...
Not All Samples Are Created Equal: Deep Learning with Importance Sampling. Angelos Katharopoulos 1 2, François Fleuret 1 2. Abstract: model. To this end, we propose a novel importance sampling scheme that accelerates the training of an...