AdaptiveNewtonSketch:Linear-TimeOptimizationwithQuadraticConvergenceandEffectiveHessianDimensionalityJonathanLacotte1YifeiWang1MertPilanci1Abstract1.IntroductionWeproposearandomizedalgorithmwithWec...
OnlineandLinear-TimeAttentionbyEnforcingMonotonicAlignmentsColinRaffel1Minh-ThangLuong1PeterJ.Liu1RonJ.Weiss1DouglasEck1Abstractmechanisms(Bahdanauetal.,2015).Inasequence-to-sequencemodelwithattent...