DoWeNeedZeroTrainingLossafterAchievingZeroTrainingError?TakashiIshida12IkkoYamane1TomoyaSakai3GangNiu2MasashiSugiyama21Abstract(a)w/oFlooding(b)w/FloodingOverparameterizeddeepnetworkshavethecapac-[...
RandomShufflingBeatsSGDafterFiniteEpochsJeffHaoChen1SuvritSra2Abstract1.IntroductionAlong-standingprobleminoptimizationisWefocusonminimizationofthefinite-sumprovingthatRANDOMSHUFFLE,thewithout-repl...