SWALP: Stochastic Weight Averaging in Low-Precision Training

Guandao Yang¹, Tianyi Zhang¹, Polina Kirichenko¹, Junwen Bai¹, Andrew Gordon Wilson¹, Christopher De Sa¹

Abstract

and accumulate gradient information in higher precision...