Soft then Hard: Rethinking the Quantization in Neural Image Compression. Zongyu Guo¹, Zhizheng Zhang¹, Runsen Feng¹, Zhibo Chen¹. Abstract. Quantization is one of the key challenges for neural image compression. Since the gradient of qua...
Quantization Algorithms for Random Fourier Features. Xiaoyun Li, Ping Li. Cognitive Computing Lab, Baidu Research, 10900 NE 8th St, Bellevue, WA 98004, USA. {lixiaoyun996,pingli98}@gmail.com. Abstract. 1. Introduction. The method of rando...
I-BERT: Integer-only BERT Quantization. Sehoon Kim¹, Amir Gholami¹, Zhewei Yao¹, Michael W. Mahoney¹, Kurt Keutzer¹. Abstract. 2019), and the GPT family (Brown et al., 2020; Radford et al., 2018; 2019)), have achieved a significant accura...
HAWQ-V3: Dyadic Neural Network Quantization. Zhewei Yao¹, Zhen Dong¹, Zhangcheng Zheng¹, Amir Gholami¹, Jiali Yu²,³, Eric Tan¹, Leyuan Wang², Qijing Huang¹, Yida Wang², Michael W. Mahoney¹, Kurt Keutzer¹. Abstract. 1. Introduction. Current low-...
Estimation and Quantization of Expected Persistence Diagrams. Vincent Divol¹, Théo Lacombe¹. Abstract. multi-scale fashion. Relying on persistent homology theory (Edelsbrunner et al., 2000; Zomorodian & Carlsson, 2005; Persiste...
Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution. Zhang Zhaoyang¹, Shao Wenqi¹, Gu Jinwei²,³, Wang Xiaogang¹, Luo Ping⁴. Abstract. dynamic range and step size are frozen in each layer. Secondly, gradien...
Accurate Post Training Quantization With Small Calibration Sets. Itay Hubara¹,², Yury Nahshan¹, Yair Hanani¹, Ron Banner¹, Daniel Soudry². Abstract. Lately, post-training Quantization methods have ... 1. Introduction. The pursuit of advanc...
Towards Accurate Post-training Network Quantization via Bit-Split and Stitching. Peisong Wang¹, Qiang Chen¹, Xiangyu He¹, Jian Cheng¹. Abstract. hardware like TPU or general hardware like CPU and GPU. By turning the floating-point valu...
Online Learned Continual Compression with Adaptive Quantization Modules. Lucas Caccia¹,²,³, Eugene Belilovsky⁴,², Massimo Caccia⁴,²,⁵, Joelle Pineau¹,²,³. Abstract. et al., 2017; Ballé et al., 2016; Johnston et al., 2018). Yet its applica...
Feature Quantization Improves GAN Training. Yang Zhao¹, Chunyuan Li², Ping Yu¹, Jianfeng Gao², Changyou Chen¹. Abstract. Training GANs is a notoriously challenging task, as it involves optimizing a non-convex problem for its Nash equi-T...
Differentiable Product Quantization for End-to-End Embedding Compression. Ting Chen¹, Lala Li¹, Yizhou Sun². Abstract. et al., 2013) and recommender systems (Koren et al., 2009), where the vocabulary sizes are even larger. Embedding l...
Accelerating Large-Scale Inference with Anisotropic Vector Quantization. Ruiqi Guo¹, Philip Sun¹, Erik Lindgren¹, Quan Geng¹, David Simcha¹, Felix Chern¹, Sanjiv Kumar¹. Abstract. (MIPS) problem, consider a database $X = \{x_i\}_{i=1,2,\ldots,n}$...
Same, Same But Different: Recovering Neural Network Quantization Error Through Weight Factorization. Meller Eldad¹, Finkelstein Alexander¹, Almog Uri¹, Grobman Mark¹. Abstract. mance. Here, we follow the commonly used Quantizations...
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting. Ritchie Zhao¹, Yuwei Hu¹, Jordan Dotzel¹, Christopher De Sa¹, Zhiru Zhang¹. Abstract. to reducing the costs of DNN execution is to quantize the flo...