ThreeOperatorSplittingwithaNonconvexLossFunctionAlpYurtsever1VarunMangalick1SuvritSra1Abstract2007),incertaintransportandassignmentproblems(Koop-mans&Beckmann,1957;Peyre´etal.,2019),amongcount-Wec...
EMaQ:Expected-MaxQ-LearningOperatorforSimpleYetEffectiveOfflineandOnlineRLSeyedKamyarSeyedGhasemipour12DaleSchuurmans3ShixiangShaneGu3Abstract1.IntroductionOff-policyreinforcementlearning(RL)holdst...
PDO-eConvs:PartialDifferentialOperatorBasedEquivariantConvolutionsZhengyangShen1LingshenHe2ZhouchenLin2JinwenMa1AbstractasignificantadvantageofCNNsisthattheyareshiftequiv-ariant:shiftinganimageandt...
RevisitingtheSoftmaxBellmanOperator:NewBenefitsandNewPerspectiveZhaoSong1RonaldE.Parr1LawrenceCarin1Abstracttivatestheuseofexploratoryandpotentiallysub-optimalactionsduringlearning,andonecommonly-u...
AdaptiveThreeOperatorSplittingFabianPedregosa12GauthierGidel3AbstractthedesiretomodelincreasinglycomplexphenomenahasledtothedevelopmentofaflurryofpenaltieswithcostlyWeproposeandanalyzeanadaptiveste...
OnTheProjectionOperatortoAThree-viewCardinalityConstrainedSetHaichuanYang1ShupengGui1ChuyangKe1DanielStefankovic1RyoheiFujimaki2JiLiu1Abstractwherewistheoptimizationvariable,gisanindexsubsetThecard...
GSOS:Gauss-SeidelOperatorSplittingAlgorithmforMulti-TermNonsmoothConvexCompositeOptimizationLiShen1WeiLiu1GanzhaoYuan2ShiqianMa3Abstractdifferentiableconvexfunctionwithitsgradientsatisfyingtheinequ...
AnAlternativeSoftmaxOperatorforReinforcementLearningKavoshAsadi1MichaelL.Littman1AbstractAnidealsoftmaxOperatorisaparameterizedsetofOperatorsthat:AsoftmaxOperatorappliedtoasetofvaluesactssomewhatli...