IncentivizedBanditLearningwithSelf-ReinforcingUserPreferencesTianchenZhou1JiaLiu1ChaoshengDong2JingyuanDeng2Abstractaccumulatesmorepositivefeedbacks.Forexample,onamovierentalwebsite,currentcustomer...
Multi-TaskLearningwithUserPreferences:GradientDescentwithControlledAscentinParetoOptimizationDebabrataMahapatra1VaibhavRajan2Abstract2019a),naturallanguageprocessing(Liuetal.,2019b)andbioinformatic...
ChoiceRank:IdentifyingPreferencesfromNodeTrafficinNetworksLucasMaystre1MatthiasGrossglauser1Abstractitgetsfrompageslinkingtoit).BuildinguponrecentworkbyKumaretal.(2015),wepresentastatisticalframewo...