State Entropy Maximization with Random Encoders for Efficient Exploration. Younggyo Seo¹, Lili Chen², Jinwoo Shin¹, Honglak Lee³⁴, Pieter Abbeel², Kimin Lee². Abstract: ...proaches encourage agents to visit diverse states, but leave unansw...
Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation. Cunxiao Du¹, Zhaopeng Tu², Jing Jiang¹. Abstract: A number of recent efforts have explored ways to improve the NAT models’ ability to handle multimodality. One...
Mixed Cross Entropy Loss for Neural Machine Translation. Haoran Li¹, Wei Lu¹. Abstract: [Figure residue: source sentence "Die Geburtenrate geht weiter zurück."; labels: one-hot ground truth, model prediction] In neural machine translation, cross entropy (CE) is the standa...
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration. Seungyul Han¹, Youngchul Sung¹. Abstract: ...for challenging continuous control tasks. In this paper, sample-aware policy entropy regu...
Multi-objective Bayesian Optimization using Pareto-frontier Entropy. Shinya Suzuki¹, Shion Takeno¹², Tomoyuki Tamura³⁴, Kazuki Shitara⁵⁶, Masayuki Karasuyama¹⁴⁷. Abstract: ...can be formulated as jointly maximizing L unknown functi...
Multi-fidelity Bayesian Optimization with Max-value Entropy Search and its Parallelization. Shion Takeno¹², Hitoshi Fukuoka³, Yuhki Tsukada³⁴, Toshiyuki Koyama³, Motoki Shiga²⁴⁵, Ichiro Takeuchi¹², Masayuki Karasuyama¹⁴⁶. Abstrac...
Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning. Silviu Pitis¹², Harris Chan¹², Stephen Zhao¹, Bradly Stadie², Jimmy Ba¹². Abstract: In this paper, we improve upon existing approaches to intrinsic g...
Entropy Minimization In Emergent Languages. Eugene Kharitonov¹, Rahma Chaabouni¹², Diane Bouchacourt¹, Marco Baroni¹³. Abstract: ...Chevalier-Boisvert et al., 2019). The pursuit might also provide comparative evidence about how core p...
Data preprocessing to mitigate bias: A maximum entropy based approach. L. Elisa Celis¹, Vijay Keswani¹, Nisheeth K. Vishnoi². Abstract: ...to debias data strive to ensure that either 1) the representation of salient social groups in the dat...
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation. Jae Hyun Lim¹², Aaron Courville¹²³⁴, Christopher Pal¹⁵⁴, Chin-Wei Huang¹². Abstract: ...control this quantity as part of the optimization objective. In light of this, we propos...
Aligned Cross Entropy for Non-Autoregressive Machine Translation. Marjan Ghazvininejad¹, Vladimir Karpukhin¹, Luke Zettlemoyer¹, Omer Levy¹. Abstract: [Figure residue: Target Y "it tastes pretty good though"] Non-autoregressive machine translation m...
Understanding the Impact of Entropy on Policy Optimization. Zafarali Ahmed¹², Nicolas Le Roux¹³, Mohammad Norouzi³, Dale Schuurmans³⁴. Abstract: ...lis, 2000; Greensmith et al., 2004; Schulman et al., 2015b; Tucker et al., 2018). Entropy r...
Provably Efficient Maximum Entropy Exploration. Elad Hazan¹², Sham M. Kakade³⁴², Karan Singh¹², Abby Van Soest¹². Abstract: ...such as learning with intrinsic reward and curiosity driven methods, surveyed below. Our work studies a class of...
Fast Incremental von Neumann Graph Entropy Computation: Theory, Algorithm, and Applications. Pin-Yu Chen¹, Lingfei Wu¹, Sijia Liu¹, Indika Rajapakse². Abstract: ...features embedded in graphs. In particular, evaluating similarity bet...
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. Tuomas Haarnoja¹, Aurick Zhou¹, Pieter Abbeel¹, Sergey Levine¹. Abstract: ...networks holds the promise of automating a wide range of deci...
Path Consistency Learning in Tsallis Entropy Regularized MDPs. Ofir Nachum¹, Yinlam Chow², Mohammad Ghavamzadeh². Abstract: ...model is known, the optimal policy is the solution of the non-linear Bellman optimality equations (Bellman, 1...
Max-value Entropy Search for Efficient Bayesian Optimization. Zi Wang¹, Stefanie Jegelka¹. Abstract: Among the most popular ones range the Gaussian process upper confidence bound (GP-UCB) (Auer, 2002; Srinivas [...] Entropy Search (ES) an...