UnICORNN:ArecurrentmodelforlearningveryLongtimedependenciesT.KonstantinRusch1SiddharthaMishra1AbstractsuchasinLSTMs(Hochreiter&Schmidhuber,1997)andGRUs(Choetal.,2014),wheretheadditivestructureofThe...
Poolingformer:LongDocumentModelingwithPoolingAttentionHangZhang12YeyunGong3YeLongShen4WeishengLi5JianchengLv1NanDuan3WeizhuChen4Abstract(a)Single-levellocalattention(b)Two-levelpoolingattentionInth...
NeuralRoughDifferentialEquationsforLongTimeSeriesJamesMorrill12CristopherSalvi12PatrickKidger12JamesFoster12Abstractdirectwaytomodifythetrajectorygivensubsequentob-servations.Incontrast,thevectorfi...
LearningtoRehearseinLongSequenceMemorizationZhuZhang12ChangZhou2JianxinMa2ZhijieLin1JingrenZhou2HongxiaYang2ZhouZhao1Abstract2020b),orpredictwhetherauserwillclickthegivenitembasedontheuserbehaviors...
MaximumEntropyGainExplorationforLongHorizonMulti-goalReinforcementLearningSilviuPitis12HarrisChan12StephenZhao1BradlyStadie2JimmyBa12AbstractInthispaper,weimproveuponexistingapproachestointrin-sicg...
AStatisticalInvestigationofLongMemoryinLanguageandMusicAlexanderGreaves-Tunnell1ZaidHarchaoui1Abstractundoubtedlyhelpful,suchheuristicsarerarelydefinedwithrespecttoanunderlyingmathematicalorstatist...
LearningLongTermDependenciesviaFourierRecurrentUnits∗JiongZhang1YiboLin2ZhaoSong3InderjitS.Dhillon4Abstractissuesbecomeparticularlytroublesomeforrecurrentneu-ralnetworks(RNNs)sincetheweightmatrixi...