"is"的相关文档

标签“is”的相关文档，共29条

Sharing Less is More Lifelong Learning in Deep Networks with Selective Layer Transfer
SharingLessisMore:LifelongLearninginDeepNetworkswithSelectiveLayerTransferSeungwonLee1SimaBehpour2EricEaton1Abstract&Hospedales,2017;Leeetal.,2019;Liuetal.,2019b;Bulatetal.,2020),suchassharingthelo...
Learning in is Sharing More
2023-11-16 19:41:461046709.1 KB19
下载文档
When All We Need is a Piece of the Pie A Generic Framework for Optimizing Two-way Partial AUC
WhenAllWeNeedisaPieceofthePie:AGenericFrameworkforOptimizingTwo-wayPartialAUCZhiyongYang12QianqianXu3ShilongBao12YuanHe4XiaochunCao12QingmingHuang3567Abstractframework.TheAreaUndertheROCCurve(AUC)i...
of is When All We
2023-11-16 19:41:277931.76 MB27
下载文档
Poisson-Randomised DirBN Large Mutation is Needed in Dirichlet Belief Networks
Poisson-RandomisedDirBN:LargemutationisneededinDirichletbeliefnetworksXuhuiFan1BinLi2YaqiongLi3ScottA.Sisson1Abstract1.IntroductionTheDirichletBeliefNetwork(DirBN)wasre-TheDirichletBeliefNetwork(Di...
in is Large Poisson-Randomised DirBN
2023-11-16 19:28:29692738.38 KB6
下载文档
Newton Method over Networks is Fast up to the Statistical Precision
NewtonMethodoverNetworksisFastuptotheStatisticalPrecisionAmirDaneshmand1GesualdoScutari1PavelDvurechensky23AlexanderGasnikov43Abstractwhere:Rd×Z→Risthelossfunction,assumedtobe(strongly)convexinx,...
Networks Newton Fast Method is
2023-11-16 19:15:37618486.91 KB27
下载文档
Maximum Mean Discrepancy Test is Aware of Adversarial Attacks
MaximumMeanDiscrepancyTestisAwareofAdversarialAttacksRuizeGao12FengLiu3JingfengZhang4BoHan1TongliangLiu5GangNiu4Masashisugiyama46AbstractsupinEq.(1),Grettonetal.(2012b)restrictedFtobeaunitballinthe...
of Test Mean is Maximum
2023-11-16 19:05:1515843.64 MB15
下载文档
Lottery Ticket Preserves Weight Correlation is It Desirable or Not
LotteryTicketPreservesWeightCorrelation:isitDesirableorNot?NingLiu1GengYuan2ZhengpingChe3XuanShen2XiaolongMa2QingJin2JianRen4JianTang1†SijiaLiu5YanzhiWang2†Abstracttypicalpruningpipelinehasthreem...
IT is Weight Correlation Lottery
2023-11-16 19:05:1110256.23 MB14
下载文档
Just How Toxic is Data Poisoning A Unified Benchmark for Backdoor and Data Poisoning Attacks
JustHowToxicisDataPoisoning?AUniﬁedBenchmarkforBackdoorandDataPoisoningAttacksAvischwarzschild1MicahGoldblum2ArjunGupta3JohnP.Dickerson2TomGoldstein2AbstractAtthisscale,itisofteninfeasibletoproper...
Data How is Just Unified
2023-11-16 18:47:051475528.98 KB19
下载文档
is Space-Time Attention All You Need for Video Understanding
isSpace-TimeAttentionAllYouNeedforVideoUnderstanding?GedasBertasius1HengWang1LorenzoTorresani12AbstractVideounderstandingsharesseveralhigh-levelsimilaritieswithNLP.Firstofall,videosandsentencesareb...
for Attention is All Need
2023-11-16 18:47:0515338.17 MB11
下载文档
is Pessimism Provably Efficient for Offline RL
isPessimismProvablyEfﬁcientforOfﬂineRL?YingJin1ZhuoranYang2ZhaoranWang3AbstractVinyalsetal.,2017)reliesontwoingredients:(i)expressivefunctionapproximators,e.g.,deepneuralnetworks(LeCunWestudyofﬂ...
for Efficient Provably is RL
2023-11-16 18:47:051601887.78 KB12
下载文档
How Important is the Train-Validation Split in Meta-Learning
HowImportantistheTrain-ValidationSplitinMeta-Learning?YuBai1MinshuoChen2PanZhou1TuoZhao2JasonD.Lee3ShamKakade4HuanWang1CaimingXiong1Abstract1.IntroductionMeta-learningaimstoperformfastadaptationMet...
the in How is Important
2023-11-16 18:46:571374780.83 KB28
下载文档
Efficient Lottery Ticket Finding Less Data is More
EfﬁcientLotteryTicketFinding:LessDataisMoreZhenyuZhang1XuxiChen2TianlongChen2ZhangyangWang293CIFAR-1072CIFAR-1007045000Abstract924500024457684082491Thelotterytickethypothesis(LTH)(Frankle&90171203...
Efficient Data is Finding less
2023-11-16 18:37:5819194.42 MB27
下载文档
Attention is not all you need pure attention loses rank doubly exponentially with depth
Attentionisnotallyouneed:pureattentionlosesrankdoublyexponentiallywithdepthYiheDong1Jean-BaptisteCordonnier2AndreasLoukas3Abstractattentionlayers.Surprisingly,weﬁndthatpureself-attentionnetworks(S...
Attention is All Not Need
2023-11-16 18:00:189421.2 MB21
下载文档
Strategic Classification is Causal Modeling in Disguise
StrategicClassiﬁcationisCausalModelinginDisguiseJohnMiller1SmithaMilli1MoritzHardt1Abstractthatexplicitlyincentiveimprovementonsometargetmea-sure(Kleinberg&Raghavan,2019;Alonetal.,2020;Kha-Consequ...
in Classification Causal Modeling is
2023-11-14 21:46:34776233.26 KB27
下载文档
Regularized Optimal Transport is Ground Cost Adversarial
RegularizedOptimalTransportisGroundCostAdversarialFrançois-PierrePaty1MarcoCuturi21Abstractandallowingforfastersolvers,butalsoaddsomestabilitywithrespecttotheinputmeasures,improvingnumericalRegula...
Adversarial Optimal Transport is Regularized
2023-11-14 21:46:0610062.36 MB25
下载文档
Proving the Lottery Ticket Hypothesis Pruning is All You Need
ProvingtheLotteryTicketHypothesis:PruningisAllYouNeedEranMalach1GiladYehudai2Shaishalev-shwartz1OhadShamir2Abstractwithoutanytraining.(Ramanujanetal.,2019)statedthefol-lowingconjecture:asufﬁcientl...
Hypothesis the is Pruning Proving
2023-11-14 21:46:001694316.18 KB6
下载文档
Optimal Sequential Maximization One Interview is Enough!
OptimalSequentialMaximizationOneInterviewisEnough!MoeinFalahatgar1AlonOrlitsky2VenkatadheerajPichapati1Abstractreturnsofallstocksonagivenday;andpartialknowl-edge,wherethelearnerobservestheoutcomeso...
Maximization Optimal Sequential is One
2023-11-14 21:45:421969267.42 KB15
下载文档
Naive Exploration is Optimal for Online LQR
NaiveExplorationisOptimalforOnlineLQRMaxSimchowitz1DylanJ.Foster2Abstractdevelopanon-asymptotictheoryofdata-drivencontinuouscontrol,withanemphasisonunderstandingkeyalgorithmicWeconsidertheproblemof...
for Online Optimal Exploration is
2023-11-14 21:45:181773778.26 KB25
下载文档
is There a Trade-Off Between Fairness and Accuracy A Perspective Using Mismatched Hypothesis Testing
isThereaTrade-OffBetweenFairnessandAccuracy?APerspectiveUsingMismatchedHypothesisTestingSanghamitraDutta12DennisWei1HazarYueksel1Pin-YuChen1SijiaLiu1KushR.Varshney1Abstract2012;Agarwaletal.,2018;Ha...
and Fairness is There Accuracy
2023-11-14 21:44:4518251.11 MB18
下载文档
is Local SGD Better than Minibatch SGD
isLocalSGDBetterthanMinibatchSGD?BlakeWoodworth1KumarKshitijPatel1SebastianU.Stich2ZhenDai3BrianBullins1H.BrendanMcMahan4OhadShamir5NathanSrebro1Abstractcludingindatacenterand“FederatedLearning”s...
Local is Better than SGD
2023-11-14 21:44:44800617.25 KB26
下载文档
How Good is the Bayes Posterior in Deep Neural Networks Really
HowGoodistheBayesPosteriorinDeepNeuralNetworksReally?FlorianWenzel1KevinRoth+2BastiaanS.Veeling+31JakubS´wia˛tkowski4+LinhTran5+StephanMandt6+JasperSnoek1TimSalimans1RodolpheJenatton1SebastianNow...
Bayes the in How Good
2023-11-14 21:44:3114998.55 MB20
下载文档

首页上页 1 2 下页尾页