ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
Wonjae Kim, Bokyung Son, Ildoo Kim
Abstract (truncated): Vision-and-Language Pre-training (VLP) has im…
[Figure residue: visual embedding schema; region feature; image; CNN]
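The contrast drawn in that figure is between heavy visual embedders (region features from a detector, or grid features from a CNN) and ViLT's convolution-free patch projection. A minimal sketch of the patch-projection idea, with our own function and parameter names:

    import torch
    import torch.nn as nn

    def patch_embed(img: torch.Tensor, patch: int, d_model: int) -> torch.Tensor:
        # Slice the image into non-overlapping patches and linearly project
        # each one; no CNN backbone or region detector is involved.
        b, c, h, w = img.shape
        patches = img.unfold(2, patch, patch).unfold(3, patch, patch)
        patches = patches.permute(0, 2, 3, 1, 4, 5).reshape(b, -1, c * patch * patch)
        proj = nn.Linear(c * patch * patch, d_model)  # created inline only for brevity
        return proj(patches)  # (batch, num_patches, d_model)

    print(patch_embed(torch.randn(2, 3, 32, 32), patch=16, d_model=64).shape)
    # torch.Size([2, 4, 64])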
Synthesizer: Rethinking Self-Attention for Transformer Models
Yi Tay, Dara Bahri, Donald Metzler, Da-Cheng Juan, Zhe Zhao, Che Zheng
Abstract (truncated): …widely attributed to this self-attention mechanism, since fully connected token grap…
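The paper's question is whether the dot-product interaction is necessary at all. A minimal sketch of its Dense Synthesizer variant under our own naming: attention weights are predicted from each token alone by a small MLP over a fixed maximum length, so no query-key products are ever formed.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class DenseSynthesizerHead(nn.Module):
        # Single-head sketch: an MLP maps each token to one logit per
        # attended position, replacing softmax(Q K^T / sqrt(d)).
        def __init__(self, d_model: int, max_len: int):
            super().__init__()
            self.proj = nn.Sequential(
                nn.Linear(d_model, d_model),
                nn.ReLU(),
                nn.Linear(d_model, max_len),
            )
            self.value = nn.Linear(d_model, d_model)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            seq_len = x.size(1)                   # x: (batch, seq_len, d_model)
            logits = self.proj(x)[..., :seq_len]  # (batch, seq_len, seq_len)
            weights = F.softmax(logits, dim=-1)   # no token-token dot products
            return weights @ self.value(x)

    out = DenseSynthesizerHead(d_model=64, max_len=32)(torch.randn(2, 16, 64))
    print(out.shape)  # torch.Size([2, 16, 64])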
Which Transformer architecture fits my data? A vocabulary bottleneck in self-attention
Noam Wies, Yoav Levine, Daniel Jannai, Amnon Shashua
Abstract (truncated): …unchanged, the chosen ratio between the number of self-attention layers (dept…
MSA Transformer
Roshan Rao, Jason Liu, Robert Verkuil, Joshua Meier, John F. Canny, Pieter Abbeel, Tom Sercu, Alexander Rives
Abstract (truncated): Unsupervised protein language models tr…
[Figure residue: column attention; untied row attention; feed forward]
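The surviving figure labels name the key design: attention is factorized along the two axes of a multiple sequence alignment, rows (positions within one sequence) and columns (the same position across sequences), rather than over all residues jointly. A rough single-MSA sketch with our own naming, feed-forward sublayer and normalization omitted:

    import torch
    import torch.nn as nn

    class AxialMSABlock(nn.Module):
        def __init__(self, d_model: int, num_heads: int = 4):
            super().__init__()
            self.row = nn.MultiheadAttention(d_model, num_heads, batch_first=True)
            self.col = nn.MultiheadAttention(d_model, num_heads, batch_first=True)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # x: (num_seqs, seq_len, d_model) -- one alignment, batch dim omitted
            x = x + self.row(x, x, x, need_weights=False)[0]        # within each sequence
            xt = x.transpose(0, 1)                                  # (seq_len, num_seqs, d)
            xt = xt + self.col(xt, xt, xt, need_weights=False)[0]   # across sequences
            return xt.transpose(0, 1)

    out = AxialMSABlock(d_model=64)(torch.randn(8, 20, 64))
    print(out.shape)  # torch.Size([8, 20, 64])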
Generative Video Transformer: Can Objects be the Words?
Yi-Fu Wu, Jaesik Yoon, Sungjin Ahn
Abstract (truncated): …interest to develop an analogous generative pre-training procedure for videos, the computational overhead in dealing…
Transformer Hawkes Process
Simiao Zuo, Haoming Jiang, Zichong Li, Tuo Zhao, Hongyuan Zha
Abstract (truncated): …hundreds of millions of users generate large amounts of tweets, which are essentially sequences of events at different time… Modern d…
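In a Hawkes process the object being modeled is a conditional intensity over event times; the paper's contribution is to compute the history embedding with a Transformer. A hedged sketch of a continuous softplus intensity of the kind used there (the exact parameterization in the paper may differ, and all names below are ours):

    import torch
    import torch.nn.functional as F

    def intensity(h_j: torch.Tensor, w: torch.Tensor, alpha: float, b: float,
                  t: float, t_j: float) -> torch.Tensor:
        # lambda(t) for t in the interval after event j: softplus keeps the
        # intensity positive, and the (t - t_j) / t_j term lets it change
        # continuously between events rather than jumping at each one.
        return F.softplus(alpha * (t - t_j) / t_j + h_j @ w + b)

    h_j = torch.randn(32)  # history embedding produced by the Transformer
    w = torch.randn(32)
    print(intensity(h_j, w, alpha=0.1, b=0.0, t=2.5, t_j=2.0))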
On Layer Normalization in the Transformer Architecture
Ruibin Xiong, Yunchang Yang, Di He, Kai Zheng, Shuxin Zheng, Chen Xing, Huishuai Zhang, Yanyan Lan, Liwei Wang, Tie-Yan Liu
1. Introduction (truncated): The Transformer is…
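The architectural distinction the paper analyzes fits in a few lines. A toy block with our own naming, attention sublayer only: Post-LN (the original Transformer) normalizes after the residual addition, Pre-LN normalizes the sublayer input.

    import torch
    import torch.nn as nn

    class Block(nn.Module):
        def __init__(self, d_model: int, pre_ln: bool):
            super().__init__()
            self.pre_ln = pre_ln
            self.norm = nn.LayerNorm(d_model)
            self.attn = nn.MultiheadAttention(d_model, num_heads=4, batch_first=True)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            if self.pre_ln:  # Pre-LN: x + Sublayer(LN(x))
                h = self.norm(x)
                return x + self.attn(h, h, h, need_weights=False)[0]
            # Post-LN: LN(x + Sublayer(x))
            return self.norm(x + self.attn(x, x, x, need_weights=False)[0])

The paper's argument is that the Pre-LN placement gives well-behaved gradients at initialization, which is why it trains stably without the learning-rate warm-up stage Post-LN needs.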
Non-autoregressive Machine Translation with Disentangled Context Transformer
Jungo Kasai, James Cross, Marjan Ghazvininejad, Jiatao Gu
Abstract (truncated): …conditional independence and prevents the model from properly capturing the h…
Learning to Encode Position for Transformer with Continuous Dynamical Model
Xuanqing Liu, Hsiang-Fu Yu, Inderjit S. Dhillon, Cho-Jui Hsieh
Abstract (truncated): We introduce a new way of learning to encode…
1. Introduction (truncated): Transformer-based m…
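A minimal sketch of the idea the title names, under our own naming: position encodings are samples of a continuous trajectory p(t) driven by a learned vector field dp/dt = f(t, p), integrated here with a crude forward Euler loop (the paper uses a proper ODE solver; this is only illustrative).

    import torch
    import torch.nn as nn

    class ODEPositionEncoder(nn.Module):
        def __init__(self, d_model: int, hidden: int = 64):
            super().__init__()
            self.f = nn.Sequential(
                nn.Linear(d_model + 1, hidden), nn.Tanh(), nn.Linear(hidden, d_model)
            )
            self.p0 = nn.Parameter(torch.zeros(d_model))  # initial position state

        def forward(self, seq_len: int, dt: float = 0.1) -> torch.Tensor:
            p, out = self.p0, []
            for i in range(seq_len):
                out.append(p)
                t = torch.tensor([i * dt])
                p = p + dt * self.f(torch.cat([p, t]))  # Euler step of dp/dt = f(t, p)
            return torch.stack(out)  # (seq_len, d_model), added to token embeddings

    print(ODEPositionEncoder(d_model=16)(seq_len=5).shape)  # torch.Size([5, 16])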
Improving Transformer Optimization Through Better Initialization
Xiao Shi Huang, Felipe Pérez, Jimmy Ba, Maksims Volkovs
Abstract (truncated): …et al., 2019; Sun et al., 2019). Despite the broad applications, optimization in the Transfo…
Encoding Musical Style with Transformer Autoencoders
Kristy Choi, Curtis Hawthorne, Ian Simon, Monica Dinculescu, Jesse Engel
Abstract (truncated): …twofold. First, Transformers (Vaswani et al., 2017) and their variants excel as unconditio…
The Evolved Transformer
David R. So, Chen Liang, Quoc V. Le
Abstract (truncated): Recent works have highlighted the strength of…
1. Introduction (truncated): …models, although some effort has also been invested in searching for sequence models (Zoph & Le, 2017; Pham et al., 2018…
Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
Juho Lee, Yoonho Lee, Jungtaek Kim, Adam R. Kosiorek, Seungjin Choi, Yee Whye Teh
Abstract (truncated): …1997; Maron & Lozano-Pérez, 1998) is an examp…
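One concrete piece of the framework is easy to show: pooling by multihead attention, where a few learned seed vectors attend over the input set, yielding a summary that is invariant to the order of set elements. A minimal sketch with our own naming:

    import torch
    import torch.nn as nn

    class PMA(nn.Module):
        # k learned seeds query the set; output shape is (batch, k, d_model)
        # regardless of set size or ordering.
        def __init__(self, d_model: int, num_heads: int = 4, num_seeds: int = 1):
            super().__init__()
            self.seeds = nn.Parameter(torch.randn(1, num_seeds, d_model))
            self.attn = nn.MultiheadAttention(d_model, num_heads, batch_first=True)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            seeds = self.seeds.expand(x.size(0), -1, -1)
            return self.attn(seeds, x, x, need_weights=False)[0]

    pma = PMA(d_model=32)
    x = torch.randn(2, 10, 32)
    shuffled = x[:, torch.randperm(10)]  # reorder the set elements
    print(torch.allclose(pma(x), pma(shuffled), atol=1e-5))  # True: order-invariant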
Insertion Transformer: Flexible Sequence Generation via Insertion Operations
Mitchell Stern, William Chan, Jamie Kiros, Jakob Uszkoreit
Abstract (truncated): …Chan et al., 2016), speech synthesis (Oord et al., 2016a; Wang et al., 2017), ima…
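The decoding loop that distinguishes this model from left-to-right generation can be sketched independently of the network: start from an empty canvas and repeatedly insert a token at a chosen slot, stopping on an end marker. Everything below is our own toy naming; score stands in for the trained model.

    from typing import Callable, List, Tuple

    def insertion_decode(score: Callable[[List[str]], Tuple[int, str]],
                         max_steps: int = 10) -> List[str]:
        # Each step the model picks a slot in [0, len(canvas)] and a token
        # to insert there; '<eos>' terminates generation.
        canvas: List[str] = []
        for _ in range(max_steps):
            slot, token = score(canvas)
            if token == "<eos>":
                break
            canvas.insert(slot, token)
        return canvas

    # Toy stand-in that builds "a b c" center-out: b first, then a, then c,
    # an order a left-to-right decoder could never use.
    plan = [(0, "b"), (0, "a"), (2, "c"), (0, "<eos>")]
    print(insertion_decode(lambda canvas: plan[len(canvas)]))  # ['a', 'b', 'c']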
Equivariant Transformer Networks
Kai Sheng Tai, Peter Bailis, Gregory Valiant
Abstract (truncated): How can prior knowledge o…
1. Introduction (truncated): …scaling to each training image). While data augmentation typically helps reduce the test error of CNN-based models,…
Image Transformer
Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Łukasz Kaiser, Noam Shazeer, Alexander Ku, Dustin Tran
[Table 1 caption, truncated: Three outputs of a CelebA super-resolution model followed by three image completions by…]