Synthesizer: Rethinking Self-Attention for Transformer Models

Yi Tay¹, Dara Bahri¹, Donald Metzler¹, Da-Cheng Juan¹, Zhe Zhao¹, Che Zheng¹

Abstract

widely attributed to this self-attention mechanism since fully connected token grap...