Multi-Stream Transformers

Burtsev, Mikhail; Rumshisky, Anna

Computer Science > Computation and Language

arXiv:2107.10342 (cs)

[Submitted on 21 Jul 2021]

Title:Multi-Stream Transformers

Authors:Mikhail Burtsev, Anna Rumshisky

View PDF

Abstract:Transformer-based encoder-decoder models produce a fused token-wise representation after every encoder layer. We investigate the effects of allowing the encoder to preserve and explore alternative hypotheses, combined at the end of the encoding process. To that end, we design and examine a $\textit{Multi-stream Transformer}$ architecture and find that splitting the Transformer encoder into multiple encoder streams and allowing the model to merge multiple representational hypotheses improves performance, with further improvement obtained by adding a skip connection between the first and the final encoder layer.

Subjects:	Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2107.10342 [cs.CL]
	(or arXiv:2107.10342v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2107.10342

Submission history

From: Mikhail Burtsev [view email]
[v1] Wed, 21 Jul 2021 20:16:57 UTC (10,532 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-07

Change to browse by:

cs
cs.NE

References & Citations

DBLP - CS Bibliography

listing | bibtex

Mikhail S. Burtsev
Mikhail Burtsev
Anna Rumshisky

export BibTeX citation

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computation and Language

Title:Multi-Stream Transformers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computation and Language

Title:Multi-Stream Transformers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators