Tied & Reduced RNN-T Decoder

Botros, Rami; Sainath, Tara N.; David, Robert; Guzman, Emmanuel; Li, Wei; He, Yanzhang

doi:10.21437/Interspeech.2021-212


arXiv:2109.07513 (cs)

[Submitted on 15 Sep 2021]


Authors:Rami Botros (1), Tara N. Sainath (1), Robert David (1), Emmanuel Guzman (1), Wei Li (1), Yanzhang He (1) ((1) Google Inc. USA)


Abstract: Previous work on Recurrent Neural Network-Transducer (RNN-T) models has shown that, under some conditions, the prediction network can be simplified with little or no loss in recognition accuracy (arXiv:2003.07705 [eess.AS], [2], arXiv:2012.06749 [cs.CL]). This is done by limiting the context size of previous labels and/or replacing the LSTM layers with a simpler architecture. The benefits of such changes include a smaller model, faster inference, and power savings, all of which are useful for on-device applications.
In this work, we study ways to make the RNN-T decoder (prediction network + joint network) smaller and faster without degrading recognition performance. Our prediction network performs a simple weighted averaging of the input embeddings and shares its embedding matrix weights with the joint network's output layer (a.k.a. weight tying, commonly used in language modeling; arXiv:1611.01462 [cs.LG]). This simple design, when used in conjunction with additional Edit-based Minimum Bayes Risk (EMBR) training, reduces the RNN-T decoder from 23M parameters to just 2M without affecting word-error rate (WER).
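
To make the two ideas in the abstract concrete, the following is a minimal pure-Python sketch, not the authors' implementation. It assumes one scalar averaging weight per context position, a context of two previous labels, and a joint network that simply adds the encoder frame to the prediction output and applies tanh; the abstract specifies none of these. All names and sizes (`VOCAB_SIZE`, `EMBED_DIM`, `CONTEXT_SIZE`, `shared_embedding`, `position_weights`, `count_parameters`) are hypothetical and chosen only for illustration.

```python
import math
import random

# Hypothetical toy sizes; the abstract does not give the real ones.
VOCAB_SIZE = 8
EMBED_DIM = 4
CONTEXT_SIZE = 2

rng = random.Random(0)

# One matrix serves both as the label embedding table and as the
# joint network's output layer (weight tying).
shared_embedding = [
    [rng.uniform(-0.1, 0.1) for _ in range(EMBED_DIM)]
    for _ in range(VOCAB_SIZE)
]

# Assumption: one scalar weight per context position.
position_weights = [0.7, 0.3]


def prediction_network(prev_labels):
    """Return the weighted average of the previous labels' embeddings."""
    assert len(prev_labels) == CONTEXT_SIZE
    total = sum(position_weights)
    out = [0.0] * EMBED_DIM
    for weight, label in zip(position_weights, prev_labels):
        row = shared_embedding[label]
        for d in range(EMBED_DIM):
            out[d] += (weight / total) * row[d]
    return out


def joint_logits(encoder_frame, prev_labels):
    """Combine encoder and prediction outputs, then project with the tied matrix."""
    pred = prediction_network(prev_labels)
    # Assumption: additive combination followed by tanh.
    hidden = [math.tanh(e + p) for e, p in zip(encoder_frame, pred)]
    # Tied output layer: logits[v] = dot(shared_embedding[v], hidden).
    return [sum(r * h for r, h in zip(row, hidden)) for row in shared_embedding]


def count_parameters(tied):
    """Count the toy decoder's parameters with or without weight tying."""
    embedding_params = VOCAB_SIZE * EMBED_DIM
    matrices = 1 if tied else 2  # untied: separate embedding and output matrices
    return matrices * embedding_params + len(position_weights)


logits = joint_logits([0.5, -0.2, 0.1, 0.0], prev_labels=[3, 5])
```

In this toy setup, tying removes one full `VOCAB_SIZE x EMBED_DIM` matrix, and the averaging step adds only `CONTEXT_SIZE` weights where a recurrent prediction network would add whole LSTM layers. The sketch shows only where such savings come from; it does not reproduce the 23M to 2M figure reported above.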

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2109.07513 [cs.CL]
	(or arXiv:2109.07513v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.07513
Journal reference:	Proc. Interspeech 2021, 4563-4567
Related DOI:	https://doi.org/10.21437/Interspeech.2021-212

Submission history

From: Rami Botros
[v1] Wed, 15 Sep 2021 18:19:16 UTC (155 KB)
