On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition

Tang, Hao; Glass, James

Computer Science > Computation and Language

arXiv:1807.03396 (cs)

[Submitted on 9 Jul 2018 (v1), last revised 31 Oct 2018 (this version, v3)]

Title:On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition

Authors:Hao Tang, James Glass

View PDF

Abstract:Recurrent neural networks have been the dominant models for many speech and language processing tasks. However, we understand little about the behavior and the class of functions recurrent networks can realize. Moreover, the heuristics used during training complicate the analyses. In this paper, we study recurrent networks' ability to learn long-term dependency in the context of speech recognition. We consider two decoding approaches, online and batch decoding, and show the classes of functions to which the decoding approaches correspond. We then draw a connection between batch decoding and a popular training approach for recurrent networks, truncated backpropagation through time. Changing the decoding approach restricts the amount of past history recurrent networks can use for prediction, allowing us to analyze their ability to remember. Empirically, we utilize long-term dependency in subphonetic states, phonemes, and words, and show how the design decisions, such as the decoding approach, lookahead, context frames, and consecutive prediction, characterize the behavior of recurrent networks. Finally, we draw a connection between Markov processes and vanishing gradients. These results have implications for studying the long-term dependency in speech data and how these properties are learned by recurrent networks.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1807.03396 [cs.CL]
	(or arXiv:1807.03396v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1807.03396

Submission history

From: Hao Tang [view email]
[v1] Mon, 9 Jul 2018 21:31:49 UTC (31 KB)
[v2] Mon, 24 Sep 2018 15:41:08 UTC (31 KB)
[v3] Wed, 31 Oct 2018 17:17:30 UTC (31 KB)

Computer Science > Computation and Language

Title:On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:On Training Recurrent Networks with Truncated Backpropagation Through Time in Speech Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators