A Study of All-Convolutional Encoders for Connectionist Temporal Classification

Krishna, Kalpesh; Lu, Liang; Gimpel, Kevin; Livescu, Karen

arXiv:1710.10398 (cs)

[Submitted on 28 Oct 2017 (v1), last revised 15 Feb 2018 (this version, v2)]

Abstract: Connectionist temporal classification (CTC) is a popular sequence prediction approach for automatic speech recognition that is typically used with models based on recurrent neural networks (RNNs). We explore whether deep convolutional neural networks (CNNs) can be used effectively instead of RNNs as the "encoder" in CTC. CNNs lack an explicit representation of the entire sequence, but have the advantage that they are much faster to train. We present an exploration of CNNs as encoders for CTC models, in the context of character-based (lexicon-free) automatic speech recognition. In particular, we explore a range of one-dimensional convolutional layers, which are particularly efficient. We compare the performance of our CNN-based models against typical RNN-based models in terms of training time, decoding time, model size and word error rate (WER) on the Switchboard Eval2000 corpus. We find that our CNN-based models are close in performance to LSTMs, while not matching them, and are much faster to train and decode.
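The remark that CNNs lack an explicit representation of the entire sequence can be made concrete by counting how many input frames a single output frame of a stacked one-dimensional convolutional encoder can see. The sketch below is only an illustration: the function name `receptive_field` and the layer configuration (ten layers, kernel width 7, stride 1) are assumptions chosen for the example, not values taken from the paper.

```python
from typing import Iterable, Tuple


def receptive_field(layers: Iterable[Tuple[int, int]]) -> int:
    """Return the number of input frames seen by one output frame.

    `layers` lists (kernel_size, stride) pairs in input-to-output order.
    Dilation is assumed to be 1 for every layer.
    """
    field = 1  # a single output frame before any layer is applied
    jump = 1   # distance, in input frames, between adjacent frames at this depth
    for kernel_size, stride in layers:
        # Each layer widens the field by (kernel_size - 1) steps at the current jump.
        field += (kernel_size - 1) * jump
        # Striding spreads adjacent frames of deeper layers further apart.
        jump *= stride
    return field


# Hypothetical encoder: 10 one-dimensional conv layers, kernel width 7, stride 1.
hypothetical_stack = [(7, 1)] * 10
print(receptive_field(hypothetical_stack))  # 61
```

Under these assumed settings each output frame depends on a window of 61 input frames, however long the utterance is, whereas a bidirectional recurrent encoder can in principle condition every output on the whole sequence.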

Comments:	Accepted to ICASSP-2018
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1710.10398 [cs.CL]
	(or arXiv:1710.10398v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1710.10398

Submission history

From: Kalpesh Krishna
[v1] Sat, 28 Oct 2017 06:24:36 UTC (88 KB)
[v2] Thu, 15 Feb 2018 18:55:30 UTC (86 KB)
