default search action
Ashish Vaswani
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [j3]Salman H. Khan, Fahad Shahbaz Khan, Ashish Vaswani, Niki Parmar, Ming-Hsuan Yang, Mubarak Shah:
Guest Editorial Introduction to the Special Section on Transformer Models in Vision. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 12721-12725 (2023) - 2022
- [c32]Mostafa Dehghani, Yi Tay, Anurag Arnab, Lucas Beyer, Ashish Vaswani:
The Efficiency Misnomer. ICLR 2022 - [c31]Yi Tay, Mostafa Dehghani, Jinfeng Rao, William Fedus, Samira Abnar, Hyung Won Chung, Sharan Narang, Dani Yogatama, Ashish Vaswani, Donald Metzler:
Scale Efficiently: Insights from Pretraining and Finetuning Transformers. ICLR 2022 - 2021
- [j2]Aurko Roy, Mohammad Saffar, Ashish Vaswani, David Grangier:
Efficient Content-Based Sparse Attention with Routing Transformers. Trans. Assoc. Comput. Linguistics 9: 53-68 (2021) - [c30]Ashish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake A. Hechtman, Jonathon Shlens:
Scaling Local Self-Attention for Parameter Efficient Visual Backbones. CVPR 2021: 12894-12904 - [c29]Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani:
Bottleneck Transformers for Visual Recognition. CVPR 2021: 16519-16529 - [i20]Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani:
Bottleneck Transformers for Visual Recognition. CoRR abs/2101.11605 (2021) - [i19]Ashish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake A. Hechtman, Jonathon Shlens:
Scaling Local Self-Attention for Parameter Efficient Visual Backbones. CoRR abs/2103.12731 (2021) - [i18]Vidhisha Balachandran, Ashish Vaswani, Yulia Tsvetkov, Niki Parmar:
Simple and Efficient ways to Improve REALM. CoRR abs/2104.08710 (2021) - [i17]Yi Tay, Mostafa Dehghani, Jinfeng Rao, William Fedus, Samira Abnar, Hyung Won Chung, Sharan Narang, Dani Yogatama, Ashish Vaswani, Donald Metzler:
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers. CoRR abs/2109.10686 (2021) - [i16]Mostafa Dehghani, Anurag Arnab, Lucas Beyer, Ashish Vaswani, Yi Tay:
The Efficiency Misnomer. CoRR abs/2110.12894 (2021) - 2020
- [i15]Aurko Roy, Mohammad Saffar, Ashish Vaswani, David Grangier:
Efficient Content-Based Sparse Attention with Routing Transformers. CoRR abs/2003.05997 (2020)
2010 – 2019
- 2019
- [c28]Vihan Jain, Gabriel Magalhães, Alexander Ku, Ashish Vaswani, Eugene Ie, Jason Baldridge:
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation. ACL (1) 2019: 1862-1872 - [c27]Irwan Bello, Barret Zoph, Quoc Le, Ashish Vaswani, Jonathon Shlens:
Attention Augmented Convolutional Networks. ICCV 2019: 3285-3294 - [c26]Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Ian Simon, Curtis Hawthorne, Noam Shazeer, Andrew M. Dai, Matthew D. Hoffman, Monica Dinculescu, Douglas Eck:
Music Transformer: Generating Music with Long-Term Structure. ICLR (Poster) 2019 - [c25]Niki Parmar, Prajit Ramachandran, Ashish Vaswani, Irwan Bello, Anselm Levskaya, Jonathon Shlens:
Stand-Alone Self-Attention in Vision Models. NeurIPS 2019: 68-80 - [i14]Irwan Bello, Barret Zoph, Ashish Vaswani, Jonathon Shlens, Quoc V. Le:
Attention Augmented Convolutional Networks. CoRR abs/1904.09925 (2019) - [i13]Vihan Jain, Gabriel Magalhães, Alexander Ku, Ashish Vaswani, Eugene Ie, Jason Baldridge:
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation. CoRR abs/1905.12255 (2019) - [i12]Prajit Ramachandran, Niki Parmar, Ashish Vaswani, Irwan Bello, Anselm Levskaya, Jonathon Shlens:
Stand-Alone Self-Attention in Vision Models. CoRR abs/1906.05909 (2019) - 2018
- [c24]Mia Xu Chen, Orhan Firat, Ankur Bapna, Melvin Johnson, Wolfgang Macherey, George F. Foster, Llion Jones, Mike Schuster, Noam Shazeer, Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Lukasz Kaiser, Zhifeng Chen, Yonghui Wu, Macduff Hughes:
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation. ACL (1) 2018: 76-86 - [c23]Ashish Vaswani, Samy Bengio, Eugene Brevdo, François Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Lukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit:
Tensor2Tensor for Neural Machine Translation. AMTA (1) 2018: 193-199 - [c22]Lukasz Kaiser, Samy Bengio, Aurko Roy, Ashish Vaswani, Niki Parmar, Jakob Uszkoreit, Noam Shazeer:
Fast Decoding in Sequence Models Using Discrete Latent Variables. ICML 2018: 2395-2404 - [c21]Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Lukasz Kaiser, Noam Shazeer, Alexander Ku, Dustin Tran:
Image Transformer. ICML 2018: 4052-4061 - [c20]Peter Shaw, Jakob Uszkoreit, Ashish Vaswani:
Self-Attention with Relative Position Representations. NAACL-HLT (2) 2018: 464-468 - [c19]Noam Shazeer, Youlong Cheng, Niki Parmar, Dustin Tran, Ashish Vaswani, Penporn Koanantakool, Peter Hawkins, HyoukJoong Lee, Mingsheng Hong, Cliff Young, Ryan Sepassi, Blake A. Hechtman:
Mesh-TensorFlow: Deep Learning for Supercomputers. NeurIPS 2018: 10435-10444 - [i11]Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Lukasz Kaiser, Noam Shazeer, Alexander Ku:
Image Transformer. CoRR abs/1802.05751 (2018) - [i10]Peter Shaw, Jakob Uszkoreit, Ashish Vaswani:
Self-Attention with Relative Position Representations. CoRR abs/1803.02155 (2018) - [i9]Lukasz Kaiser, Aurko Roy, Ashish Vaswani, Niki Parmar, Samy Bengio, Jakob Uszkoreit, Noam Shazeer:
Fast Decoding in Sequence Models using Discrete Latent Variables. CoRR abs/1803.03382 (2018) - [i8]Ashish Vaswani, Samy Bengio, Eugene Brevdo, François Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Lukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit:
Tensor2Tensor for Neural Machine Translation. CoRR abs/1803.07416 (2018) - [i7]Aurko Roy, Ashish Vaswani, Arvind Neelakantan, Niki Parmar:
Theory and Experiments on Vector Quantized Autoencoders. CoRR abs/1805.11063 (2018) - [i6]Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinícius Flores Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Çaglar Gülçehre, H. Francis Song, Andrew J. Ballard, Justin Gilmer, George E. Dahl, Ashish Vaswani, Kelsey R. Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matthew M. Botvinick, Oriol Vinyals, Yujia Li, Razvan Pascanu:
Relational inductive biases, deep learning, and graph networks. CoRR abs/1806.01261 (2018) - [i5]Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Noam Shazeer, Curtis Hawthorne, Andrew M. Dai, Matthew D. Hoffman, Douglas Eck:
An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation. CoRR abs/1809.04281 (2018) - [i4]Noam Shazeer, Youlong Cheng, Niki Parmar, Dustin Tran, Ashish Vaswani, Penporn Koanantakool, Peter Hawkins, HyoukJoong Lee, Mingsheng Hong, Cliff Young, Ryan Sepassi, Blake A. Hechtman:
Mesh-TensorFlow: Deep Learning for Supercomputers. CoRR abs/1811.02084 (2018) - 2017
- [c18]Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin:
Attention is All you Need. NIPS 2017: 5998-6008 - [i3]Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin:
Attention Is All You Need. CoRR abs/1706.03762 (2017) - [i2]Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, Jakob Uszkoreit:
One Model To Learn Them All. CoRR abs/1706.05137 (2017) - 2016
- [j1]Ashish Vaswani, Kenji Sagae:
Efficient Structured Inference for Transition-Based Parsing with Neural Networks and Error States. Trans. Assoc. Comput. Linguistics 4: 183-196 (2016) - [c17]Ke M. Tran, Yonatan Bisk, Ashish Vaswani, Daniel Marcu, Kevin Knight:
Unsupervised Neural Hidden Markov Models. SPNLP@EMNLP 2016: 63-71 - [c16]Ashish Vaswani, Yonatan Bisk, Kenji Sagae, Ryan Musa:
Supertagging With LSTMs. HLT-NAACL 2016: 232-237 - [c15]Boliang Zhang, Xiaoman Pan, Tianlu Wang, Ashish Vaswani, Heng Ji, Kevin Knight, Daniel Marcu:
Name Tagging for Low-resource Incident Languages based on Expectation-driven Learning. HLT-NAACL 2016: 249-259 - [c14]Barret Zoph, Ashish Vaswani, Jonathan May, Kevin Knight:
Simple, Fast Noise-Contrastive Estimation for Large RNN Vocabularies. HLT-NAACL 2016: 1217-1222 - [i1]Ke M. Tran, Yonatan Bisk, Ashish Vaswani, Daniel Marcu, Kevin Knight:
Unsupervised Neural Hidden Markov Models. CoRR abs/1609.09007 (2016) - 2015
- [c13]Qing Dou, Ashish Vaswani, Kevin Knight, Chris Dyer:
Unifying Bayesian Inference and Vector Space Models for Improved Decipherment. ACL (1) 2015: 836-845 - [c12]Tomer Levinboim, Ashish Vaswani, David Chiang:
Model Invertibility Regularization: Sequence Alignment With or Without Parallel Data. HLT-NAACL 2015: 609-618 - 2014
- [c11]Leila Wehbe, Ashish Vaswani, Kevin Knight, Tom M. Mitchell:
Aligning context-based statistical models of language with brain activity during reading. EMNLP 2014: 233-243 - [c10]Qing Dou, Ashish Vaswani, Kevin Knight:
Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation. EMNLP 2014: 557-565 - 2013
- [c9]Ashish Vaswani, Yinggong Zhao, Victoria Fossum, David Chiang:
Decoding with Large-Scale Neural Language Models Improves Translation. EMNLP 2013: 1387-1392 - [c8]Dirk Hovy, Taylor Berg-Kirkpatrick, Ashish Vaswani, Eduard H. Hovy:
Learning Whom to Trust with MACE. HLT-NAACL 2013: 1120-1130 - 2012
- [c7]Ashish Vaswani, Liang Huang, David Chiang:
Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm. ACL (1) 2012: 311-319 - 2011
- [c6]Dirk Hovy, Ashish Vaswani, Stephen Tratz, David Chiang, Eduard H. Hovy:
Models and Training for Unsupervised Preposition Sense Disambiguation. ACL (2) 2011: 323-328 - [c5]Ashish Vaswani, Haitao Mi, Liang Huang, David Chiang:
Rule Markov Models for Fast Tree-to-String Translation. ACL 2011: 856-864 - 2010
- [c4]Ashish Vaswani, Adam Pauls, David Chiang:
Efficient Optimization of an MDL-Inspired Objective Function for Unsupervised Part-Of-Speech Tagging. ACL (2) 2010: 209-214 - [c3]Sujith Ravi, Ashish Vaswani, Kevin Knight, David Chiang:
Fast, Greedy Model Minimization for Unsupervised Tagging. COLING 2010: 940-948
2000 – 2009
- 2007
- [c2]David R. Traum, Antonio Roque, Anton Leuski, Panayiotis G. Georgiou, Jillian Gerten, Bilyana Martinovski, Shrikanth Narayanan, Susan Robinson, Ashish Vaswani:
Hassan: A Virtual Human for Tactical Questioning. SIGdial 2007: 71-74 - 2006
- [c1]Antonio Roque, Anton Leuski, Vivek Kumar Rangarajan Sridhar, Susan Robinson, Ashish Vaswani, Shrikanth S. Narayanan, David R. Traum:
Radiobot-CFF: a spoken dialogue system for military training. INTERSPEECH 2006
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-04 20:05 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint