Deja-vu: Double feature presentation in deep transformer networks

A Tjandra, C Liu, F Zhang, X Zhang, Y Wang… - arXiv preprint, 2019