default search action
Rengan Xu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c14]Rengan Xu, Junjie Yang, Yifan Xu, Hong Li, Xing Liu, Devashish Shankar, Haoci Zhang, Meng Liu, Boyang Li, Yuxi Hu, Mingwei Tang, Zehua Zhang, Tunhou Zhang, Dai Li, Sijia Chen, Gian-Paolo Musumeci, Jiaqi Zhai, Bill Zhu, Hong Yan, Srihari Reddy:
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention. RecSys 2024: 778-780 - [i3]Mingwei Tang, Meng Liu, Hong Li, Junjie Yang, Chenglin Wei, Boyang Li, Dai Li, Rengan Xu, Yifan Xu, Zehua Zhang, Xiangyu Wang, Linfeng Liu, Yuelei Xie, Chengye Liu, Labib Fawaz, Li Li, Hongnan Wang, Bill Zhu, Sri Reddy:
Async Learned User Embeddings for Ads Delivery Optimization. CoRR abs/2406.05898 (2024) - [i2]Rengan Xu, Junjie Yang, Yifan Xu, Hong Li, Xing Liu, Devashish Shankar, Haoci Zhang, Meng Liu, Boyang Li, Yuxi Hu, Mingwei Tang, Zehua Zhang, Tunhou Zhang, Dai Li, Sijia Chen, Gian-Paolo Musumeci, Jiaqi Zhai, Bill Zhu, Hong Yan, Srihari Reddy:
Enhancing Performance and Scalability of Large-Scale Recommendation Systems with Jagged Flash Attention. CoRR abs/2409.15373 (2024)
2010 – 2019
- 2019
- [c13]Derya Cavdar, Valeriu Codreanu, Can Karakus, John A. Lockman III, Damian Podareanu, Vikram A. Saletore, Alexander Sergeev, Don D. Smith II, Victor Suthichai, Quy Ta, Srinivas Varadharajan, Lucas A. Wilson, Rengan Xu, Pei Yang:
Densifying Assumed-Sparse Tensors - Improving Memory Efficiency and MPI Collective Performance During Tensor Accumulation for Parallelized Training of Neural Machine Translation Models. ISC 2019: 23-39 - [i1]Derya Çavdar, Valeriu Codreanu, Can Karakus, John A. Lockman III, Damian Podareanu, Vikram A. Saletore, Alexander Sergeev, Don D. Smith II, Victor Suthichai, Quy Ta, Srinivas Varadharajan, Lucas A. Wilson, Rengan Xu, Pei Yang:
Densifying Assumed-sparse Tensors: Improving Memory Efficiency and MPI Collective Performance during Tensor Accumulation for Parallelized Training of Neural Machine Translation Models. CoRR abs/1905.04035 (2019) - 2018
- [j3]Michael Wolfe, Seyong Lee, Jungwon Kim, Xiaonan Tian, Rengan Xu, Barbara M. Chapman, Sunita Chandrasekaran:
The OpenACC data model: Preliminary study on its major challenges and implementations. Parallel Comput. 78: 15-27 (2018) - [c12]Rengan Xu, Frank Han, Quy Ta:
Deep Learning at Scale on NVIDIA V100 Accelerators. PMBS@SC 2018: 23-32 - 2017
- [c11]Michael Wolfe, Seyong Lee, Jungwon Kim, Xiaonan Tian, Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman:
Implementing the OpenACC Data Model. IPDPS Workshops 2017: 662-672 - 2016
- [j2]Xiaonan Tian, Rengan Xu, Yonghong Yan, Sunita Chandrasekaran, Deepak Eachempati, Barbara M. Chapman:
Compiler transformation of nested loops for general purpose GPUs. Concurr. Comput. Pract. Exp. 28(2): 537-556 (2016) - [c10]Xiaonan Tian, Dounia Khaldi, Deepak Eachempati, Rengan Xu, Barbara M. Chapman:
Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations. ICPP 2016: 572-581 - [c9]Rengan Xu, Sunita Chandrasekaran, Xiaonan Tian, Barbara M. Chapman:
An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling. ISC 2016: 3-20 - 2015
- [j1]Rengan Xu, Xiaonan Tian, Sunita Chandrasekaran, Barbara M. Chapman:
Multi-GPU Support on Single Node Using Directive-Based Programming Model. Sci. Program. 2015: 621730:1-621730:15 (2015) - 2014
- [c8]Cheng Wang, Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman, Oscar R. Hernandez:
A Validation Testsuite for OpenACC 1.0. IPDPS Workshops 2014: 1407-1416 - [c7]Rengan Xu, Xiaonan Tian, Sunita Chandrasekaran, Yonghong Yan, Barbara M. Chapman:
NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model. LCPC 2014: 67-81 - [c6]Rengan Xu, Xiaonan Tian, Yonghong Yan, Sunita Chandrasekaran, Barbara M. Chapman:
Reduction Operations in Parallel Loops for GPGPUs. PMAM 2014: 10 - [c5]Rengan Xu, Maxime R. Hugues, Henri Calandra, Sunita Chandrasekaran, Barbara M. Chapman:
Accelerating Kirchhoff migration on GPU using directives. WACCPD@SC 2014: 37-46 - [c4]Guido Juckeland, William C. Brantley, Sunita Chandrasekaran, Barbara M. Chapman, Shuai Che, Mathew E. Colgrove, Huiyu Feng, Alexander Grund, Robert Henschel, Wen-mei W. Hwu, Huian Li, Matthias S. Müller, Wolfgang E. Nagel, Maxim Perminov, Pavel Shelepugin, Kevin Skadron, John A. Stratton, Alexey Titov, Ke Wang, G. Matthijs van Waveren, Brian Whitney, Sandra Wienke, Rengan Xu, Kalyan Kumaran:
SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance. PMBS@SC 2014: 46-67 - 2013
- [c3]Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman:
Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model. IPDPS Workshops 2013: 1169-1176 - [c2]Rengan Xu, Mauricio Araya-Polo, Barbara M. Chapman:
Filesystem Aware Scalable I/O Framework for Data-Intensive Parallel Applications. IPDPS Workshops 2013: 2007-2014 - [c1]Xiaonan Tian, Rengan Xu, Yonghong Yan, Zhifeng Yun, Sunita Chandrasekaran, Barbara M. Chapman:
Compiling a High-Level Directive-Based Programming Model for GPGPUs. LCPC 2013: 105-120
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-23 20:36 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint