Shuai Che
2020 – today
- 2023
- [i3] Zhewei Yao, Reza Yazdani Aminabadi, Olatunji Ruwase, Samyam Rajbhandari, Xiaoxia Wu, Ammar Ahmad Awan, Jeff Rasley, Minjia Zhang, Conglong Li, Connor Holmes, Zhongzhu Zhou, Michael Wyatt, Molly Smith, Lev Kurilenko, Heyang Qin, Masahiro Tanaka, Shuai Che, Shuaiwen Leon Song, Yuxiong He:
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales. CoRR abs/2308.01320 (2023)
- 2022
- [j5] Hongxu Yin, Guoyang Chen, Yingmin Li, Shuai Che, Weifeng Zhang, Niraj K. Jha:
Towards Execution-Efficient LSTMs via Hardware-Guided Grow-and-Prune Paradigm. IEEE Trans. Emerg. Top. Comput. 10(4): 1799-1809 (2022)
- 2021
- [j4] Ye Yu, Yingmin Li, Shuai Che, Niraj K. Jha, Weifeng Zhang:
Software-Defined Design Space Exploration for an Efficient DNN Accelerator Architecture. IEEE Trans. Computers 70(1): 45-56 (2021)
- 2020
- [c25] Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steven K. Reinhardt, Martin C. Herbordt:
AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing. MICRO 2020: 922-936
- [c24] Bita Darvish Rouhani, Daniel Lo, Ritchie Zhao, Ming Liu, Jeremy Fowers, Kalin Ovtcharov, Anna Vinogradsky, Sarah Massengill, Lita Yang, Ray Bittner, Alessandro Forin, Haishan Zhu, Taesik Na, Prerak Patel, Shuai Che, Lok Chand Koppaka, Xia Song, Subhojit Som, Kaustav Das, Saurabh Tiwary, Steven K. Reinhardt, Sitaram Lanka, Eric S. Chung, Doug Burger:
Pushing the Limits of Narrow Precision Inferencing at Cloud Scale with Microsoft Floating Point. NeurIPS 2020
2010 – 2019
- 2019
- [c23] Shuai Che, Jieming Yin:
Northup: Divide-and-Conquer Programming in Systems with Heterogeneous Memories and Processors. IPDPS 2019: 335-344
- [i2] Hongxu Yin, Guoyang Chen, Yingmin Li, Shuai Che, Weifeng Zhang, Niraj K. Jha:
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM. CoRR abs/1901.10997 (2019)
- [i1] Ye Yu, Yingmin Li, Shuai Che, Niraj K. Jha, Weifeng Zhang:
Software-Defined Design Space Exploration for an Efficient AI Accelerator Architecture. CoRR abs/1903.07676 (2019)
- 2017
- [j3] Shuai Che, Bradford M. Beckmann, Steven K. Reinhardt:
Programming GPGPU Graph Applications with Linear Algebra Building Blocks. Int. J. Parallel Program. 45(3): 657-679 (2017)
- [c22] Nicholas Malaya, Shuai Che, Joseph L. Greathouse, René van Oostrum, Michael J. Schulte:
Accelerating Matrix Processing with GPUs. ARITH 2017: 139-141
- [c21] Shuai Che, Marc S. Orr, Jonathan Gallmeier:
Work Stealing in a Shared Virtual-Memory Heterogeneous Environment: A Case Study with Betweenness Centrality. Conf. Computing Frontiers 2017: 164-173
- [c20] Kaixi Hou, Wu-chun Feng, Shuai Che:
Auto-Tuning Strategies for Parallelizing Sparse Matrix-Vector (SpMV) Multiplication on Multi- and Many-Core Processors. IPDPS Workshops 2017: 713-722
- [c19] Marc S. Orr, Shuai Che, Bradford M. Beckmann, Mark Oskin, Steven K. Reinhardt, David A. Wood:
Gravel: fine-grain GPU-initiated network messages. SC 2017: 23
- 2016
- [c18] Shuai Che, Marc S. Orr, Gregory Rodgers, Jonathan Gallmeier:
Betweenness Centrality in an HSA-enabled System. HPGP@HPDC 2016: 35-38
- [c17] Shuai Che, Arkaprava Basu, Jonathan Gallmeier:
Challenges of Programming a System with Heterogeneous Memories and Heterogeneous Processors: A Programmer's View. MEMSYS 2016: 99-103
- [c16] Arkaprava Basu, Sooraj Puthoor, Shuai Che, Bradford M. Beckmann:
Software Assisted Hardware Cache Coherence for Heterogeneous Processors. MEMSYS 2016: 279-288
- [c15] Sooraj Puthoor, Ashwin M. Aji, Shuai Che, Mayank Daga, Wei Wu, Bradford M. Beckmann, Gregory Rodgers:
Implementing directed acyclic graphs with the heterogeneous system architecture. GPGPU@PPoPP 2016: 53-62
- 2015
- [c14] Marc S. Orr, Shuai Che, Ayse Yilmazer, Bradford M. Beckmann, Mark D. Hill, David A. Wood:
Synchronization Using Remote-Scope Promotion. ASPLOS 2015: 73-86
- [c13] Shuai Che, Gregory Rodgers, Bradford M. Beckmann, Steven K. Reinhardt:
Graph Coloring on the GPU and Some Techniques to Improve Load Imbalance. IPDPS Workshops 2015: 610-617
- 2014
- [j2] Shuai Che, Kevin Skadron:
BenchFriend: Correlating the performance of GPU benchmarks. Int. J. High Perform. Comput. Appl. 28(2): 238-250 (2014)
- [c12] Blake A. Hechtman, Shuai Che, Derek R. Hower, Yingying Tian, Bradford M. Beckmann, Mark D. Hill, Steven K. Reinhardt, David A. Wood:
QuickRelease: A throughput-oriented approach to release consistency on GPUs. HPCA 2014: 189-200
- [c11] Shuai Che:
GasCL: A vertex-centric graph model for GPUs. HPEC 2014: 1-6
- [c10] Shuai Che, Bradford M. Beckmann, Steven K. Reinhardt:
BelRed: Constructing GPGPU graph applications with software building blocks. HPEC 2014: 1-6
- [c9] Shuai Che, Jiayuan Meng, Kevin Skadron:
Dymaxion++: A Directive-Based API to Optimize Data Layout and Memory Mapping for Heterogeneous Systems. IPDPS Workshops 2014: 916-924
- [c8] Guido Juckeland, William C. Brantley, Sunita Chandrasekaran, Barbara M. Chapman, Shuai Che, Mathew E. Colgrove, Huiyu Feng, Alexander Grund, Robert Henschel, Wen-mei W. Hwu, Huian Li, Matthias S. Müller, Wolfgang E. Nagel, Maxim Perminov, Pavel Shelepugin, Kevin Skadron, John A. Stratton, Alexey Titov, Ke Wang, G. Matthijs van Waveren, Brian Whitney, Sandra Wienke, Rengan Xu, Kalyan Kumaran:
SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance. PMBS@SC 2014: 46-67
- 2013
- [c7] Michael Boyer, Kevin Skadron, Shuai Che, Nuwan Jayasena:
Load balancing in a changing world: dealing with heterogeneity and performance variability. Conf. Computing Frontiers 2013: 21:1-21:10
- [c6] Shuai Che, Bradford M. Beckmann, Steven K. Reinhardt, Kevin Skadron:
Pannotia: Understanding irregular GPGPU graph applications. IISWC 2013: 185-195
- 2011
- [c5] Wim Heirman, Trevor E. Carlson, Shuai Che, Kevin Skadron, Lieven Eeckhout:
Using cycle stacks to understand scaling bottlenecks in multi-threaded workloads. IISWC 2011: 38-49
- [c4] Shuai Che, Jeremy W. Sheaffer, Kevin Skadron:
Dymaxion: optimizing memory access patterns for heterogeneous systems. SC 2011: 13:1-13:11
- 2010
- [c3] Shuai Che, Jeremy W. Sheaffer, Michael Boyer, Lukasz G. Szafaryn, Liang Wang, Kevin Skadron:
A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads. IISWC 2010: 1-11
2000 – 2009
- 2009
- [c2] Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, Sang-Ha Lee, Kevin Skadron:
Rodinia: A benchmark suite for heterogeneous computing. IISWC 2009: 44-54
- 2008
- [j1] Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, Kevin Skadron:
A performance study of general-purpose applications on graphics processors using CUDA. J. Parallel Distributed Comput. 68(10): 1370-1380 (2008)
- [c1] Shuai Che, Jie Li, Jeremy W. Sheaffer, Kevin Skadron, John C. Lach:
Accelerating Compute-Intensive Applications with GPUs and FPGAs. SASP 2008: 101-107
last updated on 2024-08-05 20:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license