


SC 2022: Dallas, TX, USA
- Felix Wolf, Sameer Shende, Candace Culhane, Sadaf R. Alam, Heike Jagode:
SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, TX, USA, November 13-18, 2022. IEEE 2022, ISBN 978-1-6654-5444-5
- Oguz Selvitopi, Saliya Ekanayake, Giulia Guidi, Muaaz G. Awan, Georgios A. Pavlopoulos, Ariful Azad, Nikos Kyrpides, Leonid Oliker, Katherine A. Yelick, Aydin Buluç:
Extreme-Scale Many-against-Many Protein Similarity Search. 1:1-1:12
- Qinglei Cao, Sameh Abdulah, Rabab Alomairy, Yu Pei, Pratik Nag, George Bosilca, Jack J. Dongarra, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun:
Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications. 2:1-2:12
- Luca Fedeli, Axel Huebl, France Boillod-Cerneux, Thomas Clark, Kevin Gott, Conrad Hillairet, Stephan Jaure, Adrien Leblanc, Rémi Lehe, Andrew Myers, Christelle Piechurski, Mitsuhisa Sato, Neïl Zaïm, Weiqun Zhang, Jean-Luc Vay, Henri Vincenti:
Pushing the Frontier in the Design of Laser-Based Electron Accelerators with Groundbreaking Mesh-Refined Particle-In-Cell Simulations on Exascale-Class Supercomputers. 3:1-3:12
- Tsuyoshi Ichimura, Kohei Fujita, Ryota Kusakabe, Kentaro Koyama, Sota Murakami, Yuma Kikuchi, Takane Hori, Muneo Hori, Hikaru Inoue, Takafumi Nose, Takahiro Kawashima, Maddegedara Lalith:
Extreme Scale Earthquake Simulation with Uncertainty Quantification. 4:1-4:11
- Wei Hu, Hong An, Zhuoqiang Guo, Qingcai Jiang, Xinming Qin, Junshi Chen, Weile Jia, Chao Yang, Zhaolong Luo, Jielan Li, Wentiao Wu, Guangming Tan, Dongning Jia, Qinglin Lu, Fangfang Liu, Min Tian, Fang Li, Yeqi Huang, Liyi Wang, Sha Liu, Jinlong Yang:
2.5 Million-Atom Ab Initio Electronic-Structure Simulation of Complex Metallic Heterostructures with DGDFT. 5:1-5:13
- Ramakrishnan Kannan, Piyush Sao, Hao Lu, Jakub Kurzak, Gundolf Schenk, Yongmei Shi, Seung-Hwan Lim, Sharat Israni, Vijay Thakkar, Guojing Cong, Robert M. Patton, Sergio E. Baranzini, Richard W. Vuduc, Thomas E. Potok:
Exaflops Biomedical Knowledge Graph Analytics. 6:1-6:11
- Giuseppe M. J. Barca, Calum Snowdon, Jorge L. Galvez Vallejo, Fazeleh S. Kazemian, Alistair P. Rendell, Mark S. Gordon:
Scaling Correlated Fragment Molecular Orbital Calculations on Summit. 7:1-7:14
- Xiao Wang, Aristeidis Tsaris, Debangshu Mukherjee, Mohamed Wahib, Peng Chen, Mark Oxley, Olga Ovchinnikova, Jacob D. Hinkle:
Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction. 8:1-8:13
- Narangerelt Batsoyol, Benjamin S. Pullman, Mingxun Wang, Nuno Bandeira, Steven Swanson:
P-Massive: A Real-Time Search Engine for a Multi-Terabyte Mass Spectrometry Database. 9:1-9:15
- Salvatore Di Girolamo, Daniele De Sensi, Konstantin Taranov, Milos Malesevic, Maciej Besta, Timo Schneider, Severin Kistler, Torsten Hoefler:
Building Blocks for Network-Accelerated Distributed File Systems. 10:1-10:14
- Torsten Hoefler, Tommaso Bonato, Daniele De Sensi, Salvatore Di Girolamo, Shigang Li, Marco Heddes, Jon Belk, Deepak Goel, Miguel Castro, Steve Scott:
HammingMesh: A Network Topology for Large-Scale Deep Learning. 11:1-11:18
- Kartik Lakhotia, Maciej Besta, Laura Monroe, Kelly Isham, Patrick Iff, Torsten Hoefler, Fabrizio Petrini:
PolarFly: A Cost-Effective and Flexible Low-Diameter Topology. 12:1-12:15
- Ellis Wilson, Frank Mueller, Scott Pakin:
Combining Hard and Soft Constraints in Quantum Constraint-Satisfaction Systems. 13:1-13:14
- Honghui Shang, Li Shen, Yi Fan, Zhiqian Xu, Chu Guo, Jie Liu, Wenhao Zhou, Huan Ma, Rongfen Lin, Yuling Yang, Fang Li, Zhuoya Wang, Yunquan Zhang, Zhenyu Li:
Large-Scale Simulation of Quantum Computational Chemistry on a New Sunway Supercomputer. 14:1-14:14
- Tirthak Patel, Daniel Silver, Devesh Tiwari:
Charter: Identifying the Most-Critical Gate Operations in Quantum Circuits via Amplified Gate Reversibility. 15:1-15:16
- Mihailo Isakov, Mikaela Currier, Eliakin Del Rosario, Sandeep Madireddy, Prasanna Balaprakash, Philip H. Carns, Robert B. Ross, Glenn K. Lockwood, Michel A. Kinsy:
A Taxonomy of Error Sources in HPC I/O Machine Learning Models. 16:1-16:14
- Yafan Huang, Shengjian Guo, Sheng Di, Guanpeng Li, Franck Cappello:
Mitigating Silent Data Corruptions in HPC Applications across Multiple Program Inputs. 17:1-17:14
- Feng Zhang, Yihua Hu, Haipeng Ding, Zhiming Yao, Zhewei Wei, Xiao Zhang, Xiaoyong Du:
Optimizing Random Access to Hierarchically-Compressed Data on GPU. 18:1-18:15
- Yuanwei Wang, Huanqi Cao, Zixuan Ma, Wanwang Yin, Wenguang Chen:
Scaling Graph 500 SSSP to 140 Trillion Edges with over 40 Million Cores. 19:1-19:15
- Yao Kang, Xin Wang, Zhiling Lan:
Study of Workload Interference with Intelligent Routing on Dragonfly. 20:1-20:14
- Srinivasan Ramesh, Hank Childs, Allen D. Malony:
SERVIZ: A Shared In Situ Visualization Service. 21:1-21:14
- Rohan Basu Roy, Tirthak Patel, Devesh Tiwari:
DayDream: Executing Dynamic Scientific Workflows on Serverless Platforms with Hot Starts. 22:1-22:18
- Luke Logan, Jaime Cernuda Garcia, Jay F. Lofstead, Xian-He Sun, Anthony Kougkas:
LabStor: A Modular and Extensible Platform for Developing High-Performance, Customized I/O Stacks in Userspace. 23:1-23:15
- Yiqin Dai, Yong Dong, Kai Lu, Ruibo Wang, Wei Zhang, Juan Chen, Mingtian Shao, Zheng Wang:
Towards Scalable Resource Management for Supercomputers. 24:1-24:15
- Alexandros Nikolaos Ziogas, Grzegorz Kwasniewski, Tal Ben-Nun, Timo Schneider, Torsten Hoefler:
Deinsum: Practically I/O Optimal Multi-Linear Algebra. 25:1-25:15
- Ahmad Abdelfattah, Pieter Ghysels, Wajih Boukaram, Stanimire Tomov, Xiaoye Sherry Li, Jack J. Dongarra:
Addressing Irregular Patterns of Matrix Computations on GPUs and Their Impact on Applications Powered by Sparse Direct Solvers. 26:1-26:14
- Zonghao Feng, Qipeng Xie, Qiong Luo, Yujie Chen, Haoxuan Li, Huizhong Li, Qiang Yan:
Accelerating Elliptic Curve Digital Signature Algorithms on GPUs. 27:1-27:13
- Hua Huang, Edmond Chow:
CA3DMM: A New Algorithm Based on a Unified View of Parallel Matrix Multiplication. 28:1-28:15
- Olivier Beaumont, Philippe Duchon, Lionel Eyraud-Dubois, Julien Langou, Mathieu Vérité:
Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization. 29:1-29:15
- Mathias Jacquelin, Mauricio Araya-Polo, Jie Meng:
Scalable Distributed High-Order Stencil Computations. 30:1-30:13
- Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea:
Memory Optimizations in an Array Language. 31:1-31:15
- Kazem Cheshmi, Zachary Cetinic, Maryam Mehri Dehnavi:
Vectorizing Sparse Matrix Computations with Partially-Strided Codelets. 32:1-32:15
- Ignacio Laguna, Ganesh Gopalakrishnan:
Finding Inputs that Trigger Floating-Point Exceptions in GPUs via Bayesian Optimization. 33:1-33:14
- Farid Zakaria, Thomas R. W. Scogland, Todd Gamblin, Carlos Maltzahn:
Mapping Out the HPC Dependency Chaos. 34:1-34:12
- Todd Gamblin, Massimiliano Culpo, Gregory Becker, Sergei Shudler:
Using Answer Set Programming for HPC Dependency Solving. 35:1-35:15
- Sixing Yu, Phuong Nguyen, Waqwoya Abebe, Wei Qian, Ali Anwar, Ali Jannesari:
SPATL: Salient Parameter Aggregation and Transfer Learning for Heterogeneous Federated Learning. 36:1-36:14
- Shigang Li, Kazuki Osawa, Torsten Hoefler:
Efficient Quantized Sparse Matrix Operations on Tensor Cores. 37:1-37:15
- Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei Li:
LightSeq2: Accelerated Training for Transformer-Based Models on GPUs. 38:1-38:14
- Qingxiao Sun, Yi Liu, Hailong Yang, Ruizhe Zhang, Ming Dun, Mingzhen Li, Xiaoyan Liu, Wencong Xiao, Yong Li, Zhongzhi Luan, Depei Qian:
CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs. 39:1-39:15
- Bartlomiej Przybylski, Maciej Pawlik, Pawel Zuk, Bartlomiej Lagosz, Maciej Malawski, Krzysztof Rzadca:
Using Unused: Non-Invasive Dynamic FaaS Infrastructure with HPC-Whisk. 40:1-40:15
- Moiz Arif, Kevin Assogba, M. Mustafa Rafique:
Canary: Fault-Tolerant FaaS for Stateful Time-Sensitive Applications. 41:1-41:16
- Yuqi Fu, Li Liu, Haoliang Wang, Yue Cheng, Songqing Chen:
SFS: Smart OS Scheduling for Serverless Functions. 42:1-42:16
- Maciej Besta, Cesare Miglioli, Paolo Sylos Labini, Jakub Tetek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella, Onur Mutlu, Torsten Hoefler:
ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations. 43:1-43:17
- Juno Kim, Steven Swanson:
Blaze: Fast Graph Processing on Fast SSDs. 44:1-44:15
- Dan Chen, Chuangyi Gui, Yi Zhang, Hai Jin, Long Zheng, Yu Huang, Xiaofei Liao:
GraphFly: Efficient Asynchronous Streaming Graphs Processing via Dependency-Flow. 45:1-45:14
- Reza Yazdani Aminabadi, Samyam Rajbhandari, Ammar Ahmad Awan, Cheng Li, Du Li, Elton Zheng, Olatunji Ruwase, Shaden Smith, Minjia Zhang, Jeff Rasley, Yuxiong He:
DeepSpeed-Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale. 46:1-46:15
- Baorun Mu, Saeed Soori, Bugra Can, Mert Gürbüzbalaban, Maryam Mehri Dehnavi:
HyLo: A Hybrid Low-Rank Natural Gradient Descent Method. 47:1-47:16
- Xuncheng Zhao, Mingfan Li, Qian Xiao, Junshi Chen, Fei Wang, Li Shen, Meijia Zhao, Wenhao Wu, Hong An, Lixin He, Xiao Liang:
AI for Quantum Mechanics: High Performance Quantum Many-Body Simulations via Deep Learning. 48:1-48:15
- Chen Zhang, Haojie Wang, Zixuan Ma, Lei Xie, Zeyu Song, Jidong Zhai:
UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation. 49:1-49:16
- Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens:
Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way. 50:1-50:16
- Hochan Lee, William Ruys, Ian Henriksen, Arthur Peters, Yineng Yan, Sean Stephens, Bozhi You, Henrique Fingler, Martin Burtscher, Milos Gligoric, Karl Schulz, Keshav Pingali, Christopher J. Rossbach, Mattan Erez, George Biros:
Parla: A Python Orchestration System for Heterogeneous Architectures. 51:1-51:15
- Guanxian Jiang, Qihui Zhou, Tatiana Jin, Boyang Li, Yunjian Zhao, Yichao Li, James Cheng:
VSGM: View-Based GPU-Accelerated Subgraph Matching on Large Graphs. 52:1-52:15
- Yihua Wei, Peng Jiang:
STMatch: Accelerating Graph Pattern Matching on GPU with Stack-Based Loop Optimizations. 53:1-53:13
- Dongxu Yang, Junhong Liu, Jiaxing Qi, Junjie Lai:
WholeGraph: A Fast Graph Neural Network Training Framework with Multi-GPU Distributed Shared Memory Architecture. 54:1-54:14
- Qi Chen, Shaonan Ma, Kang Chen, Teng Ma, Xin Liu, Dexun Chen, Yongwei Wu, Zuoning Chen:
SeqDLM: A Sequencer-Based Distributed Lock Manager for Efficient Shared File Access in a Parallel File System. 55:1-55:14
- Yingjin Qian, Wen Cheng, Lingfang Zeng, Marc-André Vef, Oleg Drokin, Andreas Dilger, Shuichi Ihara, Wusheng Zhang, Yang Wang, André Brinkmann:
MetaWBC: POSIX-Compliant Metadata Write-Back Caching for Distributed File Systems. 56:1-56:20
- Dominic Manno, Jason Lee, Prajwal Challa, Qing Zheng, David Bonnie, Gary Grider, Bradley W. Settlemyer:
GUFI: Fast, Secure File System Metadata Search for Both Privileged and Unprivileged Users. 57:1-57:14
- Robert Schenck, Ola Rønning, Troels Henriksen, Cosmin E. Oancea:
AD for an Array Language with Nested Parallelism. 58:1-58:15
- Rohan Yadav, Alex Aiken, Fredrik Kjolstad:
SpDISTAL: Compiling Distributed Sparse Tensor Computations. 59:1-59:15
- William S. Moses, Sri Hari Krishna Narayanan, Ludger Paehler, Valentin Churavy, Michel Schanen, Jan Hückelheim, Johannes Doerfert, Paul D. Hovland:
Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation. 60:1-60:18
- Sian Jin, Dingwen Tao, Houjun Tang, Sheng Di, Suren Byna, Zarija Lukic, Franck Cappello:
Accelerating Parallel Write via Deeply Integrating Predictive Lossy Compression with HDF5. 61:1-61:15
- Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello:
Dynamic Quality Metric Oriented Error Bounded Lossy Compression for Scientific Datasets. 62:1-62:15
- Menghan Jia, Yiming Zhang, Xinbiao Gan, Dongsheng Li, Erci Xu, Ruibo Wang, Kai Lu:
vGraph: Memory-Efficient Multicore Graph Processing for Traversal-Centric Algorithms. 63:1-63:14
- Philipp Schaad, Tal Ben-Nun, Torsten Hoefler:
Boosting Performance Optimization with Interactive Data Movement Visualization. 64:1-64:16
- Prasoon Sinha, Akhil Guliani, Rutwik Jain, Brandon Tran, Matthew D. Sinclair, Shivaram Venkataraman:
Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems. 65:1-65:15
- Zhen Du, Jiajia Li, Yinshan Wang, Xueqi Li, Guangming Tan, Ninghui Sun:
AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices. 66:1-66:15
- Konstantinos Parasyris, James Diffenderfer, Harshitha Menon, Ignacio Laguna, Jackson Vanover, Ryan Vogt, Daniel Osei-Kuffuor:
Approximate Computing Through the Lens of Uncertainty Quantification. 67:1-67:14
- Jose P. Pinilla, Steven J. E. Wilton:
Positive-Phase Temperature Scaling for Quantum-Assisted Boltzmann Machine Training. 68:1-68:12
- Kaihua Fu, Jiuchen Shi, Quan Chen, Ningxin Zheng, Wei Zhang, Deze Zeng, Minyi Guo:
QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services. 69:1-69:14
- Zheng Wang, Yuke Wang, Boyuan Feng, Dheevatsa Mudigere, Bharath Muthiah, Yufei Ding:
EL-Rec: Efficient Large-Scale Recommendation Model Training via Tensor-Train Embedding Table. 70:1-70:14
- Xiaoyang Sun, Wei Wang, Shenghao Qiu, Renyu Yang, Songfang Huang, Jie Xu, Zheng Wang:
STRONGHOLD: Fast and Affordable Billion-Scale Deep Learning Model Training. 71:1-71:17
- Yuntao Gui, Yidi Wu, Han Yang, Tatiana Jin, Boyang Li, Qihui Zhou, James Cheng, Fan Yu:
HGL: Accelerating Heterogeneous GNN Training with Holistic Representation and Optimization. 72:1-72:15
- Tal Ben-Nun, Linus Groner, Florian Deconinck, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas C. Schulthess, Torsten Hoefler:
Productive Performance Engineering for Weather and Climate Modeling with Python. 73:1-73:14
- Misun Min, Yu-Hsiang Lan, Paul F. Fischer, Elia Merzari, Stefan Kerkemeier, Malachi Phillips, Thilina Rathnayake, April Novak, Derek Gaston, Noel Chalmers, Tim Warburton:
Optimization of Full-Core Reactor Simulations on Summit. 74:1-74:11
- Milinda Fernando, David Neilsen, Eric W. Hirschmann, Yosef Zlochower, Hari Sundar, Omar Ghattas, George Biros:
A GPU-Accelerated AMR Solver for Gravitational Wave Propagation. 75:1-75:15
- Cong Li, Yu Zhang, Jialei Wang, Hang Chen, Xian Liu, Tai Huang, Liang Peng, Shen Zhou, Lixin Wang, Shijian Ge:
From Correctable Memory Errors to Uncorrectable Memory Errors: What Error Bits Tell. 76:1-76:14
- Rohit Zambre, Aparna Chandramowlishwaran:
Lessons Learned on MPI+Threads Communication. 77:1-77:16
- Hao Lu, Michael A. Matheson, Vladyslav Oles, J. Austin Ellis, Wayne Joubert, Feiyi Wang:
Climbing the Summit and Pushing the Frontier of Mixed Precision Benchmarks at Extreme Scale. 78:1-78:15
- Santosh Pandey, Lingda Li, Thomas Flynn, Adolfy Hoisie, Hang Liu:
Scalable Deep Learning-Based Microarchitecture Simulation on GPUs. 79:1-79:15
- Paul Caheny, Lluc Alvarez, Marc Casas, Miquel Moretó:
TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming Models. 80:1-80:15
- Pengmiao Zhang, Rajgopal Kannan, Ajitesh Srivastava, Anant V. Nori, Viktor K. Prasanna:
ReSemble: Reinforced Ensemble Framework for Data Prefetching. 81:1-81:14
- Junmin Xiao, Yunfei Pang, Qing Xue, Chaoyang Shui, Ke Meng, Hui Ma, Mingyi Li, Xiaoyang Zhang, Guangming Tan:
W-Cycle SVD: A Multilevel Algorithm for Batched SVD on GPUs. 82:1-82:16
- Qianxiang Ma, Sameer Deshmukh, Rio Yokota:
Scalable Linear Time Dense Direct Solver for 3-D Problems without Trailing Sub-Matrix Dependencies. 83:1-83:12
- Chao Chen, Per-Gunnar Martinsson:
Solving Linear Systems on a GPU with Hierarchically Off-Diagonal Low-Rank Approximations. 84:1-84:15
- Pengcheng Li, Yixin Guo, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Xu Liu:
Graph Neural Networks Based Memory Inefficiency Detection Using Selective Sampling. 85:1-85:14
- Pengcheng Li, Yixin Guo, Yongbin Gu:
Predicting Reuse Interval for Optimized Web Caching: An LSTM-Based Machine Learning Approach. 86:1-86:15
- Stella Bitchebe, Alain Tchana:
Out of Hypervisor (OoH): Efficient Dirty Page Tracking in Userspace Using Hardware Virtualization Features. 87:1-87:14
