default search action
SC 2014: New Orleans, LA, USA
- Trish Damkroger, Jack J. Dongarra:
International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014, New Orleans, LA, USA, November 16-21, 2014. IEEE Computer Society 2014, ISBN 978-1-4799-5500-8
ACM Gordon Bell Finalist I
- Alexander Heinecke, Alexander Breuer, Sebastian Rettenberger, Michael Bader, Alice-Agnes Gabriel, Christian Pelties, Arndt Bode, William Barth, Xiangke Liao, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Pradeep Dubey:
Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers. 3-14 - Tsuyoshi Ichimura, Kohei Fujita, Seizo Tanaka, Muneo Hori, Wijerathne Maddegedara Lalith Lakshman, Yoshihisa Shizawa, Hiroshi Kobayashi:
Physics-Based Urban Earthquake Simulation Enhanced by 10.7 BlnDOF × 30 K Time-Step Unstructured FE Non-Linear Seismic Wave Simulation. 15-26 - Andrew S. Cassidy, Rodrigo Alvarez-Icaza, Filipp Akopyan, Jun Sawada, John V. Arthur, Paul Merolla, Pallab Datta, Marc González Tallada, Brian Taba, Alexander Andreopoulos, Arnon Amir, Steven K. Esser, Jeff Kusnitz, Rathinakumar Appuswamy, Chuck Haymes, Bernard Brezzo, Roger Moussalli, Ralph Bellofatto, Christian W. Baks, Michael Mastro, Kai Schleupen, Charles E. Cox, Ken Inoue, Steven E. Millman, Nabil Imam, Emmett McQuinn, Yutaka Y. Nakamura, Ivan Vo, Chen Guok, Don Nguyen, Scott Lekuch, Sameh W. Asaad, Daniel J. Friedman, Bryan L. Jackson, Myron Flickner, William P. Risk, Rajit Manohar, Dharmendra S. Modha:
Real-Time Scalable Cortical Computing at 46 Giga-Synaptic OPS/Watt with ~100× Speedup in Time-to-Solution and ~100, 000× Reduction in Energy-to-Solution. 27-38
ACM Gordon Bell Finalist II
- David E. Shaw, J. P. Grossman, Joseph A. Bank, Brannon Batson, J. Adam Butts, Jack C. Chao, Martin M. Deneroff, Ron O. Dror, Amos Even, Christopher H. Fenton, Anthony Forte, Joseph Gagliardo, Gennette Gill, Brian Greskamp, C. Richard Ho, Douglas J. Ierardi, Lev Iserovich, Jeffrey Kuskin, Richard H. Larson, Timothy Layman, Li-Siang Lee, Adam K. Lerer, Chester Li, Daniel Killebrew, Kenneth M. Mackenzie, Shark Yeuk-Hai Mok, Mark A. Moraes, Rolf Mueller, Lawrence J. Nociolo, Jon L. Peticolas, Terry Quan, Daniel Ramot, John K. Salmon, Daniele Paolo Scarpazza, U. Ben Schafer, Naseer Siddique, Christopher W. Snyder, Jochen Spengler, Ping Tak Peter Tang, Michael Theobald, Horia Toma, Brian Towles, Benjamin Vitale, Stanley C. Wang, Cliff Young:
Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer. 41-53 - Jeroen Bédorf, Evghenii Gaburov, Michiko S. Fujii, Keigo Nitadori, Tomoaki Ishiyama, Simon Portegies Zwart:
24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs. 54-65
Heterogeneity and Scaling in Applications
- Simon Heybrock, Bálint Joó, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Tilo Wettig, Pradeep Dubey:
Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors. 69-80 - James C. Phillips, Yanhua Sun, Nikhil Jain, Eric J. Bohm, Laxmikant V. Kalé:
Mapping to Irregular Torus Topologies and Other Techniques for Petascale Biomolecular Simulation. 81-91 - Dhairya Malhotra, Amir Gholami, George Biros:
A Volume Integral Equation Stokes Solver for Problems with Variable Coefficients. 92-102
Memory and Microarchitecture
- Changhui Lin, Vijay Nagarajan, Rajiv Gupta:
Fence Scoping. 105-116 - Ralph Nathan, Bryan Anthonio, Shih-Lien Lu, Helia Naeimi, Daniel J. Sorin, Xiaobai Sun:
Recycled Error Bits: Energy-Efficient Architectural Support for Floating Point Accuracy. 117-127 - Niladrish Chatterjee, Mike O'Connor, Gabriel H. Loh, Nuwan Jayasena, Rajeev Balasubramonian:
Managing DRAM Latency Divergence in Irregular GPGPU Applications. 128-139
Performance Measurement
- Jidong Zhai, Jianfei Hu, Xiongchao Tang, Xiaosong Ma, Wenguang Chen:
CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression. 143-153 - Anthony M. Agelastos, Benjamin A. Allan, Jim M. Brandt, Paul Cassella, Jeremy Enos, Joshi Fullop, Ann C. Gentile, Steve Monk, Nichamon Naksinehaboon, Jeff Ogden, Mahesh Rajan, Michael T. Showerman, Joel Stevenson, Narate Taerat, Thomas W. Tucker:
The Lightweight Distributed Metric Service: A Scalable Infrastructure for Continuous Monitoring of Large Scale Computing Systems and Applications. 154-165 - Alfredo Giménez, Todd Gamblin, Barry Rountree, Abhinav Bhatele, Ilir Jusufi, Peer-Timo Bremer, Bernd Hamann:
Dissecting On-Node Memory Access Performance: A Semantic Approach. 166-176
Accelerators
- Peng Li, Guodong Li, Ganesh Gopalakrishnan:
Practical Symbolic Race Checking of GPU Programs. 179-190 - Mohamed Wahib, Naoya Maruyama:
Scalable Kernel Fusion for Memory-Bound GPU Applications. 191-202 - Matthias Noack, Florian Wende, Thomas Steinke, Frank Cordes:
A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters. 203-214
Best Practices in File Systems
- Sarp Oral, James Simmons, Jason Hill, Dustin Leverman, Feiyi Wang, Matthew A. Ezell, Ross G. Miller, Douglas Fuller, Raghul Gunasekaran, Youngjae Kim, Saurabh Gupta, Devesh Tiwari, Sudharshan S. Vazhkudai, James H. Rogers, David Dillow, Galen M. Shipman, Arthur S. Bland:
Best Practices and Lessons Learned from Deploying and Operating Large-Scale Data-Centric Parallel File Systems. 217-228 - Robert T. McLay, Doug James, Si Liu, John Cazes, William L. Barth:
A User-Friendly Approach for Tuning Parallel File Operations. 229-236 - Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson:
IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. 237-248
Earth and Space Sciences
- Takashi Shimokawabe, Takayuki Aoki, Naoyuki Onodera:
High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA. 251-261 - Ali Charara, Hatem Ltaief, Damien Gratadour, David E. Keyes, Arnaud Sevin, Ahmad Abdelfattah, Eric Gendron, Carine Morel, Fabrice Vidal:
Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System. 262-273 - Dave A. May, Jed Brown, Laetitia Le Pourhiet:
pTatin3D: High-Performance Methods for Long-Term Lithospheric Dynamics. 274-284
Compiler Analysis and Optimization
- Jun Shirako, Louis-Noël Pouchet, Vivek Sarkar:
Oil and Water Can Mix: An Integration of Polyhedral and AST-Based Transformations. 287-298 - Timothy G. Armstrong, Justin M. Wozniak, Michael Wilde, Ian T. Foster:
Compiler Techniques for Massively Scalable Implicit Task Parallelism. 299-310 - Zhilei Xu, Shoaib Kamil, Armando Solar-Lezama:
MSL: A Synthesis Enabled Language for Distributed Implementations. 311-322
Networks
- Ahmed H. Abdel-Gawad, Mithuna Thottethodi, Abhinav Bhatele:
RAHTM: Routing Algorithm Aware Hierarchical Task Mapping. 325-335 - Nikhil Jain, Abhinav Bhatele, Xiang Ni, Nicholas J. Wright, Laxmikant V. Kalé:
Maximizing Throughput on a Dragonfly Network. 336-347 - Maciej Besta, Torsten Hoefler:
Slim Fly: A Cost Effective Low-Diameter Network Topology. 348-359
Parallel Algorithms
- Penporn Koanantakool, Katherine A. Yelick:
A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm. 363-374 - Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy, P. Sadayappan:
A Communication-Optimal Framework for Contracting Distributed Tensors. 375-386 - Julian Shun:
Fast Parallel Computation of Longest Common Prefixes. 387-398
Big Data Analysis
- Pingpeng Yuan, Wenya Zhang, Changfeng Xie, Hai Jin, Ling Liu, Kisung Lee:
Fast Iterative Graph Computation: A Path Centric Approach. 401-412 - Sidharth Kumar, John Edwards, Peer-Timo Bremer, Aaron Knoll, Cameron Christensen, Venkatram Vishwanath, Philip H. Carns, John A. Schmidt, Valerio Pascucci:
Efficient I/O and Storage of Adaptive-Resolution Data. 413-423 - James P. Ahrens, Sébastien Jourdain, Patrick O'Leary, John Patchett, David H. Rogers, Mark R. Petersen:
An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis. 424-434
High Performance Genomics
- Evangelos Georganas, Aydin Buluç, Jarrod Chapman, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick:
Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly. 437-448 - Kanak Mahadik, Somali Chaterji, Bowen Zhou, Milind Kulkarni, Saurabh Bagchi:
Orion: Scaling Genomic Sequence Matching with Fine-Grained Parallelization. 449-460 - Sanchit Misra, Md. Vasimuddin, Kiran Pamnany, Sriram P. Chockalingam, Yong Dong, Min Xie, Maneesha R. Aluru, Srinivas Aluru:
Parallel Bayesian Network Structure Learning for Genome-Scale Gene Networks. 461-472
MPI
- Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William Gropp, Ahmad Afsahi:
Nonblocking Epochs in MPI One-Sided Communication. 475-486 - Srinivas Sridharan, James Dinan, Dhiraj D. Kalamkar:
Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints. 487-498 - Zhezhe Chen, James Dinan, Zhen Tang, Pavan Balaji, Hua Zhong, Jun Wei, Tao Huang, Feng Qin:
MC-Checker: Detecting Memory Consistency Errors in MPI One-Sided Applications. 499-510
Cloud Computing I
- Dipanjan Sengupta, Anshuman Goswami, Karsten Schwan, Krishna Pallavi:
Scheduling Multi-tenant Cloud Workloads on Accelerator-Based Systems. 513-524 - Ismail El-Helw, Rutger F. H. Hofman, Henri E. Bal:
Scaling MapReduce Vertically and Horizontally. 525-535 - Daniele D'Agostino, Andrea Clematis, Antonella Galizia, Alfonso Quarati, Emanuele Danovaro, Luca Roverelli, Gabriele Zereik, Dieter Kranzlmüller, Michael Schiffers, Nils gentschen Felde, Christian Straube, Olivier Caumont, Evelyne Richard, Luis Garrote, Quillon K. Harpham, H. R. A. Jagers, Vladimir Dimitrijevic, Ljiljana Dekic, Elisabetta Fiori, Fabio Delogu, Antonio Parodi:
The DRIHM Project: A Flexible Approach to Integrate HPC, Grid and Cloud Resources for Hydro-Meteorological Research. 536-546
Graph Algorithms
- Roger A. Pearce, Maya B. Gokhale, Nancy M. Amato:
Faster Parallel Traversal of Scale Free Graphs at Extreme Scale with Vertex Delegates. 549-559 - Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Fredrik Manne, Salman Habib, Pradeep Dubey:
Pardicle: Parallel Approximate Density-Based Clustering. 560-571 - Adam McLaughlin, David A. Bader:
Scalable and High Performance Betweenness Centrality on the GPU. 572-583
Hardware Vulnerability and Recovery
- Chen-Yong Cher, Meeta Sharma Gupta, Pradip Bose, K. Paul Muller:
Understanding Soft Error Resiliency of Blue Gene/Q Compute Chip through Hardware Proton Irradiation and Software Fault Injection. 587-596 - Jens Domke, Torsten Hoefler, Satoshi Matsuoka:
Fail-in-Place Network Design: Interaction Between Topology, Routing Algorithm and Failures. 597-608 - Sarah Ellen Michalak, William N. Rust, John T. Daly, Rew J. Dubois, David H. Dubois:
Correctness Field Testing of Production and Decommissioned High Performance Computing Platforms at Los Alamos National Laboratory. 609-619
I/O and Dynamic Optimization
- Matthieu Dorier, Shadi Ibrahim, Gabriel Antoniu, Robert B. Ross:
Omnisc'IO: A Grammar-Based Approach to Spatial and Temporal I/O Patterns Prediction. 623-634 - Dong Dai, Yong Chen, Dries Kimpe, Robert B. Ross:
Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems. 635-646 - Bilge Acun, Abhishek Gupta, Nikhil Jain, Akhil Langer, Harshitha Menon, Eric Mikida, Xiang Ni, Michael P. Robson, Yanhua Sun, Ehsan Totoni, Lukasz Wesolowski, Laxmikant V. Kalé:
Parallel Programming with Migratable Objects: Charm++ in Practice. 647-658
Quantum Simulations in Materials and Chemistry
- Ken-ichi Nomura, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta, Kohei Shimamura, Fuyuki Shimojo, Manaschai Kunaseth, Paul C. Messina, Nichols A. Romero:
Metascalable Quantum Molecular Dynamics Simulations of Hydrogen-on-Demand. 661-673 - Edoardo Aprà, Michael Klemm, Karol Kowalski:
Efficient Implementation of Many-Body Quantum Chemical Methods on the Intel® Xeon Phi Coprocessor. 674-684 - William Dawson, François Gygi:
Optimized Scheduling Strategies for Hybrid Density Functional theory Electronic Structure Calculations. 685-692
Resilience
- Li Yu, Dong Li, Sparsh Mittal, Jeffrey S. Vetter:
Quantitatively Modeling Application Resilience with the Data Vulnerability Factor. 695-706 - Carlos H. A. Costa, Yoonho Park, Bryan S. Rosenburg, Chen-Yong Cher, Kyung Dong Ryu:
A System Software Approach to Proactive Memory-Error Avoidance. 707-718 - Mehmet Can Kurt, Sriram Krishnamoorthy, Kunal Agrawal, Gagan Agrawal:
Fault-Tolerant Dynamic Task Graph Scheduling. 719-730
Machine Learning and Data Analytics
- Zhengzhang Chen, Seung Woo Son, William Hendrix, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary:
NUMARCK: Machine Learning Algorithm for Resiliency and Checkpointing. 733-744 - I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury:
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. 745-753 - Yu Hua, Hong Jiang, Dan Feng:
FAST: Near Real-Time Searchable Data Analytics for the Cloud. 754-765
Numerical Kernels
- Joseph L. Greathouse, Mayank Daga:
Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format. 769-780 - Arash Ashari, Naser Sedaghati, John Eisenlohr, Srinivasan Parthasarathy, P. Sadayappan:
Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications. 781-792 - Catherine Mills Olschanowsky, Michelle Mills Strout, Stephen M. Guzik, John Loffeld, Jeffrey Hittinger:
A Study on Balancing Parallelism, Data Locality, and Recomputation in Existing PDE Solvers. 793-804
Power and Energy Efficiency
- Osman Sarood, Akhil Langer, Abhishek Gupta, Laxmikant V. Kalé:
Maximizing Throughput of Overprovisioned HPC Data Centers Under a Strict Power Budget. 807-818 - Ben Cumming, Gilles Fourestey, Oliver Fuhrer, Tobias Gysi, Massimiliano Fatica, Thomas C. Schulthess:
Application Centric Energy-Efficiency Study of Distributed Multi-Core and Hybrid CPU-GPU Systems. 819-829 - Oreste Villa, Daniel R. Johnson, Mike O'Connor, Evgeny Bolotin, David W. Nellans, Justin Luitjens, Nikolai Sakharnykh, Peng Wang, Paulius Micikevicius, Anthony Scudiero, Stephen W. Keckler, William J. Dally:
Scaling the Power Wall: A Path to Exascale. 830-841
Data Locality and Load Balancing
- Michael Bauer, Sean Treichler, Elliott Slaughter, Alex Aiken:
Structure Slicing: Extending Logical Regions with Fields. 845-856 - Jonathan Lifflander, Sriram Krishnamoorthy, Laxmikant V. Kalé:
Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing. 857-868 - Mehmet Can Kurt, Gagan Agrawal:
DISC: A Domain-Interaction Based Programming Model with Support for Heterogeneous Execution. 869-880
Optimized Checkpointing
- Kurt B. Ferreira, Patrick M. Widener, Scott Levy, Dorian C. Arnold, Torsten Hoefler:
Understanding the Effects of Communication and Coordination on Checkpointing at Scale. 883-894 - Marc Gamell, Daniel S. Katz, Hemanth Kolla, Jacqueline Chen, Scott Klasky, Manish Parashar:
Exploring Automatic, Online Failure Recovery for Scientific Applications at Extreme Scales. 895-906 - Sheng Di, Leonardo Arturo Bautista-Gomez, Franck Cappello:
Optimization of a Multilevel Checkpoint Model with Uncertain Execution Scales. 907-918
Sparse Solvers
- Konstantinos I. Karantasis, Andrew Lenharth, Donald Nguyen, María Jesús Garzarán, Keshav Pingali:
Parallelization of Reordering Algorithms for Bandwidth and Wavefront Reduction. 921-932 - Ichitaro Yamazaki, Sivasankaran Rajamanickam, Erik G. Boman, Mark Hoemmen, Michael A. Heroux, Stanimire Tomov:
Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster. 933-944 - Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Xing Liu, Md. Mostofa Ali Patwary, Yutong Lu, Pradeep Dubey:
Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices. 945-955
Cloud Computing II
- Yanfei Guo, Jia Rao, Changjun Jiang, Xiaobo Zhou:
FlexSlot: Moving Hadoop Into the Cloud with Flexible Slot Management. 959-969 - Haikun Liu, Bingsheng He:
Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds. 970-981 - Yifan Gong, Bingsheng He, Dan Li:
Finding Constant from Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds. 982-993
Large-Scale Visualization
- Tom Peterka, Dmitriy Morozov, Carolyn L. Phillips:
High-Performance Computation of Distributed-Memory Parallel 3D Voronoi and Delaunay Tessellation. 997-1007 - Kewei Lu, Han-Wei Shen, Tom Peterka:
Scalable Computation of Stream Surfaces on Large Scale Vector Fields. 1008-1019 - Aaditya G. Landge, Valerio Pascucci, Attila Gyulassy, Janine Bennett, Hemanth Kolla, Jacqueline Chen, Peer-Timo Bremer:
In-Situ Feature Extraction of Large Scale Combustion Simulations Using Segmented Merge Trees. 1020-1031
Memory System Energy Efficiency
- Xun Jian, Rakesh Kumar:
ECC Parity: A Technique for Efficient Memory Error Resilience for Multi-Channel Memory Systems. 1035-1046 - Ehsan Totoni, Josep Torrellas, Laxmikant V. Kalé:
Using an Adaptive HPC Runtime System to Reconfigure the Cache Hierarchy. 1047-1058 - Young Hoon Son, Seongil O, Hyunggyun Yang, Daejin Jung, Jung Ho Ahn, John Kim, Jangwoo Kim, Jae W. Lee:
Microbank: Architecting Through-Silicon Interposer-Based Main Memory Systems. 1059-1070
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.