default search action
IPDPS 2015: Hyderabad, India
- 2015 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015, Hyderabad, India, May 25-29, 2015. IEEE Computer Society 2015, ISBN 978-1-4799-8649-1
Keynote 1
- Phillip B. Gibbons:
Big data: Scale down, scale up, scale out. 3
Session 1: Graph and Social Analytics
- Hao Lu, Mahantesh Halappanavar, Daniel G. Chavarría-Miranda, Assefaw Hadish Gebremedhin, Ananth Kalyanaraman:
Balanced Coloring for Parallel Computing Applications. 7-16 - George M. Slota, Sivasankaran Rajamanickam, Kamesh Madduri:
High-Performance Graph Analytics on Manycore Processors. 17-27 - Xinyu Que, Fabio Checconi, Fabrizio Petrini, John A. Gunnels:
Scalable Community Detection with the Louvain Algorithm. 28-37 - Jonathan W. Berry, Michael J. Collins, Aaron Kearns, Cynthia A. Phillips, Jared Saia, Randy Smith:
Cooperative Computing for Autonomous Data Centers. 38-47
Session 2: Numerical Linear Algebra
- Gregoire Pichon, Azzam Haidar, Mathieu Faverge, Jakub Kurzak:
Divide and Conquer Symmetric Tridiagonal Eigensolver for Multicore Architectures. 51-60 - Shaden Smith, Niranjay Ravindran, Nicholas D. Sidiropoulos, George Karypis:
SPLATT: Efficient and Parallel Sparse Tensor-Matrix Multiplication. 61-70 - Piyush Sao, Xing Liu, Richard W. Vuduc, Xiaoye S. Li:
A Sparse Direct Solver for Distributed Memory Xeon Phi-Accelerated Systems. 71-81 - Tobias Maier, Peter Sanders, Jochen Speck:
Locality Aware DAG-Scheduling for LU-Decomposition. 82-92
Session 3: High Performance Networks and Congestion Management
- Jiwei Liu, Jun Yang, Rami G. Melhem:
GASOLIN: Global Arbitration for Streams of Data in Optical Links. 93-102 - Pablo Fuentes, Enrique Vallejo, Marina García, Ramón Beivide, Germán Rodríguez, Cyriel Minkenberg, Mateo Valero:
Contention-Based Nonminimal Adaptive Routing in High-Radix Networks. 103-112 - Abhinav Bhatele, Andrew R. Titus, Jayaraman J. Thiagarajan, Nikhil Jain, Todd Gamblin, Peer-Timo Bremer, Martin Schulz, Laxmikant V. Kalé:
Identifying the Culprits Behind Network Congestion. 113-122 - Jun Duan, Zhiyang Guo, Yuanyuan Yang:
Embedding Nonblocking Multicast Virtual Networks in Fat-Tree Data Centers. 123-132
Session 4: Software for Heterogeneous Many-Core Systems
- Pieter Hijma, Ceriel J. H. Jacobs, Rob van Nieuwpoort, Henri E. Bal:
Cashmere: Heterogeneous Many-Core Computing. 135-145 - Tarun Beri, Sorav Bansal, Subodh Kumar:
A Scheduling and Runtime Framework for a Cluster of Heterogeneous Machines with Multiple Accelerators. 146-155 - Wei Wu, Aurélien Bouteiller, George Bosilca, Mathieu Faverge, Jack J. Dongarra:
Hierarchical DAG Scheduling for Hybrid Distributed Systems. 156-165 - Niall Emmart, Charles C. Weems:
Pushing the Performance Envelope of Modular Exponentiation Across Multiple Generations of GPUs. 166-176
Session 5: Scheduling Algorithms
- Sanjoy K. Baruah:
Federated Scheduling of Sporadic DAG Task Systems. 179-186 - Josué Feliu, Julio Sahuquillo, Salvador Petit, José Duato:
Addressing Fairness in SMT Multicores with a Progress-Aware Scheduler. 187-196 - Mehmet Deveci, Kamer Kaya, Bora Uçar, Ümit V. Çatalyürek:
Fast and High Quality Topology-Aware Task Mapping. 197-206 - Hao Lin, Xin Qi, Shuo Yang, Samuel P. Midkiff:
Workload-Driven VM Consolidation in Cloud Data Centers. 207-216
Session 6: Concurrency in Memory Systems
- Matthieu Perrin, Achour Mostéfaoui, Claude Jard:
Update Consistency for Wait-Free Concurrent Objects. 219-228 - Aras Atalar, Anders Gidenstam, Paul Renaud-Goud, Philippas Tsigas:
Modeling Energy Consumption of Lock-Free Queue Implementations. 229-238 - Yiannis Nikolakopoulos, Anders Gidenstam, Marina Papatriantafilou, Philippas Tsigas:
A Consistency Framework for Iteration Operations in Concurrent Data Structures. 239-248 - Aditya Dhoke, Roberto Palmieri, Binoy Ravindran:
An Automated Framework for Decomposing Memory Transactions to Exploit Partial Rollback. 249-258
Session 7: MapReduce Advances
- Yandong Wang, Huansong Fu, Weikuan Yu:
Cracking Down MapReduce Failure Amplification through Analytics Logging and Migration. 261-270 - Xiao Yu, Bo Hong:
Grouping Blocks for MapReduce Co-Locality. 271-280 - Feng Liang, Francis C. M. Lau:
SMapReduce: Optimising Resource Allocation by Managing Working Slots at Runtime. 281-290 - Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Sharmin Islam, Raghunath Rajachandrasekar, Dhabaleswar K. Panda:
High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA. 291-300
Session 8: Performance and Energy Optimizations
- Jesmin Jahan Tithi, Pramod Ganapathi, Aakrati Talati, Sonal Aggarwal, Rezaul Alam Chowdhury:
High-Performance Energy-Efficient Recursive Dynamic Programming with Matrix-Multiplication-Like Flexible Kernels. 303-312 - Protonu Basu, Mary W. Hall, Samuel Williams, Brian van Straalen, Leonid Oliker, Phillip Colella:
Compiler-Directed Transformation for Higher-Order Stencils. 313-323 - Hung-Ching Chang, Bo Li, Godmar Back, Ali Raza Butt, Kirk W. Cameron:
LUC: Limiting the Unintended Consequences of Power Scaling on Parallel Transaction-Oriented Workloads. 324-333 - Kuangyu Zheng, Xiaodong Wang, Xiaorui Wang:
PowerFCT: Power Optimization of Data Center Network with Flow Completion Time Constraints. 334-343
Session 9: Dynamic Networks
- John Augustine, Tejas Kulkarni, Sumathi Sivasubramaniam:
Leader Election in Sparse Dynamic Networks with Churn. 347-356 - Alexander Mäcker, Manuel Malatyali, Friedhelm Meyer auf der Heide:
Online Top-k-Position Monitoring of Distributed Data Streams. 357-364 - Ashutosh Bhatia, R. C. Hansdah:
DSLR: A Distributed Schedule Length Reduction Algorithm for WSNs. 365-374 - Ramachandran Vaidyanathan, Costas Busch, Jerry L. Trahan, Gokarna Sharma, Suresh Rai:
Logarithmic-Time Complete Visibility for Robots with Lights. 375-384
Session 10: Applications on GPUs
- Michael G. Gowanlock, Henri Casanova:
Indexing of Spatiotemporal Trajectories for Efficient Distance Threshold Similarity Searches on the GPU. 387-396 - Xiaoxin Tang, Zhiyi Huang, David M. Eyers, Steven Mills, Minyi Guo:
Efficient Selection Algorithm for Fast k-NN Search on GPUs. 397-406 - Steven Dalton, Sean Baxter, Duane Merrill, Luke N. Olson, Michael Garland:
Optimizing Sparse Matrix Operations on GPUs Using Merge Path. 407-416 - Moritz Kreutzer, Andreas Pieper, Georg Hager, Gerhard Wellein, Andreas Alvermann, Holger Fehske:
Performance Engineering of the Kernel Polynomal Method on Large-Scale CPU-GPU Systems. 417-426
Session 11: Scheduling on Clusters
- Suraj Prabhakaran, Marcel Neumann, Sebastian Rinke, Felix Wolf, Abhishek Gupta, Laxmikant V. Kalé:
A Batch System with Efficient Adaptive Scheduling for Malleable and Evolving Applications. 429-438 - Zhou Zhou, Xu Yang, Zhiling Lan, Paul Rich, Wei Tang, Vitali A. Morozov, Narayan Desai:
Improving Batch Scheduling on Blue Gene/Q by Relaxing 5D Torus Network Allocation Constraints. 439-448 - Ana Jokanovic, José Carlos Sancho, Germán Rodríguez, Alejandro Lucero, Cyriel Minkenberg, Jesús Labarta:
Quiet Neighborhoods: Key to Protect Job Performance Predictability. 449-459 - Jeeva Paudel, Levi H. S. Lelis, José Nelson Amaral:
Stratified Sampling for Even Workload Partitioning Applied to IDA* and Delaunay Algorithms. 460-469
Session 12: Debugging and Verification
- Nicklas Bo Jensen, Niklas Quarfot Nielsen, Gregory L. Lee, Sven Karlsson, Matthew P. LeGendre, Martin Schulz, Dong H. Ahn:
A Scalable Prescriptive Parallel Debugging Model. 473-483 - Zhen Li, Ali Jannesari, Felix Wolf:
An Efficient Data-Dependence Profiler for Sequential and Parallel Programs. 484-493 - Menna Mostafa, Borzoo Bonakdarpour:
Decentralized Runtime Verification of LTL Specifications in Distributed Systems. 494-503 - Jingyu Zhou, Jiannong Cao, Bin Yao, Minyi Guo:
Fast Proof Generation for Verifying Cloud Search. 504-513
Keynote 2
- Alan Edelman:
Julia: A fresh approach to parallel programming. 517
Session 13: Randomized Algorithms
- Robert Elsässer, Dominik Kaaser:
On the Influence of Graph Density on Randomized Gossiping. 521-531 - Srikanta Tirthapura:
Distinct Random Sampling from a Distributed Stream. 532-541 - Petra Berenbrink, André Brinkmann, Robert Elsässer, Tom Friedetzky, Lars Nagel:
Randomized Renaming in Shared Memory Systems. 542-549 - Petra Berenbrink, Tom Friedetzky, Frederik Mallmann-Trenn, Sepehr Meshkinfamfard, Chris Wastell:
Threshold Load Balancing with Weighted Tasks. 550-558
Session 14: Scientific Applications I
- Evangelos Georganas, Aydin Buluç, Jarrod Chapman, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick:
merAligner: A Fully Parallel Sequence Aligner. 561-570 - William B. March, Bo Xiao, Chenhan D. Yu, George Biros:
An Algebraic Parallel Treecode in Arbitrary Dimensions. 571-580 - Salli Moustafa, Mathieu Faverge, Laurent Plagne, Pierre Ramet:
3D Cartesian Transport Sweep for Massively Parallel Architectures with PaRSEC. 581-590 - Linchuan Chen, Xin Huo, Gagan Agrawal:
A Pattern Specification and Optimizations Framework for Accelerating Scientific Computations on Heterogeneous Clusters. 591-600
Session 15: Storage Systems Architecture
- Yingxun Fu, Jiwu Shu:
D-Code: An Efficient RAID-6 Code to Optimize I/O Loads and Read Performance. 603-612 - Shuibing He, Xian-He Sun, Adnan Haider:
HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid Servers. 613-622 - Jiangling Yin, Jun Wang, Jian Zhou, Tyler Lukasiewicz, Dan Huang, Junyao Zhang:
Opass: Analysis and Optimization of Parallel Data Access on Distributed File Systems. 623-632 - Bo Mao, Suzhen Wu, Hong Jiang:
Improving Storage Availability in Cloud-of-Clouds with Hybrid Redundant Data Distribution. 633-642
Session 16: MPI and Charm++ Advances
- Thomas Ropars, Arnaud Lefray, Dohyun Kim, André Schiper:
Efficient Process Replication for MPI Applications: Sharing Work between Replicas. 645-654 - Nikhil Jain, Abhinav Bhatele, Jae-Seung Yeom, Mark F. Adams, Francesco Miniati, Chao Mei, Laxmikant V. Kalé:
Charm++ and MPI: Combining the Best of Both Worlds. 655-664 - Min Si, Antonio J. Peña, Jeff R. Hammond, Pavan Balaji, Masamichi Takagi, Yutaka Ishikawa:
Casper: An Asynchronous Progress Model for MPI RMA on Many-Core Architectures. 665-676 - Xiang Ni, Laxmikant V. Kalé, Rasmus Tamstorf:
Scalable Asynchronous Contact Mechanics Using Charm++. 677-686
Session 17: Combinatorial Algorithms and Optimization
- Ke Wang, Yanjun Qi, Jeffrey J. Fox, Mircea R. Stan, Kevin Skadron:
Association Rule Mining with the Micron Automata Processor. 689-699 - Rong Gu, Shanyong Wang, Fangfang Wang, Chunfeng Yuan, Yihua Huang:
Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark. 700-709 - Guojing Cong, Carol Meyers, Deepak Rajan, Tiziano Parriani:
Parallel Strategies for Solving Large Unit Commitment Problems in the California ISO Planning Model. 710-719
Session 18: Scientific Applications II
- Dheevatsa Mudigere, Srinivas Sridharan, Anand M. Deshpande, Jongsoo Park, Alexander Heinecke, Mikhail Smelyanskiy, Bharat Kaul, Pradeep Dubey, Dinesh K. Kaushik, David E. Keyes:
Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel Systems. 723-732 - David Ozog, Allen D. Malony, Andrew R. Siegel:
A Performance Analysis of SIMD Algorithms for Monte Carlo Simulations of Nuclear Reactor Cores. 733-742 - Doru-Thom Popovici, Francis P. Russell, Karl A. Wilkinson, Chris-Kriton Skylaris, Paul H. J. Kelly, Franz Franchetti:
Generating Optimized Fourier Interpolation Routines for Density Functional Theory Using SPIRAL. 743-752 - Scott French, Yili Zheng, Barbara Romanowicz, Katherine A. Yelick:
Parallel Hessian Assembly for Seismic Waveform Inversion Using Global Updates. 753-762
Session 19: Resilience
- Chongxiao Cao, Thomas Hérault, George Bosilca, Jack J. Dongarra:
Design for a Soft Error Resilient Dynamic Task-Based Runtime. 765-774 - Jeremy P. Erickson, Namhoon Kim, James H. Anderson:
Recovering from Overload in Multicore Mixed-Criticality Systems. 775-785 - Li Tan, Shuaiwen Leon Song, Panruo Wu, Zizhong Chen, Rong Ge, Darren J. Kerbyson:
Investigating the Interplay between Energy Efficiency and Resilience in High Performance Computing. 786-796
Session 20: Graph Analytics
- Harshvardhan, Brandon West, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger:
A Hybrid Approach to Processing Big Data Graphs on Memory-Restricted Systems. 799-808 - Yogesh L. Simmhan, Neel Choudhury, Charith Wickramaarachchi, Alok Gautam Kumbhare, Marc Frîncu, Cauligi S. Raghavendra, Viktor K. Prasanna:
Distributed Programming over Time-Series Graphs. 809-818 - Linchuan Chen, Xin Huo, Bin Ren, Surabhi Jain, Gagan Agrawal:
Efficient and Simplified Parallel Graph Processing over CPU and MIC. 819-828
Keynote 3
- Madhav V. Marathe:
Assisting H1N1 and Ebola Outbreak Response through High Performance Networked Epidemiology. 831
Best Papers Session
- Michael A. Bender, Jonathan W. Berry, Simon D. Hammond, K. Scott Hemmert, Samuel McCauley, Branden Moore, Benjamin Moseley, Cynthia A. Phillips, David S. Resnick, Arun Rodrigues:
Two-Level Main Memory Co-Design: Multi-threaded Algorithmic Primitives, Analysis, and Simulation. 835-846 - Yang You, James Demmel, Kenneth Czechowski, Le Song, Richard W. Vuduc:
CA-SVM: Communication-Avoiding Support Vector Machines on Distributed Systems. 847-859 - J. P. Grossman, Brian Towles, Brian Greskamp, David E. Shaw:
Filtering, Reductions and Synchronization in the Anton 2 Network. 860-870 - Roberto Belli, Torsten Hoefler:
Notified Access: Extending Remote Memory Access Programming Models for Producer-Consumer Synchronization. 871-881
Session 21: Algorithms for Fault Tolerance
- Alejandro Z. Tomsic, Pierre Sens, João Garcia, Luciana Arantes, Julien Sopena:
2W-FD: A Failure Detector Algorithm with QoS. 885-893 - Silvia Bonomi, Maria Potop-Butucaru, Sébastien Tixeuil:
Stabilizing Byzantine-Fault Tolerant Storage. 894-903 - Jean Paul Bahsoun, Rachid Guerraoui, Ali Shoker:
Making BFT Protocols Really Adaptive. 904-913 - Naoto Sasaki, Kento Sato, Toshio Endo, Satoshi Matsuoka:
Exploration of Lossy Compression for Application-Level Checkpoint/Restart. 914-922
Session 22: Scheduling and Load Balancing
- Max Rietmann, Daniel Peter, Olaf Schenk, Bora Uçar, Marcus J. Grote:
Load-Balanced Local Time Stepping for Large-Scale Wave Propagation. 925-935 - Yinglong Xia, Lifeng Nai, Jui-Hsin Lai:
Towards Balance-Affinity Tradeoff in Concurrent Subgraph Traversals. 936-945 - Jingjing Wang, Nael B. Abu-Ghazaleh, Dmitry V. Ponomarev:
Controlled Contention: Balancing Contention and Reservation in Multicore Application Scheduling. 946-955 - Dazhao Cheng, Jia Rao, Changjun Jiang, Xiaobo Zhou:
Resource and Deadline-Aware Job Scheduling in Dynamic Hadoop Clusters. 956-965
Session 23: Heterogeneous Systems
- Jingweijia Tan, Xin Fu:
Mitigating the Susceptibility of GPGPUs Register File to Process Variations. 969-978 - Jayvant Anantpur, R. Govindarajan:
PRO: Progress Aware GPU Warp Scheduling Algorithm. 979-988 - Tobias Fjalling, Per Stenström:
Performance Impact of Batching Web-Application Requests Using Hot-Spot Processing on GPUs. 989-999 - Lavanya Ramapantulu, Dumitrel Loghin, Yong Meng Teo:
An Approach for Energy Efficient Execution of Hybrid Parallel Programs. 1000-1009
Session 24: I/O Optimizations
- Ana Gainaru, Guillaume Aupy, Anne Benoit, Franck Cappello, Yves Robert, Marc Snir:
Scheduling the I/O of HPC Applications Under Congestion. 1013-1022 - Bogdan Nicolae:
Leveraging Naturally Distributed Data Redundancy to Reduce Collective I/O Replication Overhead. 1023-1032 - Tong Jin, Fan Zhang, Qian Sun, Hoang Bui, Melissa Romanus, Norbert Podhorszki, Scott Klasky, Hemanth Kolla, Jacqueline Chen, Robert Hager, Choong-Seock Chang, Manish Parashar:
Exploring Data Staging Across Deep Memory Hierarchies for Coupled Data Intensive Simulation Workflows. 1033-1042 - Pham Nguyen Quang Anh, Rui Fan, Yonggang Wen:
Reducing Vector I/O for Faster GPU Sparse Matrix-Vector Multiplication. 1043-1052
Session 25: Graph Algorithms
- Henning Meyerhenke, Peter Sanders, Christian Schulz:
Parallel Graph Partitioning for Complex Networks. 1055-1064 - Lélia Blin, Fadwa Boubekeur, Swan Dubois:
A Self-Stabilizing Memory Efficient Algorithm for the Minimum Diameter Spanning Tree under an Omnipotent Daemon. 1065-1074 - Ariful Azad, Aydin Buluç, Alex Pothen:
A Parallel Tree Grafting Algorithm for Maximum Cardinality Matching in Bipartite Graphs. 1075-1084
Session 26: Resource Management
- Koyel Mukherjee, Partha Dutta, Gurulingesh Raravi, Thangaraj Rajasubramaniam, Koustuv Dasgupta, Atul Singh:
Fair Resource Allocation for Heterogeneous Tasks. 1087-1096 - Tan Li, Yufei Ren, Dantong Yu, Shudong Jin:
Resources-Conscious Asynchronous High-Speed Data Transfer in Multicore Systems: Design, Optimizations, and Evaluation. 1097-1106 - Tridib Mukherjee, Partha Dutta, Vinay Gangadhar Hegde, Sujit Gujar:
RISC: Robust Infrastructure over Shared Computing Resources through Dynamic Pricing and Incentivization. 1107-1116
Session 27: Architectural Support for Runtime and Thermal Management
- Alberto Ros, Alexandra Jimborean:
A Dual-Consistency Cache Coherence Protocol. 1119-1128 - Tamer Dallou, Nina Engelhardt, Ahmed Elhossini, Ben H. H. Juurlink:
Nexus#: A Distributed Hardware Task Manager for Task-Based Programming Models. 1129-1138 - Kaicheng Zhang, Seda Ogrenci Memik, Gokhan Memik, Kazutomo Yoshii, Rajesh Sankaran, Peter H. Beckman:
Minimizing Thermal Variation Across System Components. 1139-1148
Session 28: Performance Monitoring and Prediction
- Mihail Popov, Chadi Akel, Florent Conti, William Jalby, Pablo de Oliveira Castro:
PCERE: Fine-Grained Parallel Benchmark Decomposition for Scalability Prediction. 1151-1160 - Anirudh Jayakumar, Prakash Murali, Sathish Vadhiyar:
Matching Application Signatures for Performance Predictions Using a Single Execution. 1161-1170 - Hammad Khan, Julien Gascon-Samson, Jörg Kienzle, Bettina Kemme:
Monitoring Large-Scale Location-Based Information Systems. 1171-1181
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.