


default search action
ISPASS 2023: Raleigh, NC, USA
- IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2023, Raleigh, NC, USA, April 23-25, 2023. IEEE 2023, ISBN 979-8-3503-9739-0
- Geonhwa Jeong, Bikash Sharma, Nick Terrell, Abhishek Dhanotia, Zhiwei Zhao, Niket Agarwal, Arun Kejariwal, Tushar Krishna:
Characterization of Data Compression in Datacenters. 1-12 - Fatemeh Ghasemi, Lukas Liedtke
, Magnus Jahre:
PES: An Energy and Throughput Model for Energy Harvesting IoT Systems. 13-23 - Jiaao Ma, Ceyu Xu
, Lisa Wu Wills:
PyTFHE: An End-to-End Compilation and Execution Framework for Fully Homomorphic Encryption Applications. 24-34 - Juan Gómez-Luna, Yuxin Guo, Sylvan Brocard, Julien Legriel, Remy Cimadomo, Geraldo F. Oliveira
, Gagandeep Singh, Onur Mutlu:
Evaluating Machine LearningWorkloads on Memory-Centric Computing Systems. 35-49 - Shruti Yadav Narayana, Jie Tong, Anish Krishnakumar, Nuriye Yildirim, Emily Shriver, Mahesh Ketkar, Ümit Y. Ogras:
MQL: ML-Assisted Queuing Latency Analysis for Data Center Networks. 50-60 - Gino Chacon, Nathan Gober, Krishnendra Nathella, Paul V. Gratz, Daniel A. Jiménez:
A Characterization of the Effects of Software Instruction Prefetching on an Aggressive Front-end. 61-70 - Emilio Domínguez-Sánchez, Alberto Ros:
MBPlib: Modular Branch Prediction Library. 71-80 - John Alistair Kressel, Guillermo Callaghan, Cosmin Gorgovan, Mikel Luján:
Evaluating the Impact of Optimizations for Dynamic Binary Modification on 64-bit RISC-V. 81-91 - Anna Yue, Sanyam Mehta:
An Application-Oriented Approach to Designing Hybrid CPU Architectures. 92-102 - Johnson Umeike
, Neel Patel, Alex Manley, Amin Mamandipoor
, Heechul Yun, Mohammad Alian
:
Profiling gem5 Simulator. 103-113 - Markos Kynigos, Javier Navaridas, Jose Antonio Pascual, Mikel Luján:
A Novel Simulation Methodology for Silicon Photonic Switching Fabrics. 114-123 - Stijn Eyerman, Sam Van den Steen, Wim Heirman, Ibrahim Hur:
Simulating Wrong-Path Instructions in Decoupled Functional-First Simulation. 124-133 - Alexander Hankin, Lillian Pentecost, Dongmoon Min, David Brooks, Gu-Yeon Wei:
Is the Future Cold or Tall? Design Space Exploration of Cryogenic and 3D Embedded Cache Memory. 134-144 - Mohsin Shan, Deniz Gurevin, Jared Nye, Caiwen Ding, Omer Khan:
MergePath-SpMM: Parallel Sparse Matrix-Matrix Algorithm for Graph Neural Network Acceleration. 145-156 - Shvetank Prakash, Tim Callahan, Joseph Bushagour, Colby R. Banbury, Alan V. Green, Pete Warden, Tim Ansell, Vijay Janapa Reddi:
CFU Playground: Full-Stack Open-Source Framework for Tiny Machine Learning (TinyML) Acceleration on FPGAs. 157-167 - Matthew Joseph Adiletta, Jesmin Jahan Tithi, Emmanouil-Ioannis Farsarakis
, Gerasimos Gerogiannis, Robert Adolf, Robert Benke, Sidharth Kashyap, Samuel Hsia, Kartik Lakhotia, Fabrizio Petrini, Gu-Yeon Wei, David Brooks:
Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA. 168-177 - Zhuren Liu, Shouzhe Zhang, Justin Garrigus, Hui Zhao:
Genomics-GPU: A Benchmark Suite for GPU-accelerated Genome Analysis. 178-188 - Lauren Biernacki, Biniyam Mengist Tiruye, Meron Zerihun Demissie
, Fitsum Assamnew Andargie
, Brandon Reagen
, Todd M. Austin:
Exploring the Efficiency of Data-Oblivious Programs. 189-200 - Yanwen Xu, Ang Li
, Tyler Sorensen
:
Redwood: Flexible and Portable Heterogeneous Tree Traversal Workloads. 201-213 - Vignesh Balaji
, Neal Clayton Crago, Aamer Jaleel, Stephen W. Keckler:
Community-based Matrix Reordering for Sparse Linear Algebra Optimization. 214-223 - Mahmood Naderan-Tahan, Hossein SeyyedAghaei, Lieven Eeckhout:
Sieve: Stratified GPU-Compute Workload Sampling. 224-234 - Maurus Item, Geraldo F. Oliveira
, Juan Gómez-Luna, Mohammad Sadrosadati, Yuxin Guo, Onur Mutlu:
TransPimLib: Efficient Transcendental Functions for Processing-in-Memory Systems. 235-247 - Seokjin Go
, Hyunwuk Lee
, Junsung Kim, Jiwon Lee
, Myung Kuk Yoon
, Won Woo Ro:
Early-Adaptor: An Adaptive Framework forProactive UVM Memory Management. 248-258 - MohammadHossein Olyaiy, Christopher Ng, Alexandra (Sasha) Fedorova, Mieszko Lis:
Sunstone: A Scalable and Versatile Scheduler for Mapping Tensor Algebra on Spatial Accelerators. 259-271 - Deepraj Soni, Negar Neda, Naifeng Zhang
, Benedict Reynwar, Homer Gamil, Benjamin Heyman, Mohammed Nabeel, Ahmad Al Badawi, Yuriy Polyakov
, Kellie Canida, Massoud Pedram, Michail Maniatakos, David Bruce Cousins, Franz Franchetti, Matthew French, Andrew G. Schmidt, Brandon Reagen
:
RPU: The Ring Processing Unit. 272-282 - William Won
, Taekyung Heo, Saeed Rashidi, Srinivas Sridharan, Sudarshan Srinivasan, Tushar Krishna:
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale. 283-294 - Maziar Amiraski, David Werner, Alexander Hankin, Julien Sebot, Kaushik Vaidyanathan, Mark Hempstead
:
Boreas: A Cost-Effective Mitigation Method for Advanced Hotspots using Machine Learning and Hardware Telemetry. 295-305 - Diksha Moolchandani
, Joyjit Kundu
, Frederik Ruelens, Peter Vrancx, Timon Evenblij
, Manu Perumkunnil:
AMPeD: An Analytical Model for Performance in Distributed Training of Transformers. 306-315 - Michael Gilbert, Yannan Nellie Wu, Angshuman Parashar, Vivienne Sze, Joel S. Emer:
LoopTree: Enabling Exploration of Fused-layer Dataflow Accelerators. 316-318 - Sanya Srivastava, Tyler Sorensen:
Degree-Aware Kernel Mapping for Graph Processing on GPUs. 319-321 - Mahita Nagabhiru
, Greg Byrd
:
lfbench: a lock-free microbenchmark suite. 322-324 - Zheming Jin, Jeffrey S. Vetter:
A Benchmark Suite for Improving Performance Portability of the SYCL Programming Model. 325-327 - Tom Glint, Aryan Gupta, Daniel Giftson, Gaurav Shah, Vrajesh Patel, Ruchit Chudasama, Sukanya More, Joycee Mekie:
Impact of Optimal Design Point on Performance Metrics of DNN accelerators in FPGA. 328-330 - Lina Sawalha, Grant Deljevic:
Workload Characterization Using Hierarchical PCA. 331-333 - Jinghan Huang, Jiaqi Lou, Yan Sun, Tianchen Wang, Eun Kyung Lee, Nam Sung Kim:
Analyzing Energy Efficiency of a Server with a SmartNIC under SLO Constraints. 334-336 - Athanasios Kordelas, Thanasis Spyrou, Spyros Voulgaris, Vasileios Megalooikonomou, Nikos Deligiannis:
KORDI: A Framework for Real-Time Performance and Cost Optimization of Apache Spark Streaming. 337-339 - Maryam Babaie, Ayaz Akram, Jason Lowe-Power:
Enabling Design Space Exploration of DRAM Caches for Emerging Memory Systems. 340-342 - Ying Li, Yifan Sun, Adwait Jog:
A Regression-based Model for End-to-End Latency Prediction for DNN Execution on GPUs. 343-345 - Massimo Coluzzi, Amos Brocco, Patrizio Contu, Tiziano Leidi
:
A survey and comparison of consistent hashing algorithms. 346-348 - Tom Glint, Chandan Kumar Jha, Manu Awasthi, Joycee Mekie:
Analysis of Conventional, Near-Memory, and In-Memory DNN Accelerators. 349-351 - Stavroula Zouzoula, Muhammad Waqar Azhar, Pedro Trancoso:
RAINBOW: Multi-Dimensional Hardware-Software Co-Design for DL Accelerator On-Chip Memory. 352-354 - Arne Symons
, Linyan Mei, Steven Colleman, Pouya Houshmand, Sebastian Karl, Marian Verhelst
:
Stream: A Modeling Framework for Fine-grained Layer Fusion on Multi-core DNN Accelerators. 355-357

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.