default search action
30th ASAP 2019: New York, NY, USA
- 30th IEEE International Conference on Application-specific Systems, Architectures and Processors, ASAP 2019, New York, NY, USA, July 15-17, 2019. IEEE 2019, ISBN 978-1-7281-1601-3
Applications: Machine Learning I
- Hongxiang Fan, Cheng Luo, Chenglong Zeng, Martin Ferianc, Zhiqiang Que, Shuanglong Liu, Xinyu Niu, Wayne Luk:
F-E3D: FPGA-based Acceleration of an Efficient 3D Convolutional Neural Network for Human Action Recognition. 1-8 - Tong Geng, Tianqi Wang, Chunshu Wu, Chen Yang, Shuaiwen Leon Song, Ang Li, Martin C. Herbordt:
LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism. 9-16
Applications: Machine Learning II
- Zhiqiang Que, Thomas Nugent, Shuanglong Liu, Li Tian, Xinyu Niu, Yongxin Zhu, Wayne Luk:
Efficient Weight Reuse for Large LSTMs. 17-24 - Jeff Anderson, Shuai Sun, Yousra Al-Kabani, Volker J. Sorger, Tarek A. El-Ghazawi:
Photonic Processor for Fully Discretized Neural Networks. 25-32
Lighting Session for Posters
- Jessica Vandebon, José Gabriel F. Coutinho, Wayne Luk, Thomas Chau:
Transparent Heterogeneous Cloud Acceleration. 33 - Yuancheng Li, Jiaqi Shi:
CRbS: A Code Reordering Based Speeding-up Method of Irregular Loops on CMP. 34 - Krishna Teja Chitty-Venkata, Arun K. Somani:
Impact of Structural Faults on Neural Network Performance. 35 - M. Hassan Najafi, S. Rasoul Faraji, Kia Bazargan, David J. Lilja:
Energy-Efficient Near-Sensor Convolution using Pulsed Unary Processing. 36 - Wei-pei Huang, Ray C. C. Cheung, Hong Yan:
An Efficient Application Specific Instruction Set Processor (ASIP) for Tensor Computation. 37 - Riadh Ben Abdelhamid, Yoshiki Yamaguchi, Taisuke Boku:
MITRACA: Manycore Interlinked Torus Reconfigurable Accelerator Architecture. 38 - Yuka Oba, Kota Ando, Tetsuya Asai, Masato Motomura, Shinya Takamaeda-Yamazaki:
DeltaNet: Differential Binary Neural Network. 39 - Kamyar Givaki, Reza Hojabr, M. Hassan Najafi, Ahmad Khonsari, M. Hossein Gholamrezayi, Saeid Gorgin, Dara Rahmati:
Using Residue Number Systems to Accelerate Deterministic Bit-stream Multiplication. 40 - Nicolas Brunie, Christoph Quirin Lauter, Guillaume Revy:
Precision Adaptation for Fast and Accurate Polynomial Evaluation Generation. 41
Architecture and Synthesis
- H. T. Kung, Bradley McDanel, Sai Qian Zhang, Xin Dong, Chih-Chiang Chen:
Maestro: A Memory-on-Logic Architecture for Coordinated Parallel Use of Many Systolic Arrays. 42-50 - Björn Sigurbergsson, Tom Hogervorst, Tong Dong Qiu, Razvan Nane:
Sparstition: A Partitioning Scheme for Large-Scale Sparse Matrix Vector Multiplication on FPGA. 51-58 - Charalampos Kritikakis, Dirk Koch:
End-to-end Dynamic Stream Processing on Maxeler HLS Platforms. 59-66 - Sharad Malik, Pareesa Ameneh Golnari:
Sparse Matrix to Matrix Multiplication: A Representation and Architecture for Acceleration. 67-70 - Dionysios Diamantopoulos, Christoph Hagleitner:
HelmGemm: Managing GPUs and FPGAs for Transprecision GEMM Workloads in Containerized Environments. 71-74
Applications: Machine Learning, Robotics, and Simulation I
- Oumaima Matoussi, Yves Durand, Olivier Sentieys, Anca Molnos:
Error Analysis of the Square Root Operation for the Purpose of Precision Tuning: A Case Study on K-means. 75-82 - Quentin Gautier, Alric Althoff, Ryan Kastner:
FPGA Architectures for Real-time Dense SLAM. 83-90 - Ce Guo, Wayne Luk, Stanley Qing Shui Loh, Alexander Warren, Joshua M. Levine:
Customisable Control Policy Learning for Robotics. 91-98 - Dawen Xu, KouZi Xing, Cheng Liu, Ying Wang, Yulin Dai, Long Cheng, Huawei Li, Lei Zhang:
Resilient Neural Network Training for Accelerators with Computing Errors. 99-102 - Dilshan Kumarathunga, Omega Gamage, Asitha Samarasinghe, Nipuna Saranga, Ranga Rodrigo, Ajith A. Pasqual:
VLIW Based Runtime Reconfigurable Machine Vision Coprocessor Architecture for Edge Computing. 103-106
Hardware Acceleration
- Lixue Xia, Lansong Diao, Zhao Jiang, Hao Liang, Kai Chen, Li Ding, Shunli Dou, Zibin Su, Meng Sun, Jiansong Zhang, Wei Lin:
PAI-FCNN: FPGA Based Inference System for Complex CNN Models. 107-114
Applications: Image Processing, Networking, and Floating Point Arithmetic I
- Pankaj Bhowmik, Md Jubaer Hossain Pantho, Christophe Bobda:
Event-Based Re-configurable Hierarchical Processors for Smart Image Sensors. 115-122 - Madushan Abeysinghe, Jesse Villarreal, Lucas Weaver, Jason D. Bakos:
OpenVX Graph Optimization for Visual Processor Units. 123-130 - Piyumal Ranawaka, Mongkol Ekpanyapong, Adriano Tavares, Jorge Cabral, Krit Athikulwongse, Vitor Alberto Silva:
Application Specific Architecture for Hardware Accelerating HOG-SVM to Achieve High Throughput on HD Frames. 131-134
Lighting Session for Posters
- Wei-Kuo Chiang, He-Xin Chen:
A Quantitative Approach for Refactoring NFV-based Mobile Core Networks. 135 - Haoqiang Guo, Lu Peng, Jian Zhang, Fang Qi, Lide Duan:
Fooling AI with AI: An Accelerator for Adversarial Attacks on Deep Learning Visual Classification. 136 - Tianqi Gao, Rob A. Rutenbar:
A Virtual Image Accelerator for Graph Cuts Inference on FPGA. 137 - Byeong Kil Lee, Jordan Pattee:
Implications for Hardware Acceleration of Malware Detection. 138 - Yehia Arafa, Abdel-Hameed A. Badawy, Gopinath Chennupati, Nandakishore Santhi, Stephan J. Eidenbenz:
POSTER: GPUs Pipeline Latency Analysis. 139 - Sina Asadi, M. Hassan Najafi:
Context-Aware Number Generator for Deterministic Bit-stream Computing. 140 - Wenpei Zheng, Sheng-Yang Chiu, Jui-Chien Hsieh, Chaochang Chiu:
Smart Rabbit - A Wearable Device As An Intelligent Pacer for Marathon Runners. 141
In Memory Computing
- Shahar Kvatinsky:
Real Processing-in-Memory with Memristive Memory Processing Unit (mMPU). 142-148 - Oscar Castañeda, Maria Bobbett, Alexandra Gallyas-Sanhueza, Christoph Studer:
PPAC: A Versatile In-Memory Accelerator for Matrix-Vector-Product-Like Operations. 149-156 - Feng Wang, Guojie Luo, Guangyu Sun, Jiaxi Zhang, Peng Huang, Jinfeng Kang:
Parallel Stateful Logic in RRAM: Theoretical Analysis and Arithmetic Design. 157-164
Applications: Machine Learning, Robotics, and Simulation II
- Rasha Karakchi, Charles Daniels, Jason D. Bakos:
An Overlay Architecture for Pattern Matching. 165-172 - Nils Voss, Peter Ziegenhein, Lukas Vermond, Joost Hoozemans, Oskar Mencer, Uwe Oelfke, Wayne Luk, Georgi Gaydadjiev:
Towards Real Time Radiotherapy Simulation. 173-180 - Tianqi Wang, Tong Geng, Xi Jin, Martin C. Herbordt:
Accelerating AP3M-Based Computational Astrophysics Simulations with Reconfigurable Clusters. 181-184 - Sean Murray, Will Floyd-Jones, George Dimitri Konidaris, Daniel J. Sorin:
A Programmable Architecture for Robot Motion Planning Acceleration. 185-188
Emerging Technologies
- Zejun Shi, Dongqin Zhou, Keni Qiu, Jiwu Shu:
Leveraging Energy Cycle Regularity to Predict Adaptive Mode for Non-volatile Processors. 189-196 - Saunak Saha, Henry Duwe, Joseph Zambreno:
An Adaptive Memory Management Strategy Towards Energy Efficient Machine Inference in Event-Driven Neuromorphic Accelerators. 197-205 - Naveed Mahmud, Esam El-Araby:
Improving Emulation of Quantum Algorithms using Space-Efficient Hardware Architectures. 206-213 - Jacqueline Lagasse, Christopher Bartoli, Wayne P. Burleson:
Combining Clock and Voltage Noise Countermeasures Against Power Side-Channel Analysis. 214-217
Applications: Image Processing, Networking, and Floating Point Arithmetic II
- Jiuxi Meng, Nadeen Gebara, Ho-Cheung Ng, Paolo Costa, Wayne Luk:
Investigating the Feasibility of FPGA-based Network Switches. 218-226 - Brett Mathis, James E. Stine:
A Well-Equipped Implementation: Normal/Denormalized Half/Single/Double Precision IEEE 754 Floating-Point Adder/Subtracter. 227-234 - Nisal Ranasinghe, Ravindu Bangamuarachchi, Jayath Seneviratne, Achini Jayawardane, Ajith Pasqual, R. M. A. U. Senarath:
SMPTE ST 2110 Compliant Scalable Architecture on FPGA for end to end Uncompressed Professional Video Transport Over IP Networks. 235-238
Hardware Acceleration II
- Zhenman Fang, Farnoosh Javadi, Jason Cong, Glenn Reinman:
Understanding Performance Gains of Accelerator-Rich Architectures. 239-246
Design Methodologies I
- Zheming Jin, Hal Finkel:
Base64 Encoding on Heterogeneous Computing Platforms. 247-254 - Oliver Jakob Arndt, Matthias Lüders, Holger Blume:
Statistical Performance Prediction for Multicore Applications Based on Scalability Characteristics. 255-262
Design Methodologies II
- Chen Yang, Tong Geng, Tianqi Wang, Charles Lin, Jiayi Sheng, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
Molecular Dynamics Range-Limited Force Evaluation Optimized for FPGAs. 263-271 - Jian Fang, Jianyu Chen, Jinho Lee, Zaid Al-Ars, H. Peter Hofstee:
Refine and Recycle: A Method to Increase Decompression Parallelism. 272-280 - Tieu-Khanh Luong, Van-Tinh Nguyen, Anh-Thai Nguyen, Emanuel M. Popovici:
Efficient Architectures and Implementation of Arithmetic Functions Approximation Based Stochastic Computing. 281-287 - Muhammad Irfan, Ray C. C. Cheung, Zahid Ullah:
Bank-selective Strategy for Gate-based Ternary Content-addressable Memory on FPGAs. 288-291
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.