default search action
Amey Agrawal
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c5]Amey Agrawal, Sameer Reddy, Satwik Bhattamishra, Venkata Prabhakara Sarath Nookala, Vidushi Vashishth, Kexin Rong, Alexey Tumanov:
Inshrinkerator: Compressing Deep Learning Training Checkpoints via Dynamic Quantization. SoCC 2024: 1012-1031 - [c4]Amey Agrawal, Nitin Kedia, Jayashree Mohan, Ashish Panwar, Nipun Kwatra, Bhargav S. Gulavani, Ramachandran Ramjee, Alexey Tumanov:
VIDUR: A Large-Scale Simulation Framework for LLM Inference. MLSys 2024 - [c3]Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee:
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve. OSDI 2024: 117-134 - [i9]Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee:
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve. CoRR abs/2403.02310 (2024) - [i8]Amey Agrawal, Nitin Kedia, Jayashree Mohan, Ashish Panwar, Nipun Kwatra, Bhargav S. Gulavani, Ramachandran Ramjee, Alexey Tumanov:
Vidur: A Large-Scale Simulation Framework For LLM Inference. CoRR abs/2405.05465 (2024) - [i7]Amey Agrawal, Anmol Agarwal, Nitin Kedia, Jayashree Mohan, Souvik Kundu, Nipun Kwatra, Ramachandran Ramjee, Alexey Tumanov:
Metron: Holistic Performance Evaluation Framework for LLM Inference Systems. CoRR abs/2407.07000 (2024) - [i6]Amey Agrawal, Junda Chen, Íñigo Goiri, Ramachandran Ramjee, Chaojie Zhang, Alexey Tumanov, Esha Choukse:
Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations. CoRR abs/2409.17264 (2024) - 2023
- [i5]Amey Agrawal, Sameer Reddy, Satwik Bhattamishra, Venkata Prabhakara Sarath Nookala, Vidushi Vashishth, Kexin Rong, Alexey Tumanov:
DynaQuant: Compressing Deep Learning Training Checkpoints via Dynamic Quantization. CoRR abs/2306.11800 (2023) - [i4]Amey Agrawal, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Ramachandran Ramjee:
SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills. CoRR abs/2308.16369 (2023) - 2022
- [i3]Dharma Shukla, Muthian Sivathanu, Srinidhi Viswanatha, Bhargav S. Gulavani, Rimma Nehme, Amey Agrawal, Chen Chen, Nipun Kwatra, Ramachandran Ramjee, Pankaj Sharma, Atul Katiyar, Vipul Modi, Vaibhav Sharma, Abhishek Singh, Shreshth Singhal, Kaustubh Welankar, Lu Xun, Ravi Anupindi, Karthik Elangovan, Hasibur Rahman, Zhou Lin, Rahul Seetharaman, Cheng Xu, Eddie Ailijiang, Suresh Krishnappa, Mark Russinovich:
Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads. CoRR abs/2202.07848 (2022)
2010 – 2019
- 2019
- [c2]Amey Agrawal, Abhishek Dixit, Namrata A. Shettar, Darshil Kapadia, Vikram Agrawal, Rajat Gupta, Rohit Karlupia:
Delog: A High-Performance Privacy Preserving Log Filtering Framework. IEEE BigData 2019: 1739-1748 - [c1]Amey Agrawal, Rohit Karlupia, Rajat Gupta:
Logan: A Distributed Online Log Parser. ICDE 2019: 1946-1951 - [i2]Amey Agrawal, Abhishek Dixit, Darshil Kapadia, Rohit Karlupia, Vikram Agrawal, Rajat Gupta:
Delog: A Privacy Preserving Log Filtering Framework for Online Compute Platforms. CoRR abs/1902.04843 (2019) - [i1]Amey Agrawal, Rohit Karlupia:
Learning Digital Circuits: A Journey Through Weight Invariant Self-Pruning Neural Networks. CoRR abs/1909.00052 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-19 20:45 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint