NSDI 25
|
MTP: A Transport for In-Network Computing
Tao Ji, Rohan Vardekar, Balajee Vamanan, Brent Stephens and Aditya Akella.
|
NeurIPS 24
|
Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
Ruisi Cai, Yeonju Ro, Geon-Woo Kim, Peihao Wang, Babak Ehteshami Bejnordi, Aditya Akella, Zhangyang Wang.
|
NSDI 24
|
Cassini: Network-Aware Job Scheduling in Machine Learning Clusters
Sudarsanan Rajasekaran, Manya Ghobadi, Aditya Akella.
|
EMNLP 24
|
MOSEL: Inference Serving Using Dynamic Modality Selection
Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja Yadwadkar, and Aditya Akella.
|
EMNLP 24
|
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
Ajay Jaiswal, Bodun Hu, Lu Yin, Yeonju Ro, Shiwei Liu, Tianlong Chen, Aditya Akella
|