Saurav Muralidharan
Verified email at nvidia.com
Title
Cited by
Year
Nitro: A framework for adaptive code variant tuning
S Muralidharan, M Shantharam, M Hall, M Garland, B Catanzaro
Parallel and Distributed Processing Symposium, 2014 IEEE 28th International …, 2014
Cited by 86 · Year 2014
Compact Language Models via Pruning and Knowledge Distillation
S Muralidharan, ST Sreenivas, R Joshi, M Chochowski, M Patwary, ...
arXiv preprint arXiv:2407.14679, 2024
Cited by 60 · Year 2024
A programmable approach to neural network compression
V Joseph, GL Gopalakrishnan, S Muralidharan, M Garland, A Garg
IEEE Micro 40 (5), 17-25, 2020
Cited by 36* · Year 2020
Architecture-adaptive code variant tuning
S Muralidharan, A Roy, M Hall, M Garland, P Rai
ACM SIGARCH Computer Architecture News 44 (2), 325-338, 2016
Cited by 34 · Year 2016
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
YN Wu, PA Tsai, S Muralidharan, A Parashar, V Sze, J Emer
Proceedings of the 56th Annual IEEE/ACM International Symposium on …, 2023
Cited by 29 · Year 2023
Towards making autotuning mainstream
P Basu, M Hall, M Khan, S Maindola, S Muralidharan, S Ramalingam, ...
The International Journal of High Performance Computing Applications 27 (4 …, 2013
Cited by 27 · Year 2013
LLM Pruning and Distillation in Practice: The Minitron Approach
ST Sreenivas, S Muralidharan, R Joshi, M Chochowski, ...
arXiv preprint arXiv:2408.11796, 2024
Cited by 25 · Year 2024
Going beyond classification accuracy metrics in model compression
V Joseph, SA Siddiqui, A Bhaskara, G Gopalakrishnan, S Muralidharan, ...
arXiv preprint arXiv:2012.01604, 2020
Cited by 24* · Year 2020
Flextron: Many-in-one flexible large language model
R Cai, S Muralidharan, G Heinrich, H Yin, Z Wang, J Kautz, P Molchanov
arXiv preprint arXiv:2406.10260, 2024
Cited by 14 · Year 2024
MaskLLM: Learnable semi-structured sparsity for large language models
G Fang, H Yin, S Muralidharan, G Heinrich, J Pool, J Kautz, P Molchanov, ...
arXiv preprint arXiv:2409.17481, 2024
Cited by 10 · Year 2024
Bayesian optimization of sparsity ratios in model compression
S Muralidharan, V Joseph, G Animesh, M Garland
US Patent App. 16/785,044, 2021
Cited by 10 · Year 2021
Uniform Sparsity in Deep Neural Networks
S Muralidharan
Proceedings of Machine Learning and Systems 5, 2023
Cited by 7 · Year 2023
A collection-oriented programming model for performance portability
S Muralidharan, M Garland, B Catanzaro, A Sidelnik, M Hall
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of …, 2015
Cited by 6 · Year 2015
Efficient Sparsely Activated Transformers
S Latifi, S Muralidharan, M Garland
arXiv preprint arXiv:2208.14580, 2022
Cited by 3 · Year 2022
Designing a tunable nested data-parallel programming system
S Muralidharan, M Garland, A Sidelnik, M Hall
ACM Transactions on Architecture and Code Optimization (TACO) 13 (4), 1-24, 2016
Cited by 3 · Year 2016
EoRA: Training-free compensation for compressed LLM with eigenspace low-rank approximation
SY Liu, M Khadkevich, NC Fung, C Sakr, CHH Yang, CY Wang, ...
arXiv preprint arXiv:2410.21271, 2024
Cited by 2 · Year 2024
The sparsity roofline: Understanding the hardware limits of sparse neural networks
C Shinn, C McCarthy, S Muralidharan, M Osama, JD Owens
arXiv preprint arXiv:2310.00496, 2023
Cited by 2 · Year 2023
Understanding the Effect of the Long Tail on Neural Network Compression
H Dam, V Joseph, A Bhaskara, G Gopalakrishnan, S Muralidharan, ...
arXiv preprint arXiv:2306.06238, 2023
Cited by 2 · Year 2023
Galaxia: A Semi-decentralized System for Implementing Secure-Group P2P Networks
S Muralidharan, S Koroth, N Anto, R Pandarachalil
2009 First International Conference on Networks & Communications, 289-294, 2009
Cited by 2 · Year 2009
Abstractions and Strategies for Adaptive Programming
S Muralidharan
The University of Utah, 2016
Cited by 1 · Year 2016