Follow
Anand Venkat
Anand Venkat
Verified email at nvidia.com
Title
Cited by
Cited by
Year
Loop and data transformations for sparse matrix code
A Venkat, M Hall, M Strout
ACM SIGPLAN Notices 50 (6), 521-532, 2015
1162015
Non-affine extensions to polyhedral code generation
A Venkat, M Shantharam, M Hall, MM Strout
Proceedings of Annual IEEE/ACM International Symposium on Code Generation …, 2014
852014
Automating wavefront parallelization for sparse matrix computations
A Venkat, MS Mohammadi, J Park, H Rong, R Barik, MM Strout, M Hall
SC'16: Proceedings of the International Conference for High Performance …, 2016
612016
Compiler generation and autotuning of communication-avoiding operators for geometric multigrid
P Basu, A Venkat, M Hall, S Williams, B Van Straalen, L Oliker
20th Annual International Conference on High Performance Computing, 452-461, 2013
362013
Sparse computation data dependence simplification for efficient compiler-generated inspectors
MS Mohammadi, T Yuki, K Cheshmi, EC Davis, M Hall, MM Dehnavi, ...
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language …, 2019
342019
SWIRL: High-performance many-core CPU code generation for deep neural networks
A Venkat, T Rusira, R Barik, M Hall, L Truong
The International Journal of High Performance Computing Applications 33 (6 …, 2019
322019
Towards making autotuning mainstream
P Basu, M Hall, M Khan, S Maindola, S Muralidharan, S Ramalingam, ...
The International journal of high performance computing applications 27 (4 …, 2013
262013
Misim: An end-to-end neural code similarity system
F Ye, S Zhou, A Venkat, R Marucs, N Tatbul, JJ Tithi, P Petersen, ...
arXiv preprint arXiv:2006.05265, 2020
212020
Synchronization Trade-offs in GPU implementations of Graph Algorithms
R Kaleem, A Venkat, S Pai, M Hall, K Pingali
IEEE International Parallel & Distributed Processing Symposium (IPDPS 2016), 2016
212016
Harnessing deep learning via a single building block
E Georganas, K Banerjee, D Kalamkar, S Avancha, A Venkat, ...
2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020
192020
Extending index-array properties for data dependence analysis
MS Mohammadi, K Cheshmi, MM Dehnavi, A Venkat, T Yuki, MM Strout
Languages and Compilers for Parallel Computing: 31st International Workshop …, 2019
182019
Optimizing LOBPCG: Sparse Matrix Loop and Data Transformations in Action
K Ahmad, A Venkat, M Hall
The 29th International Workshop on Languages and Compilers for Parallel …, 2016
132016
High-performance deep learning via a single building block
E Georganas, K Banerjee, D Kalamkar, S Avancha, A Venkat, ...
arXiv preprint arXiv:1906.06440, 2019
112019
ISA mapper: a compute and hardware agnostic deep learning compiler
M Sotoudeh, A Venkat, M Anderson, E Georganas, A Heinecke, J Knight
Proceedings of the 16th ACM International Conference on Computing Frontiers …, 2019
102019
Misim: A neural code semantics similarity system using the context-aware semantics structure
F Ye, S Zhou, A Venkat, R Marcus, N Tatbul, JJ Tithi, N Hasabnis, ...
arXiv preprint arXiv:2006.05265, 2020
72020
Understanding the performance of small convolution operations for CNN on intel architecture
A Heinecke, E Georganas, K Banerjee, D Kalamkar, N Sundaram, ...
Poster in the International Conference for High Performance Computing …, 2017
72017
Combining polyhedral and ast transformations in chill
H Zhang, A Venkat, P Basu, M Hall
Proceedings of the Sixth International Workshop on Polyhedral Compilation …, 2016
72016
Predictive data locality optimization for higher-order tensor computations
TR Patabandi, A Venkat, A Kulkarni, P Ratnalikar, M Hall, J Gottschlich
Proceedings of the 5th ACM SIGPLAN International Symposium on Machine …, 2021
52021
Misim: A novel code similarity system
F Ye, S Zhou, A Venkat, R Marucs, N Tatbul, JJ Tithi, P Petersen, ...
arXiv preprint arXiv:2006.05265, 2021
52021
Context-Aware Parse Trees
F Ye, S Zhou, A Venkat, R Marcus, P Petersen, JJ Tithi, T Mattson, ...
arXiv preprint arXiv:2003.11118, 2020
42020
The system can't perform the operation now. Try again later.
Articles 1–20