GraphR: Accelerating graph processing using ReRAM L Song, Y Zhuo, X Qian, H Li, Y Chen 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 351 | 2018 |
CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices C Ding, S Liao, Y Wang, Z Li, N Liu, Y Zhuo, C Wang, X Qian, Y Bai, ... Proceedings of the 50th Annual IEEE/ACM International Symposium on …, 2017 | 343 | 2017 |
GraphP: Reducing communication for PIM-based graph processing with efficient data partition M Zhang, Y Zhuo, C Wang, M Gao, Y Wu, K Chen, C Kozyrakis, X Qian 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 270 | 2018 |
Graphq: Scalable pim-based graph processing Y Zhuo, C Wang, M Zhang, R Wang, D Niu, Y Wang, X Qian Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019 | 174 | 2019 |
Hypar: Towards hybrid parallelism for deep learning accelerator array L Song, J Mao, Y Zhuo, X Qian, H Li, Y Chen 2019 IEEE international symposium on high performance computer architecture …, 2019 | 138 | 2019 |
E-RNN: Design optimization for efficient recurrent neural networks in FPGAs Z Li, C Ding, S Wang, W Wen, Y Zhuo, C Liu, Q Qiu, W Xu, X Lin, X Qian, ... 2019 IEEE International Symposium on High Performance Computer Architecture …, 2019 | 91 | 2019 |
Prague: High-performance heterogeneity-aware asynchronous decentralized training Q Luo, J He, Y Zhuo, X Qian Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020 | 82 | 2020 |
Accpar: Tensor partitioning for heterogeneous deep learning accelerators L Song, F Chen, Y Zhuo, X Qian, H Li, Y Chen 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020 | 76 | 2020 |
Wonderland: A novel abstraction-based out-of-core graph processing system M Zhang, Y Wu, Y Zhuo, X Qian, C Huan, K Chen ACM SIGPLAN Notices 53 (2), 608-621, 2018 | 75 | 2018 |
Hop: Heterogeneity-aware decentralized training Q Luo, J Lin, Y Zhuo, X Qian Proceedings of the Twenty-Fourth International Conference on Architectural …, 2019 | 61 | 2019 |
Performance evaluation and optimization of HBM-enabled GPU for data-intensive applications M Zhu, Y Zhuo, C Wang, W Chen, Y Xie IEEE Transactions on Very Large Scale Integration (VLSI) Systems 26 (5), 831-840, 2018 | 52 | 2018 |
Scalable graph traversal on sunway taihulight with ten million cores H Lin, X Tang, B Yu, Y Zhuo, W Chen, J Zhai, W Yin, W Zheng 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017 | 46 | 2017 |
Symplegraph: distributed graph processing with precise loop-carried dependency guarantee Y Zhuo, J Chen, Q Luo, Y Wang, H Yang, D Qian, X Qian Proceedings of the 41st ACM SIGPLAN Conference on Programming Language …, 2020 | 15 | 2020 |
Heterogeneity-aware asynchronous decentralized training Q Luo, J He, Y Zhuo, X Qian arXiv preprint arXiv:1909.08029, 2019 | 7 | 2019 |
Distributed graph processing system and processing-in-memory architecture with precise loop-carried dependency guarantee Y Zhuo, J Chen, G Rao, Q Luo, Y Wang, H Yang, D Qian, X Qian ACM Transactions on Computer Systems (TOCS) 37 (1-4), 1-37, 2021 | 6 | 2021 |
Cse: Parallel finite state machines with convergence set enumeration Y Zhuo, J Cheng, Q Luo, J Zhai, Y Wang, Z Luan, X Qian 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 5 | 2018 |
{HydraRPC}:{RPC} in the {CXL} Era T Ma, Z Liu, C Wei, J Huang, Y Zhuo, H Li, N Zhang, Y Guan, D Niu, ... 2024 USENIX Annual Technical Conference (USENIX ATC 24), 387-395, 2024 | 2 | 2024 |
Klotski: DNN Model Orchestration Framework for Dataflow Architecture Accelerators C Bai, X Wei, Y Zhuo, Y Cai, H Zheng, B Yu, Y Xie 2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 1-9, 2023 | 1 | 2023 |
TASK SCHEDULING UNIT, WAFER-SCALE CHIP, AND TASK SCHEDULING METHOD Y Zhuo, H XU, Z Zhang, S LI, D Niu, H Zheng US Patent App. 18/667,409, 2024 | | 2024 |
Klotski v2: Improved DNN Model Orchestration Framework for Dataflow Architecture Accelerators C Bai, X Wei, Y Zhuo, Y Cai, H Zheng, B Yu, Y Xie IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024 | | 2024 |