Zangwei Zheng
Zangwei Zheng
Other namesAlex Zangwei Zheng
Verified email at - Homepage
Cited by
Cited by
Prototypical cross-domain self-supervised learning for few-shot unsupervised domain adaptation
X Yue, Z Zheng, S Zhang, Y Gao, T Darrell, K Keutzer, AS Vincentelli
CVPR 2021, 2021
Prompt vision transformer for domain generalization
Z Zheng, X Yue, K Wang, Y You
arXiv preprint arXiv:2208.08914, 2022
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis
F Xue, Y Fu, W Zhou, Z Zheng, Y You
Neurips 2023, 2023
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models
Z Zheng, M Ma, K Wang, Z Qin, X Yue, Y You
ICCV 2023, 2023
Cross-token modeling with conditional computation
Y Lou, F Xue, Z Zheng, Y You
arXiv preprint arXiv:2109.02008, 2021
Instruction in the wild: A user-based instruction dataset
J Ni, F Xue, K Jain, MH Shah, Z Zheng, Y You
GitHub repository. Retrieved from https://github. com/XueFuzhao/InstructionWild, 2023
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
Z Qin, K Wang, Z Zheng, J Gu, X Peng, D Zhou, Y You
ICLR 2024, 2023
Openmoe: An early effort on open mixture-of-experts language models
F Xue, Z Zheng, Y Fu, J Ni, Z Zheng, W Zhou, Y You
arXiv preprint arXiv:2402.01739, 2024
Multi-source few-shot domain adaptation
X Yue, Z Zheng, HP Das, K Keutzer, AS Vincentelli
arXiv preprint arXiv:2109.12391, 2021
Scene-aware learning network for radar object detection
Z Zheng, X Yue, K Keutzer, A Sangiovanni Vincentelli
Proceedings of the 2021 International Conference on Multimedia Retrieval …, 2021
A Study on Transformer Configuration and Training Objective
F Xue, J Chen, A Sun, X Ren, Z Zheng, X He, Y Chen, X Jiang, Y You
ICML 2023, 2023
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Z Zheng, X Ren, F Xue, Y Luo, X Jiang, Y You
Neurips 2023, 2023
CAME: Confidence-guided Adaptive Memory Efficient Optimization
Y Luo, X Ren, Z Zheng, Z Jiang, X Jiang, Y You
ACL 2023, 2023
CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU
Z Zheng, P Xu, X Zou, D Tang, Z Li, C Xi, P Wu, L Zou, Y Zhu, M Chen, ...
AAAI 2023, 2023
Dataset Growth
Z Qin, Z Xu, Y Zhou, Z Zheng, Z Cheng, H Tang, L Shang, B Sun, X Peng, ...
arXiv preprint arXiv:2405.18347, 2024
Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization
Z Zhu, Y Liu, Z Zheng, H Guo, Y You
Proceedings of the ACM on Web Conference 2024, 3485-3496, 2024
How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning?
Y Luo, Z Zheng, Z Zhu, Y You
arXiv preprint arXiv:2404.12866, 2024
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
X Zhao, S Cheng, Z Zheng, Z Yang, Z Liu, Y You
arXiv preprint arXiv:2403.10266, 2024
The system can't perform the operation now. Try again later.
Articles 1–18