Xiaoyi Dong

Cited by

	All	Since 2019
Citations	2543	2539
h-index	20	20
i10-index	26	26

Citations per year
Year	Citations
2020	21
2021	85
2022	473
2023	1181
2024	776

Public access

17 articles available
0 articles not available

Based on funding mandates

Co-authors

Weiming Zhang - University of Science and Technology of China - Verified email at ustc.edu.cn
Dongdong Chen - Principal Research Manager, GenAI, Microsoft - Verified email at mail.ustc.edu.cn
Nenghai Yu - University of Science and Technology of China - Verified email at ustc.edu.cn
Lu Yuan - Principal Research Manager, Cognition, Cloud & AI, Microsoft - Verified email at microsoft.com
Jianmin Bao - Microsoft Research - Verified email at microsoft.com
Dong Chen - Principal Research Manager, Microsoft Research Asia - Verified email at microsoft.com
Jiaqi Wang - Shanghai AI Laboratory - Verified email at pjlab.org.cn
Baining Guo - Distinguished Scientist, Microsoft Research - Verified email at microsoft.com
Pan Zhang - Shanghai AI Laboratory - Verified email at mail.ustc.edu.cn
Qidong Huang - University of Science and Technology of China - Verified email at mail.ustc.edu.cn
Yuhang Zang - Shanghai AI Laboratory - Verified email at pjlab.org.cn

Xiaoyi Dong

University of Science and Technology of China

Verified email at mail.ustc.edu.cn

Computer Vision


Title	Cited by	Year
CSWin transformer: A general vision transformer backbone with cross-shaped windows. X Dong, J Bao, D Chen, W Zhang, N Yu, L Yuan, D Chen, B Guo. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2021	802	2021
Mobile-former: Bridging mobilenet and transformer. Y Chen, X Dai, D Chen, M Liu, X Dong, L Yuan, Z Liu. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2021	420	2021
Peco: Perceptual codebook for bert pre-training of vision transformers. X Dong, J Bao, T Zhang, D Chen, W Zhang, L Yuan, D Chen, F Wen, N Yu. Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), 2021	205	2021
Internlm: A multilingual language model with progressively enhanced capabilities. ILM Team. 2023-01-06)[2023-09-27]. https://github.com/InternLM/InternLM, 2023	117	2023
Lg-gan: Label guided adversarial network for flexible targeted attack of point cloud based deep networks. H Zhou, D Chen, J Liao, K Chen, X Dong, K Liu, W Zhang, G Hua, N Yu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020	95	2020
Protecting Celebrities from DeepFake with Identity Consistency Transformer. X Dong, J Bao, D Chen, T Zhang, W Zhang, N Yu, D Chen, F Wen, B Guo. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022	90	2022
Sharegpt4v: Improving large multi-modal models with better captions. L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin. arXiv preprint arXiv:2311.12793, 2023	78	2023
Maskclip: Masked self-distillation advances contrastive language-image pretraining. X Dong, J Bao, Y Zheng, T Zhang, D Chen, H Yang, M Zeng, W Zhang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	74	2023
Robust superpixel-guided attentional adversarial attack. X Dong, J Han, D Chen, J Liu, H Bian, Z Ma, H Li, X Wang, W Zhang, N Yu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020	63	2020
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition. P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, ... arXiv preprint arXiv:2309.15112, 2023	59	2023
GreedyFool: Distortion-Aware Sparse Adversarial Attack. X Dong, D Chen, J Bao, C Qin, L Yuan, W Zhang, N Yu, D Chen. Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020	59	2020
Bootstrapped Masked Autoencoders for Vision BERT Pretraining. X Dong, J Bao, T Zhang, D Chen, W Zhang, L Yuan, D Chen, F Wen, N Yu. ECCV 2022, 2022	54	2022
Self-robust 3d point recognition via gather-vector guidance. X Dong, D Chen, H Zhou, G Hua, W Zhang, N Yu. 2020 IEEE/CVF conference on computer vision and pattern recognition (cvpr …, 2020	50*	2020
Shape-invariant 3d adversarial point clouds. Q Huang, X Dong, D Chen, H Zhou, W Zhang, N Yu. Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	48	2022
Once a man: Towards multi-target attack via learning multi-target adversarial network once. J Han, X Dong, R Zhang, D Chen, W Zhang, N Yu, P Luo, X Wang. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	37	2019
Local geometric distortions resilient watermarking scheme based on symmetry. Z Ma, W Zhang, H Fang, X Dong, L Geng, N Yu. IEEE Transactions on Circuits and Systems for Video Technology 31 (12), 4826 …, 2021	36	2021
Diversity-aware meta visual prompting. Q Huang, X Dong, D Chen, W Zhang, F Wang, G Hua, N Yu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	30	2023
Identity-driven deepfake detection. X Dong, J Bao, D Chen, W Zhang, N Yu, D Chen, F Wen, B Guo. arXiv preprint arXiv:2012.03930, 2020	27	2020
Vigc: Visual instruction generation and correction. B Wang, F Wu, X Han, J Peng, H Zhong, P Zhang, X Dong, W Li, W Li, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5309-5317, 2024	26	2024
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model. X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024	25	2024
