Follow
Siyang Sun
Siyang Sun
Alibaba Group
Verified email at alibaba-inc.com
Title
Cited by
Cited by
Year
A novel channel pruning method for deep neural network compression
Y Hu, S Sun, J Li, X Wang, Q Gu
arXiv preprint arXiv:1805.11394, 2018
1032018
Robust landmark detection and position measurement based on monocular vision for autonomous aerial refueling of UAVs
S Sun, Y Yin, X Wang, D Xu
IEEE Transactions on Cybernetics 49 (12), 4167-4179, 2018
512018
Robust visual detection and tracking strategies for autonomous aerial refueling of UAVs
S Sun, Y Yin, X Wang, D Xu
IEEE Transactions on Instrumentation and Measurement 68 (12), 4640-4652, 2019
502019
Fast object detection based on binary deep convolution neural networks
S Sun, Y Yin, X Wang, D Xu, W Wu, Q Gu
CAAI transactions on intelligence technology 3 (4), 191-197, 2018
502018
Ra-clip: Retrieval augmented contrastive language-image pre-training
CW Xie, S Sun, X Xiong, Y Zheng, D Zhao, J Zhou
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
372023
Deeply exploit visual and language information for social media popularity prediction
J Wu, L Zhao, D Li, CW Xie, S Sun, Y Zheng
Proceedings of the 30th ACM International Conference on Multimedia, 7045-7049, 2022
212022
Relevant intrinsic feature enhancement network for few-shot semantic segmentation
X Bao, J Qin, S Sun, X Wang, Y Zheng
Proceedings of the AAAI Conference on Artificial Intelligence 38 (2), 765-773, 2024
152024
Dual mean-teacher: An unbiased semi-supervised framework for audio-visual source localization
Y Guo, S Ma, H Su, Z Wang, Y Zhao, W Zou, S Sun, Y Zheng
Advances in Neural Information Processing Systems 36, 48639-48661, 2023
102023
Fashion focus: Multi-modal retrieval system for video commodity localization in e-commerce
Y Zhang, Q Wang, P Pan, Y Zheng, C Da, S Sun, Y Xu
Proceedings of the AAAI Conference on Artificial Intelligence 35 (18), 16127 …, 2021
102021
Multi-loss-aware channel pruning of deep networks
Y Hu, S Sun, J Li, J Zhu, X Wang, Q Gu
2019 IEEE International Conference on Image Processing (ICIP), 889-893, 2019
102019
Robust person head detection based on multi-scale representation fusion of deep convolution neural network
Y Wang, Y Yin, W Wu, S Sun, X Wang
2017 IEEE International Conference on Robotics and Biomimetics (ROBIO), 296-301, 2017
102017
Multiple receptive fields and small-object-focusing weakly-supervised segmentation network for fast object detection
S Sun, Y Yin, X Wang, D Xu, Y Zhao, H Shen
arXiv preprint arXiv:1904.12619, 2019
82019
Two stage multi-modal modeling for video interaction analysis in deep video understanding challenge
S Sun, X Xiong, Y Zheng
Proceedings of the 30th ACM International Conference on Multimedia, 7040-7044, 2022
72022
FOV constraint region analysis and path planning for mobile robot with observability to multiple feature points
H Ma, W Zou, S Sun, Z Zhu, Z Kang
International Journal of Control, Automation and Systems 19 (11), 3785-3800, 2021
72021
Cores: Orchestrating the dance of reasoning and segmentation
X Bao, S Sun, S Ma, K Zheng, Y Guo, G Zhao, Y Zheng, X Wang
European Conference on Computer Vision, 187-204, 2024
62024
Understanding the multi-modal prompts of the pre-trained vision-language model
S Ma, CW Xie, Y Wei, S Sun, J Fan, X Bao, Y Guo, Y Zheng
arXiv preprint arXiv:2312.11570, 2023
32023
Crossmae: Cross-modality masked autoencoders for region-aware audio-visual pre-training
Y Guo, S Sun, S Ma, K Zheng, X Bao, S Ma, W Zou, Y Zheng
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
22024
Tmml: Text-guided mulimodal product location for alleviating retrieval inconsistency in e-commerce
Y Tang, X Xiong, S Sun, B Cui, Y Zheng, H Tang
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
22023
Data de-duplication and semantic enhancement for contrastive language-image pre-training
S Sun, CW Xie, S Ma, Y Zheng
12023
Deep Video Understanding with a Unified Multi-Modal Retrieval Framework
CW Xie, S Sun, L Zhao, J Wu, D Li, Y Zheng
Proceedings of the 30th ACM International Conference on Multimedia, 7055-7059, 2022
12022
The system can't perform the operation now. Try again later.
Articles 1–20