Shijie Geng
Shijie Geng
Research Scientist, ByteDance Inc.
Verified email at
Cited by
Cited by
Clip-adapter: Better vision-language models with feature adapters
P Gao, S Geng, R Zhang, T Ma, R Fang, Y Zhang, H Li, Y Qiao
International Journal of Computer Vision, 2024
Llama-adapter v2: Parameter-efficient visual instruction model
P Gao, J Han, R Zhang, Z Lin, S Geng, A Zhou, W Zhang, P Lu, C He, ...
arXiv preprint arXiv:2304.15010, 2023
Recommendation as Language Processing (RLP): A Unified Pretrain, Personalized Prompt & Predict Paradigm (P5)
S Geng, S Liu, Z Fu, Y Ge, Y Zhang
RecSys 2022, 2022
Fairness-aware explainable recommendation over knowledge graphs
Z Fu, Y Xian, R Gao, J Zhao, Q Huang, Y Ge, S Xu, S Geng, C Shah, ...
Proceedings of the 43rd international ACM SIGIR conference on research and …, 2020
Quantized densely connected u-nets for efficient landmark localization
Z Tang, X Peng, S Geng, L Wu, S Zhang, D Metaxas
Proceedings of the European conference on computer vision (ECCV), 339-354, 2018
Frozen clip models are efficient video learners
Z Lin, S Geng, R Zhang, P Gao, G de Melo, X Wang, J Dai, Y Qiao, H Li
ECCV 2022, 2022
Image segmentation with pyramid dilated convolution based on ResNet and U-Net
Q Zhang, Z Cui, X Niu, S Geng, Y Qiao
Neural Information Processing: 24th International Conference, ICONIP 2017 …, 2017
CAFE: Coarse-to-fine neural symbolic reasoning for explainable recommendation
Y Xian, Z Fu, H Zhao, Y Ge, X Chen, Q Huang, S Geng, Z Qin, G De Melo, ...
Proceedings of the 29th ACM International Conference on Information …, 2020
Learning and evaluating graph neural network explanations based on counterfactual and factual reasoning
J Tan, S Geng, Z Fu, Y Ge, S Xu, Y Li, Y Zhang
Proceedings of the ACM web conference 2022, 1018-1027, 2022
Path language modeling over knowledge graphsfor explainable recommendation
S Geng, Z Fu, J Tan, Y Ge, G De Melo, Y Zhang
Proceedings of the ACM Web Conference 2022, 946-955, 2022
Explainable Fairness in Recommendation
Y Ge, J Tan, Y Zhu, Y Xia, J Luo, S Liu, Z Fu, S Geng, Z Li, Y Zhang
SIGIR 2022, 2022
Dynamic graph representation learning for video dialog via multi-modal shuffled transformers
S Geng, P Gao, M Chatterjee, C Hori, J Le Roux, Y Zhang, H Li, A Cherian
Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 1415-1423, 2021
Cu-net: Coupled u-nets
Z Tang, X Peng, S Geng, Y Zhu, DN Metaxas
BMVC 2018, 2018
VIP5: Towards Multimodal Foundation Models for Recommendation
S Geng, J Tan, S Liu, Z Fu, Y Zhang
Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention
S Geng, J Yuan, Y Tian, Y Chen, Y Zhang
ICLR 2023, 2023
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
D Liu, R Zhang, L Qiu, S Huang, W Lin, S Zhao, S Geng, Z Lin, P Jin, ...
Forty-first International Conference on Machine Learning, 2024
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition
T Ma, S Geng, M Wang, S Xu, H Li, B Zhang, P Gao, Y Qiao
BMVC 2022, 2022
COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality
H Zhou, A Kadav, A Shamsian, S Geng, F Lai, L Zhao, T Liu, M Kapadia, ...
ECCV 2022, 2022
Improving personalized explanation generation through visualization
S Geng, Z Fu, Y Ge, L Li, G De Melo, Y Zhang
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
Contrastive visual-linguistic pretraining
L Shi, K Shuang, S Geng, P Su, Z Jiang, P Gao, Z Fu, G de Melo, S Su
arXiv preprint arXiv:2007.13135, 2020
The system can't perform the operation now. Try again later.
Articles 1–20