Follow
Shuang Ma
Shuang Ma
Apple AI/ML
Verified email at buffalo.edu - Homepage
Title
Cited by
Cited by
Year
A-lamp: Adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment
S Ma, J Liu, C Wen Chen
Proceedings of the IEEE conference on computer vision and pattern …, 2017
2182017
Da-gan: Instance-level image translation by deep attention generative adversarial networks
S Ma, J Fu, CW Chen, T Mei
Proceedings of the IEEE conference on computer vision and pattern …, 2018
1762018
Active contrastive learning of audio-visual video representations
S Ma, Z Zeng, D McDuff, Y Song
arXiv preprint arXiv:2009.09805, 2020
1022020
A generative adversarial network for style modeling in a text-to-speech system
S Ma, D Mcduff, Y Song
International Conference on Learning Representations 2, 2019
55*2019
Characterizing bias in classifiers using generative models
D McDuff, S Ma, Y Song, A Kapoor
Advances in neural information processing systems 32, 2019
502019
Contrastive learning of global and local video representations
S Ma, Z Zeng, D McDuff, Y Song
Advances in Neural Information Processing Systems 34, 7025-7040, 2021
492021
Multi-reference neural TTS stylization with adversarial cycle consistency
M Whitehill, S Ma, D McDuff, Y Song
arXiv preprint arXiv:1910.11958, 2019
362019
Reshaping robot trajectories using natural language commands: A study of multi-modal data alignment using transformers
A Bucker, L Figueredo, S Haddadinl, A Kapoor, S Ma, R Bonatti
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
342022
Latte: Language trajectory transformer
A Bucker, L Figueredo, S Haddadin, A Kapoor, S Ma, S Vemprala, ...
2023 IEEE International Conference on Robotics and Automation (ICRA), 7287-7294, 2023
332023
Smart: Self-supervised multi-task pretraining with control transformers
Y Sun, S Ma, R Madaan, R Bonatti, F Huang, A Kapoor
arXiv preprint arXiv:2301.09816, 2023
302023
Causalcity: Complex simulations with agency for causal discovery and reasoning
D McDuff, Y Song, J Lee, V Vineet, S Vemprala, NA Gyde, H Salman, ...
Conference on Causal Learning and Reasoning, 559-575, 2022
242022
Unpaired image-to-speech synthesis with multimodal information bottleneck
S Ma, D McDuff, Y Song
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
242019
Pose maker: A pose recommendation system for person in the landscape photographing
S Ma, Y Fan, CW Chen
Proceedings of the 22nd ACM international conference on Multimedia, 1053-1056, 2014
232014
Finding your spot: A photography suggestion system for placing human in the scene
S Ma, Y Fan, CW Chen
2014 IEEE International Conference on Image Processing (ICIP), 556-560, 2014
162014
Pact: Perception-action causal transformer for autoregressive robotics pre-training
R Bonatti, S Vemprala, S Ma, F Frujeri, S Chen, A Kapoor
2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023
152023
Compass: Contrastive multimodal pretraining for autonomous systems
S Ma, S Vemprala, W Wang, JK Gupta, Y Song, D McDufft, A Kapoor
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
82022
: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
R Zheng, X Wang, Y Sun, S Ma, J Zhao, H Xu, H Daumé III, F Huang
Advances in Neural Information Processing Systems 36, 2024
72024
Is imitation all you need? generalized decision-making with dual-phase training
Y Wei, Y Sun, R Zheng, S Vemprala, R Bonatti, S Chen, R Madaan, Z Ba, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
62023
M3D-GAN: Multi-modal multi-domain translation with universal attention
S Ma, D McDuff, Y Song
arXiv preprint arXiv:1907.04378, 2019
42019
Approach for license plate location using edge features filter and multi-decision mechanism
MA Shuang, C Jiangning, LU Hu
Computer Engineering and Applications 50 (9), 145-149, 2014
42014
The system can't perform the operation now. Try again later.
Articles 1–20