Sledovat
Mike Z. SHOU
Mike Z. SHOU
National U. of Singapore; Facebook AI; Columbia University
E-mailová adresa ověřena na: columbia.edu - Domovská stránka
Název
Citace
Citace
Rok
Temporal action localization in untrimmed videos via multi-stage cnns
Z Shou, D Wang, SF Chang
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016
11442016
Ego4d: Around the world in 3,000 hours of egocentric video
K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
8532022
Cdc: Convolutional-de-convolutional networks for precise temporal action localization in untrimmed videos
Z Shou, J Chan, A Zareian, K Miyazawa, SF Chang
Proceedings of the IEEE conference on computer vision and pattern …, 2017
6902017
Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation
JZ Wu, Y Ge, X Wang, SW Lei, Y Gu, Y Shi, W Hsu, Y Shan, X Qie, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
5772023
Convnet architecture search for spatiotemporal feature learning
D Tran, J Ray, Z Shou, SF Chang, M Paluri
arXiv preprint arXiv:1708.05038, 2017
5142017
Single shot temporal action detection
T Lin, X Zhao, Z Shou
Proceedings of the 25th ACM international conference on Multimedia, 988-996, 2017
5092017
Autoloc: Weakly-supervised temporal action localization in untrimmed videos
Z Shou, H Gao, L Zhang, K Miyazawa, SF Chang
Proceedings of the european conference on computer vision (ECCV), 154-171, 2018
3242018
Channel augmented joint learning for visible-infrared recognition
M Ye, W Ruan, B Du, MZ Shou
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
2312021
All in one: Exploring unified video-language pre-training
J Wang, Y Ge, R Yan, Y Ge, KQ Lin, S Tsutsui, X Lin, G Cai, J Wu, Y Shan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
2082023
Deep tensor admm-net for snapshot compressive imaging
J Ma, XY Liu, Z Shou, X Yuan
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
1972019
Actor-context-actor relation network for spatio-temporal action localization
J Pan, S Chen, MZ Shou, Y Liu, J Shao, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
1762021
Is someone speaking? exploring long-term temporal features for audio-visual active speaker detection
R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li
Proceedings of the 29th ACM international conference on multimedia, 3927-3935, 2021
1692021
Sf-net: Single-frame supervision for temporal action localization
F Ma, L Zhu, Y Yang, S Zha, G Kundu, M Feiszli, Z Shou
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
1672020
Low-shot learning via covariance-preserving adversarial augmentation networks
H Gao, Z Shou, A Zareian, H Zhang, SF Chang
Advances in Neural Information Processing Systems 31, 2018
1642018
Dmc-net: Generating discriminative motion cues for fast compressed video action recognition
Z Shou, X Lin, Y Kalantidis, L Sevilla-Lara, M Rohrbach, SF Chang, Z Yan
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
1582019
Egocentric video-language pretraining
KQ Lin, J Wang, M Soldan, M Wray, R Yan, EZ Xu, D Gao, RC Tu, W Zhao, ...
Advances in Neural Information Processing Systems 35, 7575-7586, 2022
1572022
Diffumask: Synthesizing images with pixel-level annotations for semantic segmentation using diffusion models
W Wu, Y Zhao, MZ Shou, H Zhou, C Shen
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1272023
Boxdiff: Text-to-image synthesis with training-free box-constrained diffusion
J Xie, Y Li, Y Huang, H Liu, W Zhang, Y Zheng, MZ Shou
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1232023
Online detection of action start in untrimmed, streaming videos
Z Shou, J Pan, J Chan, K Miyazawa, H Mansour, A Vetro, X Giro-i-Nieto, ...
Proceedings of the European Conference on Computer Vision (ECCV), 534-551, 2018
121*2018
Show-1: Marrying pixel and latent diffusion models for text-to-video generation
DJ Zhang, JZ Wu, JW Liu, R Zhao, L Ran, Y Gu, D Gao, MZ Shou
International Journal of Computer Vision, 1-15, 2024
1202024
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–20