Sledovat
Filip Pavetic
Filip Pavetic
E-mailová adresa ověřena na: google.com
Název
Citace
Citace
Rok
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
15482023
Scaling vision transformers to 22 billion parameters
M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ...
International Conference on Machine Learning, 7480-7512, 2023
3942023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
3772024
Pali-x: On scaling up a multilingual vision and language model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
arXiv preprint arXiv:2305.18565, 2023
1282023
Object scene representation transformer
MSM Sajjadi, D Duckworth, A Mahendran, S Van Steenkiste, F Pavetic, ...
Advances in neural information processing systems 35, 9512-9524, 2022
922022
Flexivit: One model for all patch sizes
L Beyer, P Izmailov, A Kolesnikov, M Caron, S Kornblith, X Zhai, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
782023
Pali-3 vision language models: Smaller, faster, stronger
X Chen, X Wang, L Beyer, A Kolesnikov, J Wu, P Voigtlaender, B Mustafa, ...
arXiv preprint arXiv:2310.09199, 2023
502023
The auto arborist dataset: a large-scale benchmark for multiview urban forest monitoring under domain shift
S Beery, G Wu, T Edwards, F Pavetic, B Majewski, S Mukherjee, S Chan, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
322022
$ LCSk $++: Practical similarity metric for long strings
F Pavetić, G Žužić, M Šikić
arXiv preprint arXiv:1407.2407, 2014
102014
Multi-step sequence alignment
PG Anders, F Pavetic
US Patent 9,959,448, 2018
62018
Methods, systems, and media for detecting abusive stereoscopic videos by generating fingerprints for multiple portions of a video frame
V Zamaraiev, F Pavetic
US Patent 9,872,056, 2018
62018
Fast and simple algorithms for computing both and
F Pavetić, I Katanić, G Matula, G Žužić, M Šikić
arXiv preprint arXiv:1705.07279, 2017
62017
A study of autoregressive decoders for multi-tasking in computer vision
L Beyer, B Wan, G Madan, F Pavetic, A Steiner, A Kolesnikov, AS Pinto, ...
arXiv preprint arXiv:2303.17376, 2023
52023
Detecting multiple parts of a screen to fingerprint to detect abusive uploading videos
F Pavetic, MR Konrad, H Pasula
US Patent 10,614,539, 2020
42020
Detecting multiple parts of a screen to fingerprint to detect abusive uploading videos
F Pavetic, MR Konrad, H Pasula
US Patent 9,972,060, 2018
32018
LocCa: Visual Pretraining with Location-aware Captioners
B Wan, M Tschannen, Y Xian, F Pavetic, I Alabdulmohsin, X Wang, ...
arXiv preprint arXiv:2403.19596, 2024
22024
On Scaling Up a Multilingual Vision and Language Model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
22024
Methods, systems, and media for detecting abusive stereoscopic videos by generating fingerprints for multiple portions of a video frame
V Zamaraiev, F Pavetic
US Patent 10,499,097, 2019
12019
Video screening using a machine learning video screening model trained using self-supervised training
M Kandpal, B Ashirmatov, F Pavetic
US Patent 12,002,257, 2024
2024
Training large-scale vision transformer neural networks with variable patch sizes
LK Beyer, P Izmailov, S Kornblith, A Kolesnikov, M Caron, X Zhai, ...
US Patent App. 18/518,075, 2024
2024
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–20