Mlp-mixer: An all-mlp architecture for vision IO Tolstikhin, N Houlsby, A Kolesnikov, L Beyer, X Zhai, T Unterthiner, ... Advances in neural information processing systems 34, 24261-24272, 2021 | 2643 | 2021 |
How to train your vit? data, augmentation, and regularization in vision transformers A Steiner, A Kolesnikov, X Zhai, R Wightman, J Uszkoreit, L Beyer arXiv preprint arXiv:2106.10270, 2021 | 618 | 2021 |
Pali: A jointly-scaled multilingual language-image model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... arXiv preprint arXiv:2209.06794, 2022 | 535 | 2022 |
Lit: Zero-shot transfer with locked-image text tuning X Zhai, X Wang, B Mustafa, A Steiner, D Keysers, A Kolesnikov, L Beyer Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 498 | 2022 |
Scaling vision transformers to 22 billion parameters M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ... International Conference on Machine Learning, 7480-7512, 2023 | 389 | 2023 |
Flax: A neural network library and ecosystem for JAX, 2020 J Heek, A Levskaya, A Oliver, M Ritter, B Rondepierre, A Steiner, ... URL http://github. com/google/flax 1, 2020 | 179 | 2020 |
KvarQ: targeted and direct variant calling from fastq reads of bacterial genomes A Steiner, D Stucki, M Coscolla, S Borrell, S Gagneux BMC genomics 15, 1-12, 2014 | 159 | 2014 |
Pali-x: On scaling up a multilingual vision and language model X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ... arXiv preprint arXiv:2305.18565, 2023 | 123 | 2023 |
Flax: A neural network library and ecosystem for JAX J Heek, A Levskaya, A Oliver, M Ritter, B Rondepierre, A Steiner, ... Version 0.3 3, 14-26, 2020 | 115 | 2020 |
Patch n’pack: Navit, a vision transformer for any aspect ratio and resolution M Dehghani, B Mustafa, J Djolonga, J Heek, M Minderer, M Caron, ... Advances in Neural Information Processing Systems 36, 2024 | 41 | 2024 |
Image captioners are scalable vision learners too M Tschannen, M Kumar, A Steiner, X Zhai, N Houlsby, L Beyer Advances in Neural Information Processing Systems 36, 2024 | 39 | 2024 |
Managing research and surveillance projects in real-time with a novel open-source e Management tool designed for under-resourced countries A Steiner, J Hella, S Grüninger, G Mhalu, F Mhimbira, CI Cercamondi, ... Journal of the American Medical Informatics Association 23 (5), 916-923, 2016 | 31 | 2016 |
Screening for pulmonary tuberculosis in a Tanzanian prison and computer-aided interpretation of chest X-rays A Steiner, C Mangu, J van den Hombergh, H van Deutekom, ... Public health action 5 (4), 249-254, 2015 | 23 | 2015 |
PaliGemma: A versatile 3B VLM for transfer L Beyer, A Steiner, AS Pinto, A Kolesnikov, X Wang, D Salz, M Neumann, ... arXiv preprint arXiv:2407.07726, 2024 | 18 | 2024 |
1 kHz 2D Visual Motion Sensor Using 2020 Silicon Retina Optical Sensor and DSP Microcontroller SC Liu, MH Yang, A Steiner, R Möckel, T Delbruck IEEE Transactions on Biomedical Circuits and Systems 9 (2), 207-216, 2015 | 8 | 2015 |
Three towers: Flexible contrastive learning with pretrained image models J Kossen, M Collier, B Mustafa, X Wang, X Zhai, L Beyer, A Steiner, ... Advances in Neural Information Processing Systems 36, 2024 | 7 | 2024 |
No filter: Cultural and socioeconomic diversityin contrastive vision-language models A Pouget, L Beyer, E Bugliarello, X Wang, AP Steiner, X Zhai, ... arXiv preprint arXiv:2405.13777, 2024 | 5 | 2024 |
CLIP the Bias: How Useful is Balancing Data in Multimodal Learning? I Alabdulmohsin, X Wang, A Steiner, P Goyal, A D'Amour, X Zhai arXiv preprint arXiv:2403.04547, 2024 | 5 | 2024 |
A study of autoregressive decoders for multi-tasking in computer vision L Beyer, B Wan, G Madan, F Pavetic, A Steiner, A Kolesnikov, AS Pinto, ... arXiv preprint arXiv:2303.17376, 2023 | 5 | 2023 |
1kHz 2D silicon retina motion sensor platform A Steiner, R Moeckel, R Thurer, D Floreano, T Delbruck, SC Liu 2014 IEEE International Symposium on Circuits and Systems (ISCAS), 41-44, 2014 | 5 | 2014 |