MultiMAE: Multi-modal multi-task masked autoencoders R Bachmann, D Mizrahi, A Atanov, A Zamir European Conference on Computer Vision, 348-367, 2022 | 252 | 2022 |
4M: Massively Multimodal Masked Modeling D Mizrahi, R Bachmann, O Kar, T Yeo, M Gao, A Dehghan, A Zamir Advances in Neural Information Processing Systems 36, 2024 | 39 | 2024 |
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities R Bachmann, OF Kar, D Mizrahi, A Garjani, M Gao, D Griffiths, J Hu, ... arXiv preprint arXiv:2406.09406, 2024 | 8 | 2024 |
Composite relationship fields with transformers for scene graph generation G Adaimi, D Mizrahi, A Alahi Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 7 | 2023 |
[Re] Can gradient clipping mitigate label noise? D Mizrahi, OK Yüksel, AM Kyzy ReScience C 7 (2), 2021 | | 2021 |