MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training

Published in CVPR, 2024

Direct Link