DFN Models + Data Collection CLIP Models trained using DFN-2B/DFN-5B datasets • 7 items • Updated Aug 25, 2025 • 21
DepthPro Models Collection Depth Pro: Sharp Monocular Metric Depth in Less Than a Second • 4 items • Updated Aug 25, 2025 • 14
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 22 items • Updated Mar 2 • 39
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 16 items • Updated Mar 2 • 84
MobileCLIP2 Collection MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated Apr 23 • 64
FastVLM Collection Efficient Vision Encoding for Vision Language Models • 8 items • Updated Mar 2 • 114
TiC-CLIP Collection Benchmark for the design of efficient continual learning of image-text models over years. • 8 items • Updated Mar 2 • 11