AR Models with FlexTok EPFL-VILAB/FlexAR-113M-T2I Text-to-Image • Updated 24 days ago • 17 • 1 EPFL-VILAB/FlexAR-382M-T2I Text-to-Image • Updated 24 days ago • 9 EPFL-VILAB/FlexAR-1B-T2I Text-to-Image • Updated 24 days ago • 7 EPFL-VILAB/FlexAR-3B-T2I Text-to-Image • Updated 24 days ago • 66
4M Models Multimodal models from https://4m.epfl.ch/ EPFL-VILAB/4M-7_B_CC12M Any-to-Any • 0.4B • Updated Oct 7, 2024 • 148 • 19 EPFL-VILAB/4M-7_L_CC12M Any-to-Any • 1B • Updated Oct 7, 2024 • 11 • 2 EPFL-VILAB/4M-7_XL_CC12M Any-to-Any • 3B • Updated Oct 7, 2024 • 8 • 1 EPFL-VILAB/4M-7_B_COYO700M Any-to-Any • 0.4B • Updated Oct 7, 2024 • 9 • 1
Omnidata depth & normals models Omnidata surface normals and depth estimators sashasax/omnidata_normal_dpt_hybrid_384 Updated Sep 25, 2024 • 1 sashasax/omnidata_depth_dpt_hybrid_384 Updated Sep 25, 2024 • 1 Runtime error 3 Omnidata Monocular Surface Normal Dpt Hybrid 384 🐠 3 Runtime error 3 Omnidata Monocular Depth DPT Hybrid 384 🐠 3
FlexTok Tokenizers & VAEs Flexible 1D tokenizers and VAEs from https://flextok.epfl.ch/ EPFL-VILAB/flextok_d18_d28_dfn 3B • Updated Mar 19, 2025 • 5.43k • 1 EPFL-VILAB/flextok_d18_d28_in1k 3B • Updated Mar 19, 2025 • 124 EPFL-VILAB/flextok_d18_d18_in1k 0.9B • Updated Mar 19, 2025 • 6 EPFL-VILAB/flextok_d12_d12_in1k 0.3B • Updated Mar 19, 2025 • 30
4M Tokenizers Multimodal tokenizers from https://4m.epfl.ch/ EPFL-VILAB/4M_tokenizers_rgb_16k_224-448 0.3B • Updated Jun 14, 2024 • 3.15k • 4 EPFL-VILAB/4M_tokenizers_depth_8k_224-448 0.3B • Updated Jun 14, 2024 • 1.5k • 1 EPFL-VILAB/4M_tokenizers_normal_8k_224-448 0.3B • Updated Jun 14, 2024 • 1.62k • 1 EPFL-VILAB/4M_tokenizers_semseg_4k_224-448 0.2B • Updated Jun 14, 2024 • 1.57k • 1
AR Models with FlexTok EPFL-VILAB/FlexAR-113M-T2I Text-to-Image • Updated 24 days ago • 17 • 1 EPFL-VILAB/FlexAR-382M-T2I Text-to-Image • Updated 24 days ago • 9 EPFL-VILAB/FlexAR-1B-T2I Text-to-Image • Updated 24 days ago • 7 EPFL-VILAB/FlexAR-3B-T2I Text-to-Image • Updated 24 days ago • 66
FlexTok Tokenizers & VAEs Flexible 1D tokenizers and VAEs from https://flextok.epfl.ch/ EPFL-VILAB/flextok_d18_d28_dfn 3B • Updated Mar 19, 2025 • 5.43k • 1 EPFL-VILAB/flextok_d18_d28_in1k 3B • Updated Mar 19, 2025 • 124 EPFL-VILAB/flextok_d18_d18_in1k 0.9B • Updated Mar 19, 2025 • 6 EPFL-VILAB/flextok_d12_d12_in1k 0.3B • Updated Mar 19, 2025 • 30
4M Models Multimodal models from https://4m.epfl.ch/ EPFL-VILAB/4M-7_B_CC12M Any-to-Any • 0.4B • Updated Oct 7, 2024 • 148 • 19 EPFL-VILAB/4M-7_L_CC12M Any-to-Any • 1B • Updated Oct 7, 2024 • 11 • 2 EPFL-VILAB/4M-7_XL_CC12M Any-to-Any • 3B • Updated Oct 7, 2024 • 8 • 1 EPFL-VILAB/4M-7_B_COYO700M Any-to-Any • 0.4B • Updated Oct 7, 2024 • 9 • 1
4M Tokenizers Multimodal tokenizers from https://4m.epfl.ch/ EPFL-VILAB/4M_tokenizers_rgb_16k_224-448 0.3B • Updated Jun 14, 2024 • 3.15k • 4 EPFL-VILAB/4M_tokenizers_depth_8k_224-448 0.3B • Updated Jun 14, 2024 • 1.5k • 1 EPFL-VILAB/4M_tokenizers_normal_8k_224-448 0.3B • Updated Jun 14, 2024 • 1.62k • 1 EPFL-VILAB/4M_tokenizers_semseg_4k_224-448 0.2B • Updated Jun 14, 2024 • 1.57k • 1
Omnidata depth & normals models Omnidata surface normals and depth estimators sashasax/omnidata_normal_dpt_hybrid_384 Updated Sep 25, 2024 • 1 sashasax/omnidata_depth_dpt_hybrid_384 Updated Sep 25, 2024 • 1 Runtime error 3 Omnidata Monocular Surface Normal Dpt Hybrid 384 🐠 3 Runtime error 3 Omnidata Monocular Depth DPT Hybrid 384 🐠 3