Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback Paper • 2410.04064 • Published Oct 5, 2024
DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization Paper • 2412.09169 • Published Dec 12, 2024 • 1
AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models Paper • 2210.03858 • Published Oct 8, 2022
Modal-specific Pseudo Query Generation for Video Corpus Moment Retrieval Paper • 2210.12617 • Published Oct 23, 2022
Background-aware Moment Detection for Video Moment Retrieval Paper • 2306.02728 • Published Jun 5, 2023
MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion Paper • 2510.13702 • Published Oct 15, 2025 • 2
CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models Paper • 2512.03045 • Published Dec 2, 2025 • 2
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 5 days ago • 139
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 5 days ago • 139
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation Paper • 2506.11924 • Published Jun 13, 2025 • 35
PCME++ Collection The official weights of improved Probabilistic Cross-Modal Embeddings (PCME++) • 2 items • Updated May 26, 2024 • 1
ProLIP Collection Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated Apr 18, 2025 • 10
Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation Paper • 2303.07937 • Published Mar 14, 2023
Robust Camera Pose Refinement for Multi-Resolution Hash Encoding Paper • 2302.01571 • Published Feb 3, 2023
Dense Text-to-Image Generation with Attention Modulation Paper • 2308.12964 • Published Aug 24, 2023 • 2