SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations Paper • 2512.05905 • Published Dec 5, 2025 • 20
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning Paper • 2511.06805 • Published Nov 10, 2025 • 13
WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation Paper • 2511.06251 • Published Nov 9, 2025 • 14
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation Paper • 2511.08195 • Published Nov 11, 2025 • 32
The Smol Training Playbook • Featured • 2.96k • The secrets to building world-class LLMs
Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published Oct 20, 2025 • 68
zai-org/GLM-4.1V-9B-Thinking Image-Text-to-Text • 10B • Updated Oct 25, 2025 • 193k • 769
CogView: Mastering Text-to-Image Generation via Transformers Paper • 2105.13290 • Published May 26, 2021
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations Paper • 2402.04236 • Published Feb 6, 2024 • 9