TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution • Paper • 2602.09662 • Published 28 days ago • 6
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models • Paper • 2601.21639 • Published Jan 29 • 50
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation • Paper • 2501.06598 • Published Jan 11, 2025 • 2