Too Big to Think: Capacity, Memorization, and Generalization in Pre-Trained Transformers • Paper 2506.09099 • Published Jun 10, 2025
Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games • Paper 2408.15950 • Published Aug 28, 2024