Submitted by akhaliq 34 OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models · 16 authors 4.08k 7
Submitted by akhaliq 23 Scaling Relationship on Learning Mathematical Reasoning with Large Language Models · 6 authors 270
Submitted by akhaliq 19 MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies · 6 authors 185
Submitted by akhaliq 13 The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World · 14 authors 506
Submitted by akhaliq 13 HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions · 7 authors
Submitted by akhaliq 8 Ambient Adventures: Teaching ChatGPT on Developing Complex Stories · 5 authors
Submitted by akhaliq 3 TDMD: A Database for Dynamic Color Mesh Subjective and Objective Quality Explorations · 5 authors