Zachary Bessinger

zbessinger

https://www.zachbessinger.com

AI & ML interests

Multimodal Computer Vision

Recent Activity

upvoted a paper 5 days ago

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Organizations

None yet

liked 2 datasets about 8 hours ago

Idavidrein/gpqa

Benchmark • Updated 13 days ago • 1.25k • 105k • 391

TIGER-Lab/MMLU-Pro

Benchmark • Updated 7 days ago • 12.1k • 126k • 456

upvoted 6 papers 5 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published 12 days ago • 108

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published 15 days ago • 139

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published 6 days ago • 87

liked a model 2 months ago

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • Updated Nov 26, 2025 • 2.87M • 552

liked a Space 3 months ago

Vision Arena (Testing VLMs side-by-side)

🖼

560

Explore Vision Arena’s computer‑vision tools online

upvoted a paper 4 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 133

liked 2 Spaces 4 months ago

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection

Transformers Timeline

🤗

Interactive timeline to explore the 🤗Transformers models

liked a Space 5 months ago

The Smol Training Playbook

📚

3.05k

The secrets to building world-class LLMs

liked a model 5 months ago

zai-org/GLM-4.6-FP8

Text Generation • Updated Oct 16, 2025 • 20.8k • 98

liked 2 models 6 months ago

merve/smol-vision

Image-Text-to-Text • Updated Nov 5, 2025 • 192

kudzueye/boreal-qwen-image

Text-to-Image • Updated Sep 5, 2025 • 5.93k • 125

upvoted a collection 9 months ago

Qwen2.5-Omni

Collection

End-to-end omni model (text, audio, image, video, and natural speech interaction) based on Qwen2.5 • 6 items • Updated 16 days ago • 165

liked a model 9 months ago

TIGER-Lab/VLM2Vec-Qwen2VL-7B

Image-Text-to-Text • Updated May 3, 2025 • 1.83k • 10

liked a Space 9 months ago

MMEB Leaderboard

📊

103

The massive multimodal embedding benchmark