Qiguang Chen

LightChen2333

·

https://lightchen233.github.io/

LightChen233

AI & ML interests

None yet

Recent Activity

updated a dataset 10 days ago

LightChen2333/OMIBench

updated a dataset 10 days ago

published a dataset 11 days ago

LightChen2333/OMIBench

View all activity

Organizations

updated 2 datasets 10 days ago

LightChen2333/OMIBench

Viewer • Updated 10 days ago • 1.32k • 43

LARG/OMIBench

Viewer • Updated 10 days ago • 1.32k • 96 • 1

published a dataset 11 days ago

LightChen2333/OMIBench

Viewer • Updated 10 days ago • 1.32k • 43

upvoted 2 papers about 2 months ago

OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Model

Paper • 2604.20806 • Published Apr 22 • 1

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Paper • 2605.19660 • Published May 19 • 40

upvoted a paper 2 months ago

OProver: A Unified Framework for Agentic Formal Theorem Proving

Paper • 2605.17283 • Published May 17 • 31

upvoted 3 papers 3 months ago

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Paper • 2604.23781 • Published Apr 26 • 33

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Paper • 2604.15093 • Published Apr 16 • 30

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published Apr 13 • 69

authored a paper 4 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 432

upvoted a paper 4 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 432

upvoted 8 papers 5 months ago

CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction

Paper • 2603.00610 • Published Feb 28 • 36

Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments

Paper • 2602.01244 • Published Feb 1 • 16

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published Feb 5 • 61

Self-Improving World Modelling with Latent Actions

Paper • 2602.06130 • Published Feb 5 • 32

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published Feb 9 • 38

BABE: Biology Arena BEnchmark

Paper • 2602.05857 • Published Feb 5 • 10

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published Jan 29 • 10

upvoted a paper 6 months ago

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning

Paper • 2602.02472 • Published Feb 2 • 48