Nishit Anand's picture

5 2

Nishit Anand

nishitanand

·

AI & ML interests

Computer Vision, Deep Learning

Recent Activity

authored a paper 3 days ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

upvoted a paper 3 days ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

upvoted a paper about 1 month ago

EgoAVU: Egocentric Audio-Visual Understanding

View all activity

Organizations

authored a paper 3 days ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published 5 days ago • 13

upvoted a paper 3 days ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published 5 days ago • 13

upvoted a paper about 1 month ago

EgoAVU: Egocentric Audio-Visual Understanding

Paper • 2602.06139 • Published Feb 5 • 12

authored 2 papers 7 months ago

Do Audio-Language Models Understand Linguistic Variations?

Paper • 2410.16505 • Published Oct 21, 2024 • 1

MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence

Paper • 2508.13992 • Published Aug 19, 2025 • 7

upvoted a paper 7 months ago

MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence

Paper • 2508.13992 • Published Aug 19, 2025 • 7

liked a Space 7 months ago

GAMA

Generate text based on audio input and questions

upvoted 2 papers 7 months ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 24

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6, 2025 • 27

liked a dataset 7 months ago

gamma-lab-umd/MMAU-Pro

Viewer • Updated Aug 28, 2025 • 5.31k • 3.49k • 17