Sarfraz Ahmad

SarfrazAhmad739

3 5

SarfrazAhmad307

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

UrduMMLU: A Massive Multitask Benchmark for Urdu Language Understanding

authored a paper 3 days ago

TABVERSE: Benchmarking Cross-Format Table Understanding in LLMs and VLMs

authored a paper 3 days ago

Cultural Benchmarking of LLMs in Standard and Dialectal Arabic Dialogues

View all activity

Organizations

authored 6 papers 3 days ago

NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors

Paper • 2506.10627 • Published Jun 12, 2025

A Parallel Cross-Lingual Benchmark for Multimodal Idiomaticity Understanding

Paper • 2601.08645 • Published Feb 24

liked a Space 25 days ago

AI Deadlines

⚡

772

Find upcoming AI conference and workshop deadlines

updated a dataset 25 days ago

MBZUAI/TABVERSE

Viewer • Updated 25 days ago • 1.33k • 1.76k • 2

updated a dataset 26 days ago

MBZUAI/UrduMMLU

Viewer • Updated 25 days ago • 26.4k • 370 • 3

liked a dataset 29 days ago

MBZUAI/TABVERSE

Viewer • Updated 25 days ago • 1.33k • 1.76k • 2

published a dataset 29 days ago

MBZUAI/TABVERSE

Viewer • Updated 25 days ago • 1.33k • 1.76k • 2

published a dataset about 1 month ago

MBZUAI/UrduMMLU

Viewer • Updated 25 days ago • 26.4k • 370 • 3

liked a dataset about 2 months ago

Almheiri/ArabCulture-Dialogue

Viewer • Updated Dec 17, 2025 • 3.47k • 73 • 1

upvoted a collection 2 months ago

Jais-2-Family

Collection

The 2nd generation of the Jais Large Language Models Family • 4 items • Updated Feb 20 • 15

upvoted a collection 8 months ago

Jais Family

Collection

The Jais Family of Models • 22 items • Updated Dec 9, 2025 • 16

liked a Space 8 months ago

OpenFactCheck

✅

Evaluate factual accuracy of text and AI‑generated content

authored 2 papers 10 months ago

iBitter-Stack: A Multi-Representation Ensemble Learning Model for Accurate Bitter Peptide Identification

Paper • 2505.15730 • Published May 21, 2025

UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking

Paper • 2505.15063 • Published May 21, 2025

liked a dataset 10 months ago

MBZUAI/EXAMS-V

Viewer • Updated Sep 18, 2025 • 24.5k • 220 • 10

upvoted a paper 10 months ago

Persuasion Dynamics in LLMs: Investigating Robustness and Adaptability in Knowledge and Safety with DuET-PD

Paper • 2508.17450 • Published Aug 24, 2025 • 9

Sarfraz Ahmad

AI & ML interests

Recent Activity

Organizations

SarfrazAhmad739's activity

AI Deadlines

OpenFactCheck