4 15 3

Jeff

JiayuJeff

JiayuJeff

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

JiayuJeff/CostBench

upvoted a paper 4 days ago

Model-Adaptive Tool Necessity Reveals the Knowing-Doing Gap in LLM Tool Use

authored a paper 11 days ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

View all activity

Organizations

None yet

liked a dataset 4 days ago

JiayuJeff/CostBench

Viewer • Updated Apr 9 • 381 • 66 • 2

upvoted a paper 4 days ago

Model-Adaptive Tool Necessity Reveals the Knowing-Doing Gap in LLM Tool Use

Paper • 2605.14038 • Published 8 days ago • 12

authored a paper 11 days ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published 15 days ago • 22

upvoted a paper 13 days ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published 15 days ago • 22

liked a dataset 14 days ago

chengq9/CreativityBench

Viewer • Updated 14 days ago • 3.29k • 98 • 2

updated a dataset about 1 month ago

JiayuJeff/CostBench

Viewer • Updated Apr 9 • 381 • 66 • 2

published a dataset about 1 month ago

JiayuJeff/CostBench

Viewer • Updated Apr 9 • 381 • 66 • 2

upvoted a paper 3 months ago

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Paper • 2603.03202 • Published Mar 3 • 17

upvoted a collection 3 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.64k

upvoted 4 papers 4 months ago

authored a paper 4 months ago

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 30

upvoted a paper 4 months ago

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 30

submitted a paper to Daily Papers 4 months ago

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 30

upvoted a paper 4 months ago

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Paper • 2601.07264 • Published Jan 12 • 24

authored a paper 6 months ago

CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

Paper • 2510.24505 • Published Oct 28, 2025 • 4

upvoted a paper 6 months ago

CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

Paper • 2510.24505 • Published Oct 28, 2025 • 4

commented a paper 6 months ago

CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

Paper • 2510.24505 • Published Oct 28, 2025 • 4 •

Jeff

AI & ML interests

Recent Activity

Organizations

JiayuJeff's activity