4 4

Saurabh Jha

saurabhjha1

AI & ML interests

None yet

Recent Activity

upvoted an article 7 days ago

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

upvoted an article 12 days ago

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

published an article 12 days ago

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

View all activity

Organizations

upvoted an article 7 days ago

Article

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

ibm-research

•

7 days ago

• 83

upvoted an article 12 days ago

Article

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ibm-research

•

12 days ago

• 14

published an article 12 days ago

Article

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ibm-research

•

12 days ago

• 14

liked a Space 4 months ago

ITBench-Lite-Space

🚀

Develop and run interactive code notebooks with JupyterLab

upvoted an article 4 months ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

ibm-research

•

Feb 18

• 19

published an article 4 months ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

ibm-research

•

Feb 18

• 19

liked 2 datasets 4 months ago

ibm-research/ITBench-Lite

Updated Apr 21 • 2.62k • 5

ibm-research/ITBench-Trajectories

Updated Jan 19 • 461 • 3

liked a model over 1 year ago

ibm-granite/granite-3.0-8b-base

Text Generation • 8B • Updated Dec 19, 2024 • 1.92k • 26

upvoted a paper about 2 years ago

Larimar: Large Language Models with Episodic Memory Control

Paper • 2403.11901 • Published Mar 18, 2024 • 33

updated a model over 3 years ago

saurabhjha1/ppo-LunarLander-v2

Reinforcement Learning • Updated Jan 16, 2023 • 1

Saurabh Jha

AI & ML interests

Recent Activity

Organizations

saurabhjha1's activity

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ITBench-Lite-Space

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST