1 6 3

Yoav Gur-Arieh

yoavgurarieh

https://yoav.ml

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

authored a paper 4 days ago

Precise In-Parameter Concept Erasure in Large Language Models

authored a paper 4 days ago

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

Paper • 2605.25052 • Published 6 days ago • 13

authored 6 papers 4 days ago

Precise In-Parameter Concept Erasure in Large Language Models

Paper • 2505.22586 • Published May 28, 2025 • 1

Enhancing Automated Interpretability with Output-Centric Feature Descriptions

Paper • 2501.08319 • Published May 29, 2025 • 11

LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published Sep 3, 2025 • 24

Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context

Paper • 2510.06182 • Published Oct 7, 2025 • 9

Disentangling MLP Neuron Weights in Vocabulary Space

Paper • 2604.06005 • Published Apr 7 • 1

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth

Paper • 2605.25052 • Published 6 days ago • 13

updated a collection 4 days ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1

updated a Space 4 days ago

BonaFide Leaderboard

📊

A leaderboard for chain-of-thought faithfulness metrics.

updated 2 datasets 4 days ago

yoavgurarieh/BonaFide

Viewer • Updated 4 days ago • 3.07k • 157 • 1

yoavgurarieh/BonaFide-Extended

Viewer • Updated 4 days ago • 19.5k • 425 • 2

liked a Space 14 days ago

BonaFide Leaderboard

📊

A leaderboard for chain-of-thought faithfulness metrics.

liked a dataset 14 days ago

yoavgurarieh/BonaFide-Extended

Viewer • Updated 4 days ago • 19.5k • 425 • 2

updated a collection 14 days ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1

upvoted a collection 14 days ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1

liked a dataset 14 days ago

yoavgurarieh/BonaFide

Viewer • Updated 4 days ago • 3.07k • 157 • 1

published a dataset 14 days ago

yoavgurarieh/BonaFide-Extended

Viewer • Updated 4 days ago • 19.5k • 425 • 2

updated a collection 15 days ago

BonaFide

Collection

A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1

published a dataset 17 days ago

yoavgurarieh/BonaFide

Viewer • Updated 4 days ago • 3.07k • 157 • 1

Yoav Gur-Arieh

AI & ML interests

Recent Activity

Organizations

yoavgurarieh's activity

BonaFide Leaderboard

BonaFide Leaderboard