Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth Paper • 2605.25052 • Published 6 days ago • 13
Precise In-Parameter Concept Erasure in Large Language Models Paper • 2505.22586 • Published May 28, 2025 • 1
Enhancing Automated Interpretability with Output-Centric Feature Descriptions Paper • 2501.08319 • Published May 29, 2025 • 11
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations Paper • 2509.03405 • Published Sep 3, 2025 • 24
Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context Paper • 2510.06182 • Published Oct 7, 2025 • 9
Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth Paper • 2605.25052 • Published 6 days ago • 13
BonaFide Collection A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1
Sleeping Agents 3 BonaFide Leaderboard 📊 3 A leaderboard for chain-of-thought faithfulness metrics.
Sleeping Agents 3 BonaFide Leaderboard 📊 3 A leaderboard for chain-of-thought faithfulness metrics.
BonaFide Collection A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1
BonaFide Collection A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1
BonaFide Collection A benchmark for evaluating faithfulness metrics using ground-truth labels. The collection includes the leaderboard, as well as the datasets. • 4 items • Updated 4 days ago • 1