Mechanistic Interpretability Benchmark

university
https://mib-bench.github.io
AI & ML interests

Principled evaluation of mechanistic interpretability methods.

Members: Aaron Mueller, Sarah Wiegreffe, Ivan Arcuschin, Dana Arad, Yaniv Nikankin, Aruna S, Rohan Gupta, Michael Hanna, shun shao, Adam Belfki, Atticus Geiger, Yik Siu Chan, Amir Zur, Alessandro Stolfo, Nikhil Prakash, Jing, Hadas Orgad, Martin Tutek, Yonatan Belinkov, Nicolò Brunello

mib-bench's collections (1)

MIB Datasets
The tasks and counterfactuals from the Mechanistic Interpretability Benchmark.
  • mib-bench/ioi

    Viewer • Updated May 29, 2025 • 21k • 2.3k
  • mib-bench/copycolors_mcqa

    Viewer • Updated Jan 16, 2025 • 1.89k • 1.12k
  • mib-bench/arithmetic_addition

    Viewer • Updated May 31, 2025 • 40.4k • 34
  • mib-bench/arithmetic_subtraction

    Viewer • Updated May 31, 2025 • 20.9k • 35