Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

FAR AI

non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Activity Feed Request to join this org

AI & ML interests

Frontier alignment research to ensure the safe development and deployment of advanced AI systems.

Recent Activity

chrisjcundy  updated a dataset about 10 hours ago
AlignmentResearch/deceptive-followup-v15
chrisjcundy  published a dataset about 10 hours ago
AlignmentResearch/deceptive-followup-v15
taufeeque  updated a collection 5 days ago
Diverse Deception Probes
View all activity

Papers

Exposing the Systematic Vulnerability of Open-Weight Models to Prefill Attacks

View all Papers

Adam Gleave's profile pictureMohammad Taufeeque's profile pictureTom Tseng's profile pictureOskar John Hollinsworth's profile pictureAaron Tucker's profile pictureChris Cundy's profile pictureKellin Pelrine's profile pictureLars Yencken's profile pictureJames Collins's profile pictureAnn-Kathrin Dombrowski's profile pictureLevon Avagyan's profile pictureSam Adam-Day's profile pictureLukas Struppek's profile pictureMatt Pallissard's profile pictureTigist Diriba's profile picturePranav Gade's profile pictureTuomas Oikarinen's profile picture

AlignmentResearch 's datasets 96

AlignmentResearch/WordLength-test

Viewer • Updated Jul 26, 2024 • 100k • 6

AlignmentResearch/StrongREJECT-test

Viewer • Updated Jul 26, 2024 • 313 • 12

AlignmentResearch/IMDB-test

Viewer • Updated Jul 26, 2024 • 97.5k • 7

AlignmentResearch/EnronSpam-test

Viewer • Updated Jul 26, 2024 • 62.4k • 6

AlignmentResearch/boxoban-astar-solutions

Preview • Updated Jul 25, 2024 • 73 • 1

AlignmentResearch/RuLES-Encryption

Viewer • Updated Jul 16, 2024 • 50k • 8 • 1
  • Previous
  • 1
  • 2
  • 3
  • 4
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs