Ai2

Team

non-profit

Verified

https://allenai.org/

allen_ai

allenai

AI & ML interests

Building breatkthrough AI to solve the world's biggest problems.

Recent Activity

shriyaa44 updated a dataset about 1 hour ago

allenai/asta-summary-citation-counts

baileyk updated a dataset about 2 hours ago

allenai/dolma3_mix-6T-1025-7B

baileyk new activity 2 days ago

allenai/dolma3_mix-6T-1025-7B:Full Dataset

View all activity

Papers

Bolmo: Byteifying the Next Generation of Language Models

Olmo 3

View all Papers

shriyaa44

updated a dataset about 1 hour ago

allenai/asta-summary-citation-counts

Viewer • Updated about 1 hour ago • 31.1M • 333 • 6

baileyk

updated a dataset about 2 hours ago

allenai/dolma3_mix-6T-1025-7B

Preview • Updated about 2 hours ago • 273k • 24

baileyk

in allenai/dolma3_mix-6T-1025-7B 2 days ago

Full Dataset

#3 opened 19 days ago by

hamishivi

authored a paper 9 days ago

Olmo 3

Paper • 2512.13961 • Published 27 days ago • 23

faezeb

authored a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

undfined

authored a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

sewon

authored a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

pradeepd

authored a paper about 2 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

hamishivi

authored 2 papers about 2 months ago

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published Nov 10, 2025 • 15

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

yanhong-l

authored a paper 3 months ago

Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs

Paper • 2510.18279 • Published Oct 21, 2025 • 4

davidheineman

authored 3 papers 3 months ago

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation

Paper • 2508.13144 • Published Aug 18, 2025

Evaluating LLMs on Chinese Idiom Translation

Paper • 2508.10421 • Published Aug 14, 2025

Fluid Language Model Benchmarking

Paper • 2509.11106 • Published Sep 14, 2025

stellalisy

authored 6 papers 3 months ago

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Paper • 2404.06664 • Published Apr 10, 2024 • 1

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Paper • 2410.02677 • Published Oct 3, 2024 • 1

Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning

Paper • 2502.14860 • Published Feb 20, 2025

BLAB: Brutally Long Audio Bench

Paper • 2505.03054 • Published May 5, 2025 • 1

Spurious Rewards: Rethinking Training Signals in RLVR

Paper • 2506.10947 • Published Jun 12, 2025 • 2

MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning

Paper • 2406.00922 • Published Jun 3, 2024