Enrico Shippole
conceptofmind
160 followers · 3 following
https://www.teraflopai.com/
EnricoShippole
conceptofmind
AI & ML interests
None yet
Recent Activity
reacted to tomaarsen's post with 🔥 about 9 hours ago
Announcing the Ettin Reranker family: six new state-of-the-art CrossEncoder rerankers for search from 17M to 1B parameters, plus the full training data and the ~150-line recipe. Built on the Ettin ModernBERT encoders, Apache 2.0.

Details:

All six were trained with the same single-stage pointwise MSE distillation recipe, with mixedbread-ai/mxbai-rerank-large-v2 (1.54B) as the teacher. Only the learning rate and per-device batch size change between sizes.

The 1B student matches the teacher within 0.0001 NDCG@10 on MTEB(eng, v2) Retrieval, the 150M is the strongest reranker I tested in the under-600M range, and the 17M beats the 33M ms-marco-MiniLM-L12-v2 by +0.051 NDCG@10 at roughly half the parameter count.

Speed matters as much as quality for a reranker, since it determines whether the model fits the latency budget between retrieval and showing results. Our 17M is the fastest reranker in the whole comparison at 7517 pairs/sec on an H100. Our 150M runs 2.3x faster than the two other 150M ModernBERT-base rerankers (gte-reranker-modernbert-base and granite-embedding-reranker-english-r2) because the modular Transformer module propagates unpadded inputs through every layer rather than just the FA2 attention kernel. And our 1B is 2.4x faster than its 1.5B teacher while matching it on quality.

I bootstrapped the training recipe with the new train-sentence-transformers Agent Skill shipped in Sentence Transformers v5.5.0. Install it with `hf skills add train-sentence-transformers --claude` and ask Claude Code (or Codex / Cursor / Gemini CLI) to fine-tune a SentenceTransformer, CrossEncoder, or SparseEncoder model on your data.

I wrote a blog post walking through usage, results across six embedder pairings, the speed story, and the complete training script. Check it out, or just point your Agent to the URL: https://huggingface.co/blog/ettin-reranker

Collection: https://huggingface.co/collections/cross-encoder/ettin-rerankers
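As a rough illustration of the pointwise MSE distillation objective the post mentions, the sketch below computes the mean squared error between a student's relevance scores and a teacher's scores for the same query-document pairs. The function name `pointwise_mse` and the example scores are hypothetical, chosen only for illustration. This is not the Ettin training script, which the post says is in the linked blog post.

```python
from typing import Sequence


def pointwise_mse(student_scores: Sequence[float], teacher_scores: Sequence[float]) -> float:
    """Mean squared error between student and teacher scores.

    Each position holds the score for one (query, document) pair, so the
    student is pushed toward the teacher's score pair by pair ("pointwise").
    """
    if len(student_scores) != len(teacher_scores):
        raise ValueError("student and teacher must score the same pairs")
    if not student_scores:
        raise ValueError("need at least one scored pair")
    squared_errors = [(s - t) ** 2 for s, t in zip(student_scores, teacher_scores)]
    return sum(squared_errors) / len(squared_errors)


# Hypothetical scores for three (query, document) pairs.
teacher = [0.5, 0.0, 1.0]
student = [0.5, 0.5, 0.0]
loss = pointwise_mse(student, teacher)
print(f"distillation loss: {loss:.4f}")
```

In an actual training run this quantity would be computed on model outputs and minimized by gradient descent; the plain-Python version here only shows what is being measured.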
updated a dataset about 12 hours ago: TeraflopAI/caselaw-evaluation
published a dataset about 16 hours ago: TeraflopAI/caselaw-evaluation
Organizations
conceptofmind's Spaces (1)
Runtime error
Agents
3
PaLM