Charleno Pires

charleno

AI & ML interests

None yet

Organizations

None yet

upvoted an article about 4 hours ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

upvoted an article about 15 hours ago

Article

New in llama.cpp: Anthropic Messages API

Jan 19

liked 2 models about 16 hours ago

unsloth/gemma-4-26B-A4B-it-GGUF

Image-Text-to-Text • 25B • Updated 2 days ago • 1.36M • 382

openai/gpt-oss-safeguard-20b

Text Generation • Updated Jan 14 • 47.1k • 206

upvoted a changelog 1 day ago

Hugging Face Changelog

Agent Traces on the Hub

3 days ago

• 82

liked a model 1 day ago

zai-org/GLM-5.1

Text Generation • 754B • Updated 2 days ago • 15.9k • 871

liked a model 2 days ago

arcee-ai/Trinity-Large-Thinking

Text Generation • 399B • Updated 1 day ago • 12.7k • 141

published a Space 11 days ago

Tads

📓

TADS practice

liked a model 14 days ago

dystrio/Qwen3.5-9B-Sculpt-Throughput

Text Generation • 8B • Updated 18 days ago • 341 • 2

upvoted an article 15 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

• 291

liked a dataset 18 days ago

stanfordnlp/imdb

Viewer • Updated Jan 4, 2024 • 100k • 209k • 366

liked a dataset 19 days ago

huggingface-course/supervised-finetuning_quiz_student_responses

Viewer • Updated about 16 hours ago • 10 • 535 • 3

liked a model 20 days ago

mistralai/Mistral-Small-4-119B-2603

119B • Updated 15 days ago • 74.5k • 349

liked 2 models 21 days ago

XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated Feb 27 • 58.9k • 706

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated about 1 month ago • 686k • 1.37k

liked a model 28 days ago

DavidAU/Qwen3.5-40B-Claude-4.5-Opus-High-Reasoning-Thinking

Image-Text-to-Text • 40B • Updated 23 days ago • 1.05k • 39

liked a model 29 days ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 26 days ago • 845k • 1.43k

liked a model about 1 month ago

stepfun-ai/Step-3.5-Flash

Text Generation • 199B • Updated 24 days ago • 124k • 771

liked a model about 2 months ago

knowledgator/gliner-bi-edge-v2.0

Token Classification • Updated Feb 24 • 77 • 8

upvoted a collection about 2 months ago

GLiNER-bi-V2

Collection

4 items • Updated Jan 30 • 7
