Pouya Esmaeili's picture

8 4

Pouya Esmaeili

Pouyae

·

https://pouyae.xyz

AI & ML interests

RAG/LLM/Agents

Organizations

None yet

upvoted a collection 10 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 9 days ago • 106

upvoted 3 articles about 1 year ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

natolambert, LouisCastricato, lvwerra, Dahoas

•

Dec 9, 2022

• 416

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 351

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

tngtech

•

Apr 16, 2025

• 81

upvoted 2 collections about 1 year ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 675

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 212

upvoted 2 papers over 2 years ago

Grandmaster-Level Chess Without Search

Paper • 2402.04494 • Published Feb 7, 2024 • 70

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 264