
Aykut Çayır PRO

acayir64

https://scholar.google.com/citations?user=TEh4eE0AAAAJ&hl=tr&oi=ao

AI & ML interests

Deep learning applications

Recent Activity

upvoted an article 4 days ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

updated a model 4 days ago

acayir64/nl2docker_mlx

published a model 4 days ago

acayir64/nl2docker_mlx



upvoted an article 4 days ago

Article

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

karina-zadorozhny • Jan 19 • 26

upvoted an article 8 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 778

upvoted an article about 1 year ago

Article

Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers?

Kseniase • Apr 4, 2025 • 16

upvoted an article over 1 year ago

Article

Deriving DPO's Loss

hba123 • Dec 24, 2024 • 30

upvoted an article almost 2 years ago

Article

Welcome Gemma 2 - Google’s new open LLM

philschmid, osanseviero, pcuenq, lewtun, tomaarsen, reach-vb • Jun 27, 2024 • 132

upvoted an article about 2 years ago

Article

CodeGemma - an official Google release for code LLMs

pcuenq, osanseviero, reach-vb, philschmid, mishig, loubnabnl • Apr 9, 2024 • 107

upvoted a collection about 2 years ago

Vision Language Models Papers 🖼️💬📝

Collection

Papers about vision-language models; the most important ones are at the top of the list. • 27 items • Updated Apr 30, 2024 • 40
