Roman Nekrasov

Rob1234567

romannekrasovaillm

AI & ML interests

Areas of interest: agentic mid-training, reinforcement learning with reward verification (RLVR), scaling agent environments, interleaved agent reasoning with tools

Recent Activity

upvoted a collection 1 day ago

GigaChat 3.1

upvoted a paper 3 days ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

liked a model 12 days ago

Qwen/Qwen3.5-35B-A3B

View all activity

Organizations

None yet

upvoted a collection 1 day ago

GigaChat 3.1

Collection

6 items • Updated 2 days ago • 43

upvoted a paper 3 days ago

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 5 days ago • 69

liked a model 12 days ago

Qwen/Qwen3.5-35B-A3B

Image-Text-to-Text • 36B • Updated 27 days ago • 2.93M • • 1.26k

liked a dataset about 1 month ago

Fujitsu-FRE/MAPS

Viewer • Updated Feb 10 • 8.86k • 68 • 9

upvoted a paper about 1 month ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

liked 2 models about 2 months ago

yujiepan/kimi-k2.5-tiny-random

Feature Extraction • 3.24M • Updated Feb 20 • 76 • 1

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 27 days ago • 3.98M • • 2.36k

liked 2 models 2 months ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated Dec 3, 2025 • 1.11M • • 983

google/medasr

Automatic Speech Recognition • Updated Jan 26 • 21.5k • 295

upvoted a collection 2 months ago

MedGemma Release

Collection

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated 14 days ago • 454

New activity in nvidia/Nemotron-Agentic-v1 3 months ago

Inquiry regarding Banking Domain data mentioned in Nemotron 3 Nano Paper (arXiv:2512.20848, p. 17)

#4 opened 3 months ago by

Rob1234567

liked a model 4 months ago

allenai/Olmo-3-32B-Think

Text Generation • 1.05M • Updated Jan 5 • 5.39k • 169

upvoted a collection 4 months ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 7 items • Updated 24 days ago • 167

upvoted a paper 4 months ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 93

upvoted 2 articles 4 months ago

Article

What makes good reasoning data

Oct 30, 2025

•

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

upvoted a collection 5 months ago

Gemma 3 Release

Collection

28 items • Updated 14 days ago • 627

upvoted a collection 6 months ago

Qwen3Guard

Collection

7 items • Updated Dec 31, 2025 • 64

liked a model 8 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.45M • • 4.61k

liked a model 10 months ago

ai-sage/GigaChat-20B-A3B-instruct

Text Generation • 21B • Updated Jun 25, 2025 • 935 • 50

Roman Nekrasov

AI & ML interests

Recent Activity

Organizations

Rob1234567's activity

Inquiry regarding Banking Domain data mentioned in Nemotron 3 Nano Paper (arXiv:2512.20848, p. 17)

What makes good reasoning data

Aligning to What? Rethinking Agent Generalization in MiniMax M2