Иван Поляков's picture

Иван Поляков

meme-addict

·

AI & ML interests

None yet

Recent Activity

liked a dataset about 13 hours ago

felixwangg/prime_vul_minus_splitted_line_diff_mask_skip_indent_ctx5_chat_v2

liked a model 1 day ago

rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new

upvoted a paper 2 days ago

Adam's Law: Textual Frequency Law on Large Language Models

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 11 days ago • 461

upvoted a paper 3 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 10 days ago • 349

upvoted a paper 4 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 17 days ago • 347

upvoted a paper 5 days ago

T5Gemma-TTS Technical Report

Paper • 2604.01760 • Published 11 days ago • 11

upvoted a paper 9 days ago

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published 16 days ago • 25

upvoted a paper 12 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 24 days ago • 331

upvoted a paper 14 days ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published 18 days ago • 154

upvoted a paper 27 days ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted 5 papers about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 216

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 208

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 244

upvoted 2 papers 2 months ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 289

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 154