🔄 In a Training Loop

Urro

urroxyz

82 686 97

https://urro.xyz/

AI & ML interests

computational linguistics major 🤖🔎🔠 i am autistic. if i come off rude, i probably didn't mean to. please feel free to ask me for clarification.

Recent Activity

upvoted a paper 2 days ago

ReFreeKV: Towards Threshold-Free KV Cache Compression

updated a collection 2 days ago

WTF GENIUS PAPERS

upvoted a paper 2 days ago

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

upvoted 7 papers 2 days ago

ReFreeKV: Towards Threshold-Free KV Cache Compression

Paper • 2502.16886 • Published 8 days ago • 47

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

Paper • 2606.31315 • Published 4 days ago • 71

Multi-Block Diffusion Language Models

Paper • 2606.29215 • Published 4 days ago • 30

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

Paper • 2606.32032 • Published 4 days ago • 21

AsyncOPD: How Stale Can On-Policy Distillation Be?

Paper • 2606.24143 • Published 11 days ago • 29

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

Paper • 2606.30406 • Published 5 days ago • 11

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 5 days ago • 91

upvoted 3 papers 3 days ago

JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting

Paper • 2606.18394 • Published 9 days ago • 35

Information-Aware KV Cache Compression for Long Reasoning

Paper • 2606.26875 • Published 9 days ago • 11

Simplified Sparse Attention via Gist Tokens

Paper • 2604.20920 • Published 8 days ago • 5

upvoted 2 papers 5 days ago

The Tatoxa System for Text Detoxification in Low-Resource Languages: The Case of Tatar

Paper • 2606.26015 • Published 10 days ago • 10

Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs

Paper • 2606.27378 • Published May 7 • 58

upvoted a paper 6 days ago

Discretizing Reward Models

Paper • 2606.21795 • Published 15 days ago • 17

upvoted a changelog 7 days ago

Hugging Face Changelog

Share your feedback with us

8 days ago • 104

upvoted 3 papers 7 days ago

CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation

Paper • 2502.21074 • Published Feb 28, 2025 • 5

Demystifying Training-Time Augmentation for Data-Constrained Language Model Pretraining

Paper • 2606.16246 • Published 15 days ago • 4

Tapered Language Models

Paper • 2606.23670 • Published 12 days ago • 9

upvoted 2 papers 11 days ago

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

Paper • 2606.18844 • Published 17 days ago • 18

Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models

Paper • 2606.16700 • Published 19 days ago • 14

upvoted a paper 12 days ago

RepSelect: Robust LLM Unlearning via Representation Selectivity

Paper • 2606.17168 • Published 19 days ago • 5
