5 8

Hu

Alexhu1999

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper about 2 months ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

published a dataset 4 months ago

Alexhu1999/qwen3_embedings

View all activity

Organizations

upvoted 2 papers about 2 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 148

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 91

published a dataset 4 months ago

Alexhu1999/qwen3_embedings

Updated Nov 2, 2025 • 7

updated a Space 5 months ago

Trackio

🚀

Track and visualize project metrics

published a Space 5 months ago

Trackio

🚀

Track and visualize project metrics

updated a model 6 months ago

Alexhu1999/cerebras_kto_iseuiuc

Updated Sep 21, 2025

published a model 6 months ago

Alexhu1999/cerebras_kto_iseuiuc

Updated Sep 21, 2025

updated a dataset 6 months ago

Alexhu1999/maicrl

Updated Sep 19, 2025 • 7

published a dataset 6 months ago

Alexhu1999/maicrl

Updated Sep 19, 2025 • 7

liked 5 datasets 6 months ago

updated a model 7 months ago

Alexhu1999/lfm2_vl

1B • Updated Sep 1, 2025 • 3

liked a model 7 months ago

NexaAI/OmniNeural-4B

Any-to-Any • Updated Nov 7, 2025 • 88 • 163

updated 2 models 7 months ago

Alexhu1999/Qwen3-4B-GSPO-email-retriever

4B • Updated Aug 15, 2025

Alexhu1999/Qwen3-4B-GSPO-email-retriever-120steps

4B • Updated Aug 14, 2025

published a model 7 months ago

Alexhu1999/Qwen3-4B-GSPO-email-retriever-120steps

4B • Updated Aug 14, 2025

updated a model 7 months ago

Alexhu1999/Qwen3-4B-DAPO-email-retriever

4B • Updated Aug 13, 2025

Hu

AI & ML interests

Recent Activity

Organizations

Alexhu1999's activity

Trackio

Trackio