Datasets and trained checkpoints of Composition-RL
xuxin
xx18
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Progressive Residual Warmup for Language Model Pretraining authored
a paper
3 days ago
Progressive Residual Warmup for Language Model Pretraining authored
a paper
27 days ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models