5 5 14

Lazy Beaver

Jayce-Ping

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

updated a model 6 days ago

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Multi-Reward

updated a model 6 days ago

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Single-Reward

View all activity

Organizations

authored a paper 6 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 8 days ago • 41

updated 4 models 6 days ago

updated a collection 6 days ago

Flow-DPPO: GenEval2

Collection

Flow-DPPO-trained LoRA adapters (single- and multi-reward) for SD3.5 and FLUX.2-klein-9B optimized on GenEval2. • 5 items • Updated 6 days ago

upvoted a paper 6 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 8 days ago • 41

upvoted a paper 7 days ago

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 9 days ago • 33

updated a collection 8 days ago

Flow-DPPO: GenEval2

Collection

Flow-DPPO-trained LoRA adapters (single- and multi-reward) for SD3.5 and FLUX.2-klein-9B optimized on GenEval2. • 5 items • Updated 6 days ago

published 4 models 8 days ago

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Multi-Reward

Text-to-Image • Updated 6 days ago • 32 • 2

Tencent-Hunyuan-Multimodal-RL/FLUX2-klein-base-9b-GenEval2-Single-Reward

Text-to-Image • Updated 6 days ago • 37 • 1

Tencent-Hunyuan-Multimodal-RL/SD3.5-GenEval2-Multi-Reward

Text-to-Image • Updated 6 days ago • 39

Tencent-Hunyuan-Multimodal-RL/SD3.5-GenEval2-Single-Reward

Text-to-Image • Updated 6 days ago • 38

updated a collection 16 days ago

OPD-Teachers

Collection

3 items • Updated 16 days ago

updated a model 16 days ago

Jayce-Ping/Pickscore-Teacher

Updated 16 days ago • 44

Lazy Beaver

AI & ML interests

Recent Activity

Organizations

Jayce-Ping's activity