Mingzhe Li
Mubuky
AI & ML interests
RL & Agent
Recent Activity
authored a paper about 12 hours ago
STAR-S: Improving Safety Alignment through Self-Taught Reasoning on Safety Rules
liked a model about 20 hours ago
OpenMOSS-Team/SciThinker-4B
liked a model about 20 hours ago
OpenMOSS-Team/SciThinker-30B