Yuheng Zhang's picture

Yuheng Zhang

MatouK98

·

AI & ML interests

None yet

Organizations

authored a paper almost 2 years ago

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30, 2024 • 7

authored a paper about 2 years ago

Offline Learning in Markov Games with General Function Approximation

Paper • 2302.02571 • Published Feb 6, 2023