What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
updated a dataset about 11 hours ago: JackBAI/jack-latest-vllm-stack
published a dataset about 11 hours ago: JackBAI/jack-latest-vllm-stack
authored a paper 6 days ago: InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning