-
A Survey of Large Language Models
Paper • 2303.18223 • Published • 13 -
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL
Paper • 2503.23157 • Published • 10 -
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
Paper • 2504.16078 • Published • 21 -
Jiunsong/supergemma4-e4b-abliterated-mlx
Text Generation • 1B • Updated • 4.33k • 36
Tae-Hyoung Choi
selmoch
AI & ML interests
None yet
Recent Activity
updated a collection about 1 month ago
LLM liked a model about 1 month ago
Jiunsong/supergemma4-e4b-abliterated-mlx liked a dataset 10 months ago
allganize/RAG-Evaluation-Dataset-KOOrganizations
None yet