Joel Wang's picture

Joel Wang

joelhenwang

·

joelhenwang

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

upvoted a paper 6 days ago

SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

upvoted a paper 6 days ago

Learning, Fast and Slow: Towards LLMs That Adapt Continually

View all activity

Organizations

upvoted 9 papers 6 days ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published 8 days ago • 216

SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

Paper • 2605.08738 • Published 12 days ago • 13

Learning, Fast and Slow: Towards LLMs That Adapt Continually

Paper • 2605.12484 • Published 9 days ago • 17

Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs

Paper • 2605.12460 • Published 9 days ago • 17

EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents

Paper • 2605.13941 • Published 8 days ago • 23

TextLDM: Language Modeling with Continuous Latent Diffusion

Paper • 2605.07748 • Published 13 days ago • 26

Teaching Language Models to Think in Code

Paper • 2605.07237 • Published 10 days ago • 30

Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models

Paper • 2605.07721 • Published 13 days ago • 29

SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Paper • 2605.07465 • Published 13 days ago • 29

commented a paper 14 days ago

TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

Paper • 2603.21365 • Published Mar 22 •

liked a model 15 days ago

kshitijthakkar/deepseek-v4-mini-300M-from-flash

Text Generation • 0.3B • Updated 15 days ago • 207 • 5

liked a dataset 15 days ago

nampdn-ai/tiny-textbooks

Viewer • Updated Jul 3, 2024 • 420k • 928 • 177

liked 4 models 15 days ago

HuggingFaceTB/nanowhale-100m

Text Generation • 0.1B • Updated 17 days ago • 4.12k • 58

TenStrip/LTX2.3-10Eros

Image-to-Video • Updated 10 days ago • 168k • 296

poolside/Laguna-XS.2

Text Generation • 33B • Updated about 8 hours ago • 50.2k • 260

SulphurAI/Sulphur-2-base

Text-to-Video • 9B • Updated 3 days ago • 1.16M • 1.21k

liked 3 models 17 days ago

NucleusAI/Nucleus-Image

Text-to-Image • Updated Apr 16 • 1.29k • • 251

PleIAs/Baguettotron

Text Generation • 0.3B • Updated 24 days ago • 2k • 253

instruction-pretrain/InstructLM-500M

Text Generation • 0.6B • Updated Mar 2 • 1.41k • 38

liked a model 29 days ago

StentorLabs/Portimbria-150M

Text Generation • 0.2B • Updated 27 days ago • 909 • 9