12 19

Henry

danzh0

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

When Models Manipulate Manifolds: The Geometry of a Counting Task

liked a model 20 days ago

unsloth/Qwen3-Coder-Next-GGUF

liked a model 26 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

View all activity

Organizations

None yet

upvoted a paper 5 days ago

When Models Manipulate Manifolds: The Geometry of a Counting Task

Paper • 2601.04480 • Published Jan 8 • 4

liked a model 20 days ago

unsloth/Qwen3-Coder-Next-GGUF

Text Generation • 80B • Updated about 18 hours ago • 502k • 397

liked a model 26 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4

Text Generation • 18B • Updated 3 days ago • 259k • 101

liked 2 models 27 days ago

mlx-community/Jan-v3-4B-base-instruct-4bit

Text Generation • 0.6B • Updated 28 days ago • 389 • 2

mlx-community/Jan-v3-4B-base-instruct-8bit

Text Generation • 1B • Updated 28 days ago • 194 • 3

liked a model about 1 month ago

unsloth/GLM-4.7-Flash-GGUF

Text Generation • 30B • Updated 11 days ago • 324k • 526

upvoted 2 articles about 1 month ago

Article

Open Responses: What you need to know

Jan 15

•

108

Article

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Jan 6

•

liked 3 models about 1 month ago

upvoted an article about 2 months ago

Article

Deriving the PPO Loss from First Principles

Dec 25, 2025

•

liked a model about 2 months ago

MiniMaxAI/MiniMax-M2.1

Text Generation • Updated 11 days ago • 72.7k • • 1.26k

upvoted 2 articles 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11, 2025

•

181

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Dec 18, 2025

•

120

liked a model 2 months ago

apple/Sharp

Image-to-3D • Updated Dec 18, 2025 • 1.74k • 347

liked 2 models 3 months ago

janhq/Jan-v2-VL-high-gguf

Image-Text-to-Text • 8B • Updated Nov 26, 2025 • 80.5k • 35

apple/starflow

Updated 26 days ago • 280

upvoted 2 articles 3 months ago

Article

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

Dec 5, 2025

•

Article

Building Deep Research: How we Achieved State of the Art

Nov 24, 2025

•

Henry

AI & ML interests

Recent Activity

Organizations

danzh0's activity

Open Responses: What you need to know

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Deriving the PPO Loss from First Principles

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

Building Deep Research: How we Achieved State of the Art