Not Lain's picture

Building on HF

Not Lain PRO

not-lain

chonkie-ai

·

https://not-lain.github.io

AI & ML interests

custom AI models with HF integration, HuggingFace fellow 🤗

Recent Activity

liked a Space about 7 hours ago

abidlabs/daggr-3d

reacted to alvarobartt's post with 🚀 1 day ago

💥 `hf-mem` v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the `--experimental` flag! `uvx hf-mem --model-id ... --experimental` will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable. 💡 Alternatively, you can also set the `--max-model-len`, `--batch-size` and `--kv-cache-dtype` arguments (à la vLLM) manually if preferred.

reacted to alvarobartt's post with 🔥 1 day ago

💥 `hf-mem` v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the `--experimental` flag! `uvx hf-mem --model-id ... --experimental` will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable. 💡 Alternatively, you can also set the `--max-model-len`, `--batch-size` and `--kv-cache-dtype` arguments (à la vLLM) manually if preferred.

View all activity

Organizations

liked a Space about 7 hours ago

Daggr 3d

reacted to alvarobartt's post with 🚀🔥 1 day ago

Post

1547

💥 hf-mem v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the --experimental flag!

uvx hf-mem --model-id ... --experimental will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable.

💡 Alternatively, you can also set the --max-model-len, --batch-size and --kv-cache-dtype arguments (à la vLLM) manually if preferred.

1 reply

·

upvoted a collection 1 day ago

Trinity-Large

5 items • Updated 1 day ago • 31

updated 2 Spaces 1 day ago

Text-Streaming

text streaming space using Gemma-7B

RAG-Chatbot

A retrieval system with chatbot integration

liked a model 1 day ago

deepseek-ai/DeepSeek-OCR-2

Image-Text-to-Text • 3B • Updated about 3 hours ago • 30.9k • 520

upvoted a changelog 3 days ago

Changelog

MLX Hardware Compatibility

7 days ago

• 32

liked 2 Spaces 3 days ago

Echo-TTS Preview

Fast, multi-speaker TTS (44.1kHz) with voice cloning

Korean Open Source Heatmap

Explore Korean AI open source contributions and releases

upvoted an article 9 days ago

Article

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

23 days ago

•

20

upvoted a paper 21 days ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 23 days ago • 141

upvoted an article 22 days ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

24 days ago

•

72

upvoted a changelog 22 days ago

Changelog

HuggingChat for Papers

23 days ago

• 99

upvoted an article 25 days ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

27 days ago

•

12

upvoted a paper 27 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 30 days ago • 289

upvoted a changelog about 1 month ago

Changelog

Set your primary organization on your profile

Dec 19, 2025

• 108

upvoted an article about 1 month ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

Dec 18, 2025

•

119

liked a Space about 1 month ago

hf-wrapped

Generate your 2025 recap

upvoted a changelog about 2 months ago

Changelog

HuggingChat for Docs

Dec 12, 2025

• 114