๐ฎ๐ณ New in my Hindi LLM Series: Gemma-4 E4B, fine-tuned for Hindi โ and it runs on your laptop's CPU. I fine-tuned Google's new Gemma-4 E4B on ~10k Hindi instruction pairs (AI4Bharat: anudesh + dolly) using Unsloth + LoRA, on a single L4 GPU. Then I ran an honest side-by-side eval: base Gemma-4 vs my fine-tune, across 25 Hindi prompts. The results were interesting ๐ โ My fine-tune is more concise โ ask for "3 tips" and it gives exactly 3. Base writes a 1,200-character essay.
โ Pure native Hindi โ base keeps slipping into English ("เคธเคเคคเฅเคฒเคฟเคค เคเคนเคพเคฐ (Eat a Balanced Diet)", "เคคเคพเคฐเคพ (Star)"). My fine-tune stays in clean Hindi.
โ Tighter instruction-following โ ask for a "short message" and it gives one, not a menu of options. โ๏ธ And to be honest: base Gemma-4 is more detailed and comprehensive. I didn't build a "smarter" model โ I built a focused, Hindi-native, edge-friendly one that runs as a 5GB GGUF (Q4) on CPU. ๐ Try it: