BlitzKode 1.5B LoRA Adapter

This repository contains the BlitzKode 1.5B PEFT/LoRA adapter for Qwen/Qwen2.5-1.5B-Instruct. It is intended for research, inspection, and reproducible adapter loading. The production local-inference artifact is the merged GGUF model at neuralbroker/blitzkode.

Intended use

  • Local coding-assistant research
  • Adapter inspection and continued fine-tuning
  • Reproducing the BlitzKode GGUF export pipeline
  • Educational experiments with PEFT/LoRA on Qwen2.5

Do not use generated code in production without review and tests.

Loading

from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_model_id = "Qwen/Qwen2.5-1.5B-Instruct"
adapter_id = "neuralbroker/blitzkode-1.5b-lora"

tokenizer = AutoTokenizer.from_pretrained(base_model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    base_model_id,
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True,
)
model = PeftModel.from_pretrained(model, adapter_id)
model.eval()

Training summary

BlitzKode was developed through staged coding-assistant post-training:

Stage Method Purpose
SFT Curated coding examples Teach concise coding responses and common algorithms
Reward-SFT Heuristic continuation Reinforce formatting, correctness patterns, and practical explanations
DPO Preference pairs Penalize weak answers and hallucinated details
Export Merge + GGUF Produce the production blitzkode.gguf artifact

Dataset provenance is tracked in the GitHub project and the production model repository docs.

Evaluation

The latest published smoke evaluation is attached to the production GGUF repository: neuralbroker/blitzkode. Current GGUF smoke eval: 3 / 4 passed (75%) on Python factorial, binary search, SQL top users, and fictional-API uncertainty checks. The raw model still fails the fictional-API uncertainty case, so downstream serving should keep guardrails enabled.

This is a lightweight regression smoke eval, not a comprehensive benchmark such as HumanEval, MBPP, or SWE-bench.

Limitations

  • Small-model limitations apply; outputs can be wrong or incomplete.
  • Direct prompting may hallucinate unsupported APIs or signatures.
  • Default deployment context is 2,048 tokens.
  • Text-only model; no image/file multimodal support.

Related repositories

License

MIT for BlitzKode project files and adapter release metadata. You must also comply with the upstream Qwen2.5 license for the base model.

Downloads last month
39
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for neuralbroker/blitzkode-1.5b-lora

Adapter
(1019)
this model