How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull llmware/phi-3.5-gguf
Run and chat with the model
lemonade run user.phi-3.5-gguf-{{QUANT_TAG}}
List all available models
lemonade list
Quick Links

Configuration Parsing Warning:In UNKNOWN_FILENAME: "tokenizer_config.bos_token" must be one of [string, object]

Configuration Parsing Warning:In UNKNOWN_FILENAME: "tokenizer_config.eos_token" must be one of [string, object]

phi-3.5-gguf

phi-3.5-gguf is a GGUF 4_K_M (int4) quantized version of Microsoft Phi-3.5-mini-instruct, providing a very fast, very small inference implementation, optimized for AI PCs.

Model Description

  • Developed by: microsoft
  • Model type: phi3
  • Parameters: 3.8 billion
  • Model Parent: microsoft/Phi-3.5-mini-instruct
  • Language(s) (NLP): English
  • License: Apache 2.0
  • Uses: Chat, general-purpose LLM
  • Quantization: 4_K_M (int4)

Model Card Contact

llmware on hf

llmware website

Downloads last month
19
GGUF
Model size
4B params
Architecture
phi3
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for llmware/phi-3.5-gguf

Quantized
(183)
this model