Instructions to use mlx-community/Meta-Llama-3.1-70B-bf16 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use mlx-community/Meta-Llama-3.1-70B-bf16 with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # if on a CUDA device, also pip install mlx[cuda] # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("mlx-community/Meta-Llama-3.1-70B-bf16") prompt = "Once upon a time in" text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
- MLX LM
How to use mlx-community/Meta-Llama-3.1-70B-bf16 with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Generate some text mlx_lm.generate --model "mlx-community/Meta-Llama-3.1-70B-bf16" --prompt "Once upon a time"
Ctrl+K
- 1.52 kB
- 14.4 kB
- 901 Bytes
- 5.05 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 5.13 GB xet
- 2.57 GB xet
- 62.5 kB
- 301 Bytes
- 9.08 MB
- 50.5 kB