Meta-Llama-3-8B-Instruct-4bit (mlx-community)
Llama-3-8B 4-bit版,M系列芯片本地推理
- Apple Silicon
- 部署
-
- pip pip install mlx-lm && mlx_lm.generate --model mlx-community/Meta-Llama-3-8B-Instruct-4bit --prompt "hello"
- py from mlx_lm import load, generate; model, tokenizer = load('mlx-community/Meta-Llama-3-8B-Instruct-4bit'); generate(model, tokenizer, prompt='hello')