2023-10-21 18:15:54 -04:00
## What Works
| Loader | Loading 1 LoRA | Loading 2 or more LoRAs | Training LoRAs | Multimodal extension | Perplexity evaluation |
|----------------|----------------|-------------------------|----------------|----------------------|-----------------------|
2024-05-21 12:32:02 -04:00
| Transformers | ✅ | ✅\*\* | ✅\* | ✅ | ✅ |
2024-02-04 21:48:04 -05:00
| llama.cpp | ❌ | ❌ | ❌ | ❌ | use llamacpp_HF |
| llamacpp_HF | ❌ | ❌ | ❌ | ❌ | ✅ |
2023-10-21 18:15:54 -04:00
| ExLlamav2_HF | ✅ | ✅ | ❌ | ❌ | ✅ |
2024-02-06 09:26:27 -05:00
| ExLlamav2 | ✅ | ✅ | ❌ | ❌ | use ExLlamav2_HF |
2023-10-21 18:15:54 -04:00
| AutoGPTQ | ✅ | ❌ | ❌ | ✅ | ✅ |
2024-02-04 21:48:04 -05:00
| AutoAWQ | ? | ❌ | ? | ? | ✅ |
| HQQ | ? | ? | ? | ? | ✅ |
2023-10-21 18:15:54 -04:00
❌ = not implemented
✅ = implemented
\* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.
2024-05-21 12:32:02 -04:00
\*\* Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.