text-generation-webui/modules
Alex "mcmonkey" Goodwin a3eec62b50
Lora trainer improvements part 3 (#1098)
* add support for other model types

dependent on future-peft-changes but with fallback to function now

* use encoding=utf8 for training format

* make shuffling optional

and describe dropout a bit more

* add eval_steps to control evaluation

* make callbacks not depend on globals

* make save steps controllable

* placeholder of initial loading-existing-model support

and var name cleanup

* save/load parameters

* last bit of cleanup

* remove `gptq_bits` ref as main branch removed that setting

* add higher_rank_limit option

2048 is basically unreachable due to VRAM, but i trained at 1536 with batch size = 1 on a 7B model.
Note that it's in the do_train input just to save as a parameter

* fix math on save_steps
2023-04-16 02:35:13 -03:00
..
api.py Minor change to API code 2023-04-14 12:11:47 -03:00
callbacks.py Make the code more like PEP8 for readability (#862) 2023-04-07 00:15:45 -03:00
chat.py Merge pull request from GHSA-hv5m-3rp9-xcpf 2023-04-16 01:36:50 -03:00
deepspeed_parameters.py Fix deepspeed (oops) 2023-02-02 10:39:37 -03:00
extensions.py Merge pull request from GHSA-hv5m-3rp9-xcpf 2023-04-16 01:36:50 -03:00
GPTQ_loader.py Simplify GPTQ_loader.py 2023-04-13 12:13:07 -03:00
html_generator.py Don't treat Intruct mode histories as regular histories 2023-04-10 15:48:07 -03:00
llama_attn_hijack.py Added xformers support to Llama (#950) 2023-04-09 23:08:40 -03:00
llamacpp_model_alternative.py Bump llama-cpp-python to use LlamaCache 2023-04-16 00:53:40 -03:00
llamacpp_model.py Make the code more like PEP8 for readability (#862) 2023-04-07 00:15:45 -03:00
LoRA.py initial multi-lora support (#1103) 2023-04-14 14:52:06 -03:00
models.py Add "Save current settings for this model" button 2023-04-15 12:54:02 -03:00
RWKV.py Make the code more like PEP8 for readability (#862) 2023-04-07 00:15:45 -03:00
shared.py Merge pull request from GHSA-hv5m-3rp9-xcpf 2023-04-16 01:36:50 -03:00
text_generation.py revert incorrect eos_token_id change from #814 (#1261) 2023-04-16 01:47:01 -03:00
training.py Lora trainer improvements part 3 (#1098) 2023-04-16 02:35:13 -03:00
ui.py Automatically set wbits/groupsize/instruct based on model name (#1167) 2023-04-14 11:07:28 -03:00