text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-09-19 15:56:56 +00:00

History

oobabooga bae14c8f13 Right-truncate long chat completion prompts instead of left-truncating Instructions are usually at the beginning of the prompt.		2024-03-07 08:50:24 -08:00
..
grammar	Better HF grammar implementation (#4953 )	2023-12-17 02:01:23 -03:00
AutoGPTQ_loader.py	AutoGPTQ: Add --disable_exllamav2 flag (Mixtral CPU offloading needs this)	2023-12-15 06:46:13 -08:00
block_requests.py	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
callbacks.py	Dynamic Temperature HF loader support (#5174 )	2024-01-07 10:36:26 -03:00
chat.py	Right-truncate long chat completion prompts instead of left-truncating	2024-03-07 08:50:24 -08:00
ctransformers_model.py	Fix ctransformers model unload (#3711 )	2023-08-27 10:53:48 -03:00
deepspeed_parameters.py	Fix typo in deepspeed_parameters.py (#3222 )	2023-07-24 11:17:28 -03:00
evaluate.py	Perplexity evaluation: print to terminal after calculation is finished	2024-02-28 19:58:21 -08:00
exllamav2_hf.py	Update cache_4bit documentation (#5649 )	2024-03-07 13:08:21 -03:00
exllamav2.py	Add cache_4bit option for ExLlamaV2 (#5645 )	2024-03-06 23:02:25 -03:00
extensions.py	Move update_wizard_windows.sh to update_wizard_windows.bat (oops)	2024-03-04 19:26:24 -08:00
github.py	Lint	2023-09-25 20:31:11 -07:00
GPTQ_loader.py	Improve several log messages	2023-12-19 20:54:32 -08:00
html_generator.py	Lint	2024-01-22 03:25:55 -08:00
llama_cpp_python_hijack.py	Lint	2024-02-13 16:05:41 -08:00
llamacpp_hf.py	Fix CFG with llamacpp_HF (2nd attempt)	2024-02-19 18:35:42 -08:00
llamacpp_model.py	llama.cpp: add a progress bar for prompt evaluation	2024-02-07 21:56:10 -08:00
loaders.py	Add cache_4bit option for ExLlamaV2 (#5645 )	2024-03-06 23:02:25 -03:00
logging_colors.py	Lint	2023-12-19 21:36:57 -08:00
logits.py	Revert "Remove non-HF ExLlamaV2 loader (#5431 )"	2024-02-06 06:21:36 -08:00
LoRA.py	Revert "Remove non-HF ExLlamaV2 loader (#5431 )"	2024-02-06 06:21:36 -08:00
metadata_gguf.py	llama.cpp: read instruction template from GGUF metadata (#4975 )	2023-12-18 01:51:58 -03:00
models_settings.py	Autodetect llamacpp_HF loader when tokenizer exists	2024-02-16 09:29:26 -08:00
models.py	Autodetect llamacpp_HF loader when tokenizer exists	2024-02-16 09:29:26 -08:00
monkey_patch_gptq_lora.py	fix lora training with alpaca_lora_4bit (#3853 )	2023-09-11 01:22:20 -03:00
one_click_installer_check.py	Lint	2023-11-16 18:03:06 -08:00
presets.py	Cubic sampling w/ curve param (#5551 )	2024-03-03 13:22:21 -03:00
prompts.py	Fix "send instruction template to..." buttons (closes #4625 )	2023-11-16 18:16:42 -08:00
relative_imports.py	Add ExLlama+LoRA support (#2756 )	2023-06-19 12:31:24 -03:00
RoPE.py	Lint	2024-01-09 16:27:50 -08:00
sampler_hijack.py	Cubic sampling w/ curve param (#5551 )	2024-03-03 13:22:21 -03:00
shared.py	Update cache_4bit documentation (#5649 )	2024-03-07 13:08:21 -03:00
text_generation.py	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
training.py	Perplexity evaluation: make UI events more robust (attempt)	2024-02-22 07:13:22 -08:00
ui_chat.py	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
ui_default.py	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
ui_file_saving.py	Improve the file saving/deletion menus	2024-01-09 06:33:47 -08:00
ui_model_menu.py	Update cache_4bit documentation (#5649 )	2024-03-07 13:08:21 -03:00
ui_notebook.py	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
ui_parameters.py	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
ui_session.py	Revert gradio to 3.50.2 (#5640 )	2024-03-06 11:52:46 -03:00
ui.py	Add cache_4bit option for ExLlamaV2 (#5645 )	2024-03-06 23:02:25 -03:00
utils.py	Add a menu for customizing the instruction template for the model (#5521 )	2024-02-16 14:21:17 -03:00