text-generation-webui/modules
Water 674be9a09a
Add HQQ quant loader (#4888)
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-12-18 21:23:16 -03:00
grammar Better HF grammar implementation (#4953) 2023-12-17 02:01:23 -03:00
AutoGPTQ_loader.py AutoGPTQ: Add --disable_exllamav2 flag (Mixtral CPU offloading needs this) 2023-12-15 06:46:13 -08:00
block_requests.py Bump to latest gradio (3.47) (#4258) 2023-10-10 22:20:49 -03:00
callbacks.py Revert "Update callbacks.py to show tracebacks on ValueError (#4892)" 2023-12-12 11:47:11 -08:00
chat.py Instruction templates: better handle unwanted bos tokens 2023-12-17 21:04:30 -08:00
ctransformers_model.py Fix ctransformers model unload (#3711) 2023-08-27 10:53:48 -03:00
deepspeed_parameters.py Fix typo in deepspeed_parameters.py (#3222) 2023-07-24 11:17:28 -03:00
evaluate.py Cleanup: set shared.model_name only once 2023-12-08 06:35:23 -08:00
exllama_hf.py Fix off-by-one error in exllama_hf caching logic (#4145) 2023-10-05 12:20:56 -03:00
exllama.py Fix partial unicode characters issue (#4837) 2023-12-08 09:50:53 -03:00
exllamav2_hf.py Add --num_experts_per_token parameter (ExLlamav2) (#4955) 2023-12-17 12:08:33 -03:00
exllamav2.py Add --num_experts_per_token parameter (ExLlamav2) (#4955) 2023-12-17 12:08:33 -03:00
extensions.py Fix unexpected extensions load after gradio restart (#3965) 2023-09-17 17:35:43 -03:00
github.py Lint 2023-09-25 20:31:11 -07:00
GPTQ_loader.py make torch.load a bit safer (#4448) 2023-11-02 14:07:08 -03:00
html_generator.py Gallery improvements (#4789) 2023-12-03 22:45:50 -03:00
llama_attn_hijack.py Lint 2023-10-23 13:09:03 -07:00
llamacpp_hf.py Bump llama-cpp-python to 0.2.18 (2nd attempt) (#4637) 2023-11-18 00:31:27 -03:00
llamacpp_model.py Fix a bug in llama.cpp get_logits() function 2023-11-30 11:21:40 -08:00
loaders.py Add HQQ quant loader (#4888) 2023-12-18 21:23:16 -03:00
logging_colors.py Add menus for saving presets/characters/instruction templates/prompts (#2621) 2023-06-11 12:19:18 -03:00
logits.py [OpenAI Extension] Add 'max_logits' parameter in logits endpoint (#4916) 2023-12-15 00:22:43 -03:00
LoRA.py Minor LoRA bug fix 2023-11-19 07:59:29 -08:00
metadata_gguf.py llama.cpp: read instruction template from GGUF metadata (#4975) 2023-12-18 01:51:58 -03:00
models_settings.py Add HQQ quant loader (#4888) 2023-12-18 21:23:16 -03:00
models.py Add HQQ quant loader (#4888) 2023-12-18 21:23:16 -03:00
monkey_patch_gptq_lora.py fix lora training with alpaca_lora_4bit (#3853) 2023-09-11 01:22:20 -03:00
one_click_installer_check.py Lint 2023-11-16 18:03:06 -08:00
presets.py Parameters: change max_new_tokens & repetition_penalty_range defaults (#4842) 2023-12-07 20:04:52 -03:00
prompts.py Fix "send instruction template to..." buttons (closes #4625) 2023-11-16 18:16:42 -08:00
relative_imports.py Add ExLlama+LoRA support (#2756) 2023-06-19 12:31:24 -03:00
RoPE.py Add missing file 2023-08-25 07:10:26 -07:00
RWKV.py Intel Gpu support initialization (#4340) 2023-10-26 23:39:51 -03:00
sampler_hijack.py Add temperature_last parameter (#4472) 2023-11-04 13:09:07 -03:00
shared.py Add HQQ quant loader (#4888) 2023-12-18 21:23:16 -03:00
text_generation.py Better HF grammar implementation (#4953) 2023-12-17 02:01:23 -03:00
training.py UI: update context upper limit to 200000 2023-12-04 15:48:34 -08:00
ui_chat.py UI: update "Saved instruction templates" dropdown after loading template 2023-12-17 21:19:06 -08:00
ui_default.py Improve --multi-user mode 2023-09-26 06:42:33 -07:00
ui_file_saving.py Refresh the Preset menu after saving a preset 2023-11-18 14:03:42 -08:00
ui_model_menu.py Add HQQ quant loader (#4888) 2023-12-18 21:23:16 -03:00
ui_notebook.py Improve --multi-user mode 2023-09-26 06:42:33 -07:00
ui_parameters.py New feature: "random preset" button (#4647) 2023-11-18 18:31:41 -03:00
ui_session.py Jinja templates for Instruct and Chat (#4874) 2023-12-12 17:23:14 -03:00
ui.py Add HQQ quant loader (#4888) 2023-12-18 21:23:16 -03:00
utils.py UI: update "Saved instruction templates" dropdown after loading template 2023-12-17 21:19:06 -08:00