| Name | Last commit message | Last commit date |
| ---- | ------------------- | ---------------- |
| grammar | Better HF grammar implementation (#4953) | 2023-12-17 02:01:23 -03:00 |
| AutoGPTQ_loader.py | AutoGPTQ: Add --disable_exllamav2 flag (Mixtral CPU offloading needs this) | 2023-12-15 06:46:13 -08:00 |
| block_requests.py | Revert gradio to 3.50.2 (#5640) | 2024-03-06 11:52:46 -03:00 |
| cache_utils.py | Minor logging change | 2024-03-11 16:31:13 -07:00 |
| callbacks.py | Dynamic Temperature HF loader support (#5174) | 2024-01-07 10:36:26 -03:00 |
| chat.py | Right-truncate long chat completion prompts instead of left-truncating | 2024-03-07 08:50:24 -08:00 |
| ctransformers_model.py | Fix ctransformers model unload (#3711) | 2023-08-27 10:53:48 -03:00 |
| deepspeed_parameters.py | Fix typo in deepspeed_parameters.py (#3222) | 2023-07-24 11:17:28 -03:00 |
| evaluate.py | Perplexity evaluation: print to terminal after calculation is finished | 2024-02-28 19:58:21 -08:00 |
| exllamav2_hf.py | Update cache_4bit documentation (#5649) | 2024-03-07 13:08:21 -03:00 |
| exllamav2.py | Add cache_4bit option for ExLlamaV2 (#5645) | 2024-03-06 23:02:25 -03:00 |
| extensions.py | Move update_wizard_windows.sh to update_wizard_windows.bat (oops) | 2024-03-04 19:26:24 -08:00 |
| github.py | Lint | 2023-09-25 20:31:11 -07:00 |
| GPTQ_loader.py | Improve several log messages | 2023-12-19 20:54:32 -08:00 |
| html_generator.py | Optimize the UI by caching convert_to_markdown calls | 2024-03-10 20:10:07 -07:00 |
| llama_cpp_python_hijack.py | Add StreamingLLM for llamacpp & llamacpp_HF (2nd attempt) (#5669) | 2024-03-09 00:25:33 -03:00 |
| llamacpp_hf.py | Fix CFG with llamacpp_HF (2nd attempt) | 2024-02-19 18:35:42 -08:00 |
| llamacpp_model.py | llama.cpp: add a progress bar for prompt evaluation | 2024-02-07 21:56:10 -08:00 |
| loaders.py | Add StreamingLLM for llamacpp & llamacpp_HF (2nd attempt) (#5669) | 2024-03-09 00:25:33 -03:00 |
| logging_colors.py | Lint | 2023-12-19 21:36:57 -08:00 |
| logits.py | Revert "Remove non-HF ExLlamaV2 loader (#5431)" | 2024-02-06 06:21:36 -08:00 |
| LoRA.py | Revert "Remove non-HF ExLlamaV2 loader (#5431)" | 2024-02-06 06:21:36 -08:00 |
| metadata_gguf.py | llama.cpp: read instruction template from GGUF metadata (#4975) | 2023-12-18 01:51:58 -03:00 |
| models_settings.py | Autodetect llamacpp_HF loader when tokenizer exists | 2024-02-16 09:29:26 -08:00 |
| models.py | Autodetect llamacpp_HF loader when tokenizer exists | 2024-02-16 09:29:26 -08:00 |
| monkey_patch_gptq_lora.py | fix lora training with alpaca_lora_4bit (#3853) | 2023-09-11 01:22:20 -03:00 |
| one_click_installer_check.py | Lint | 2023-11-16 18:03:06 -08:00 |
| presets.py | Cubic sampling w/ curve param (#5551) | 2024-03-03 13:22:21 -03:00 |
| prompts.py | Fix "send instruction template to..." buttons (closes #4625) | 2023-11-16 18:16:42 -08:00 |
| relative_imports.py | Add ExLlama+LoRA support (#2756) | 2023-06-19 12:31:24 -03:00 |
| RoPE.py | Lint | 2024-01-09 16:27:50 -08:00 |
| sampler_hijack.py | Cubic sampling w/ curve param (#5551) | 2024-03-03 13:22:21 -03:00 |
| shared.py | Document StreamingLLM | 2024-03-10 19:15:23 -07:00 |
| text_generation.py | Optimize StreamingLLM by over 10x | 2024-03-08 21:48:28 -08:00 |
| training.py | Perplexity evaluation: make UI events more robust (attempt) | 2024-02-22 07:13:22 -08:00 |
| ui_chat.py | Move "Command for chat-instruct mode" to the main chat tab (closes #5634) | 2024-03-08 12:52:52 -08:00 |
| ui_default.py | Revert gradio to 3.50.2 (#5640) | 2024-03-06 11:52:46 -03:00 |
| ui_file_saving.py | Improve the file saving/deletion menus | 2024-01-09 06:33:47 -08:00 |
| ui_model_menu.py | Document StreamingLLM | 2024-03-10 19:15:23 -07:00 |
| ui_notebook.py | Revert gradio to 3.50.2 (#5640) | 2024-03-06 11:52:46 -03:00 |
| ui_parameters.py | Increase height of "Custom stopping strings" UI field | 2024-03-08 12:54:30 -08:00 |
| ui_session.py | Revert gradio to 3.50.2 (#5640) | 2024-03-06 11:52:46 -03:00 |
| ui.py | Add StreamingLLM for llamacpp & llamacpp_HF (2nd attempt) (#5669) | 2024-03-09 00:25:33 -03:00 |
| utils.py | Add a menu for customizing the instruction template for the model (#5521) | 2024-02-16 14:21:17 -03:00 |