text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-10-01 01:26:03 -04:00

Author	SHA1	Message	Date
matatonic	68ae5d8262	more models: +orca_mini (#2859 )	2023-06-25 01:54:53 -03:00
oobabooga	f0fcd1f697	Sort some imports	2023-06-25 01:44:36 -03:00
oobabooga	365b672531	Minor change to prevent future bugs	2023-06-25 01:38:54 -03:00
oobabooga	e6e5f546b8	Reorganize Chat settings tab	2023-06-25 01:10:20 -03:00
matatonic	b45baeea41	extensions/openai: Major docs update, fix #2852 (critical bug), minor improvements (#2849 )	2023-06-24 22:50:04 -03:00
oobabooga	ebfcfa41f2	Update ExLlama.md	2023-06-24 20:25:34 -03:00
jllllll	bef67af23c	Use pre-compiled python module for ExLlama (#2770 )	2023-06-24 20:24:17 -03:00
oobabooga	a70a2ac3be	Update ExLlama.md	2023-06-24 20:23:01 -03:00
oobabooga	b071eb0d4b	Clean up the presets (#2854 )	2023-06-24 18:41:17 -03:00
oobabooga	cec5fb0ef6	Failed attempt at evaluating exllama_hf perplexity	2023-06-24 12:02:25 -03:00
快乐的我531	e356f69b36	Make stop_everything work with non-streamed generation (#2848 )	2023-06-24 11:19:16 -03:00
oobabooga	ec482f3dae	Apply input extensions after yielding Is typing...	2023-06-24 11:07:11 -03:00
oobabooga	3e80f2aceb	Apply the output extensions only once Relevant for google translate, silero	2023-06-24 10:59:07 -03:00
rizerphe	77baf43f6d	Add CORS support to the API (#2718 )	2023-06-24 10:16:06 -03:00
matatonic	8c36c19218	8k size only for minotaur-15B (#2815 ) Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>	2023-06-24 10:14:19 -03:00
Roman	38897fbd8a	fix: added model parameter check (#2829 )	2023-06-24 10:09:34 -03:00
missionfloyd	51a388fa34	Organize chat history/character import menu (#2845 ) * Organize character import menu * Move Chat history upload/download labels	2023-06-24 09:55:02 -03:00
oobabooga	8bb3bb39b3	Implement stopping string search in string space (#2847 )	2023-06-24 09:43:00 -03:00
oobabooga	0f9088f730	Update README	2023-06-23 12:24:43 -03:00
oobabooga	3ae9af01aa	Add --no_use_cuda_fp16 param for AutoGPTQ	2023-06-23 12:22:56 -03:00
Panchovix	5646690769	Fix some models not loading on exllama_hf (#2835 )	2023-06-23 11:31:02 -03:00
oobabooga	383c50f05b	Replace old presets with the results of Preset Arena (#2830 )	2023-06-23 01:48:29 -03:00
missionfloyd	aa1f1ef46a	Fix printing, take two. (#2810 ) * Format chat for printing * Better printing	2023-06-22 16:06:49 -03:00
Panchovix	b4a38c24b7	Fix Multi-GPU not working on exllama_hf (#2803 )	2023-06-22 16:05:25 -03:00
matatonic	d94ea31d54	more models. +minotaur 8k (#2806 )	2023-06-21 21:05:08 -03:00
LarryVRH	580c1ee748	Implement a demo HF wrapper for exllama to utilize existing HF transformers decoding. (#2777 )	2023-06-21 15:31:42 -03:00
jllllll	a06acd6d09	Update bitsandbytes to 0.39.1 (#2799 )	2023-06-21 15:04:45 -03:00
Gaurav Bhagchandani	89fb6f9236	Fixed the ZeroDivisionError when downloading a model (#2797 )	2023-06-21 12:31:50 -03:00
matatonic	90be1d9fe1	More models (match more) & templates (starchat-beta, tulu) (#2790 )	2023-06-21 12:30:44 -03:00
missionfloyd	2661c9899a	Format chat for printing (#2793 )	2023-06-21 10:39:58 -03:00
oobabooga	5dfe0bec06	Remove old/useless code	2023-06-20 23:36:56 -03:00
oobabooga	faa92eee8d	Add spaces	2023-06-20 23:25:58 -03:00
Peter Sofronas	b22c7199c9	Download optimizations (#2786 ) * download_model_files metadata writing improvement * line swap * reduce line length * safer download and greater block size * Minor changes by pycodestyle --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-06-20 23:14:18 -03:00
Morgan Schweers	447569e31a	Add a download progress bar to the web UI. (#2472 ) * Show download progress on the model screen. * In case of error, mark as done to clear progress bar. * Increase the iteration block size to reduce overhead.	2023-06-20 22:59:14 -03:00
ramblingcoder	0d0d849478	Update Dockerfile to resolve superbooga requirement error (#2401 )	2023-06-20 18:31:28 -03:00
EugeoSynthesisThirtyTwo	7625c6de89	fix usage of self in classmethod (#2781 )	2023-06-20 16:18:42 -03:00
MikoAL	c40932eb39	Added Falcon LoRA training support (#2684 ) I am 50% sure this will work	2023-06-20 01:03:44 -03:00
oobabooga	c623e142ac	Bump llama-cpp-python	2023-06-20 00:49:38 -03:00
FartyPants	ce86f726e9	Added saving of training logs to training_log.json (#2769 )	2023-06-20 00:47:36 -03:00
oobabooga	017884132f	Merge remote-tracking branch 'refs/remotes/origin/main'	2023-06-20 00:46:29 -03:00
oobabooga	e1cd6cc410	Minor style change	2023-06-20 00:46:18 -03:00
Cebtenzzre	59e7ecb198	llama.cpp: implement ban_eos_token via logits_processor (#2765 )	2023-06-19 21:31:19 -03:00
oobabooga	0d9d70ec7e	Update docs	2023-06-19 12:52:23 -03:00
oobabooga	f6a602861e	Update docs	2023-06-19 12:51:30 -03:00
oobabooga	5d4b4d15a5	Update Using-LoRAs.md	2023-06-19 12:43:57 -03:00
oobabooga	eb30f4441f	Add ExLlama+LoRA support (#2756 )	2023-06-19 12:31:24 -03:00
oobabooga	a1cac88c19	Update README.md	2023-06-19 01:28:23 -03:00
oobabooga	5f418f6171	Fix a memory leak (credits for the fix: Ph0rk0z)	2023-06-19 01:19:28 -03:00
ThisIsPIRI	def3b69002	Fix loading condition for universal llama tokenizer (#2753 )	2023-06-18 18:14:06 -03:00
oobabooga	490a1795f0	Bump peft commit	2023-06-18 16:42:11 -03:00

1 2 3 4 5 ...

2048 Commits