gpt4all

mirror of https://github.com/nomic-ai/gpt4all.git synced 2024-10-01 01:06:10 -04:00

Author	SHA1	Message	Date
chrisbarrera	f8b1069a1c	add min_p sampling parameter (#2014) Signed-off-by: Christopher Barrera <cb@arda.tx.rr.com> Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>	2024-02-24 17:51:34 -05:00
Adam Treat	67bbce43ab	Fix state issues with reloading model. Signed-off-by: Adam Treat <treat.adam@gmail.com>	2024-02-21 16:05:49 -05:00
Jared Van Bortel	4fc4d94be4	fix chat-style prompt templates (#1970) Also use a new version of Mistral OpenOrca. Signed-off-by: Jared Van Bortel <jared@nomic.ai>	2024-02-21 15:45:32 -05:00
Adam Treat	fa0a2129dc	Don't try and detect model load error on startup. Signed-off-by: Adam Treat <treat.adam@gmail.com>	2024-02-21 10:15:20 -06:00
Adam Treat	67099f80ba	Add comment to make this clear. Signed-off-by: Adam Treat <treat.adam@gmail.com>	2024-02-21 10:15:20 -06:00
Adam Treat	d948a4f2ee	Complete revamp of model loading to allow for more discreet control by the user of the models loading behavior. Signed-off-by: Adam Treat <treat.adam@gmail.com>	2024-02-21 10:15:20 -06:00
Adam Treat	4461af35c7	Fix includes. Signed-off-by: Adam Treat <treat.adam@gmail.com>	2024-02-05 16:46:16 -05:00
Jared Van Bortel	10e3f7bbf5	Fix VRAM leak when model loading fails (#1901) Signed-off-by: Jared Van Bortel <jared@nomic.ai>	2024-02-01 15:45:45 -05:00
Adam Treat	d14b95f4bd	Add Nomic Embed model for atlas with localdocs.	2024-01-31 22:22:08 -05:00
Jared Van Bortel	061d1969f8	expose n_gpu_layers parameter of llama.cpp (#1890) Also dynamically limit the GPU layers and context length fields to the maximum supported by the model. Signed-off-by: Jared Van Bortel <jared@nomic.ai>	2024-01-31 14:17:44 -05:00
Jared Van Bortel	c7ea283f1f	chatllm: fix deserialization version mismatch (#1859) Signed-off-by: Jared Van Bortel <jared@nomic.ai>	2024-01-22 10:01:31 -05:00
Jared Van Bortel	d1c56b8b28	Implement configurable context length (#1749)	2023-12-16 17:58:15 -05:00
Jared Van Bortel	0600f551b3	chatllm: do not attempt to serialize incompatible state (#1742)	2023-12-12 11:45:03 -05:00
Adam Treat	fb3b1ceba2	Do not attempt to do a blocking retrieval if we don't have any collections.	2023-12-04 12:58:40 -05:00
Moritz Tim W	012f399639	fix typo (#1697)	2023-11-30 12:37:52 -05:00
Adam Treat	9e27a118ed	Fix system prompt.	2023-11-21 10:42:12 -05:00
Adam Treat	5c0d077f74	Remove leading whitespace in responses.	2023-10-28 16:53:42 -04:00
Adam Treat	dc2e7d6e9b	Don't start recalculating context immediately upon switching to a new chat but rather wait until the first prompt. This allows users to switch between chats fast and to delete chats more easily. Fixes issue #1545	2023-10-28 16:41:23 -04:00
cebtenzzre	4338e72a51	MPT: use upstream llama.cpp implementation (#1515)	2023-10-19 15:25:17 -04:00
cebtenzzre	04499d1c7d	chatllm: do not write uninitialized data to stream (#1486)	2023-10-11 11:31:34 -04:00
Adam Treat	f0742c22f4	Restore state from text if necessary.	2023-10-11 09:16:02 -04:00
Adam Treat	b2cd3bdb3f	Fix crasher with an empty string for prompt template.	2023-10-06 12:44:53 -04:00
Cebtenzzre	5fe685427a	chat: clearer CPU fallback messages	2023-10-06 11:35:14 -04:00
Cebtenzzre	1534df3e9f	backend: do not use Vulkan with non-LLaMA models	2023-10-05 18:16:19 -04:00
Cebtenzzre	672cb850f9	differentiate between init failure and unsupported models	2023-10-05 18:16:19 -04:00
Cebtenzzre	a5b93cf095	more accurate fallback descriptions	2023-10-05 18:16:19 -04:00
Cebtenzzre	75deee9adb	chat: make sure to clear fallback reason on success	2023-10-05 18:16:19 -04:00
Cebtenzzre	2eb83b9f2a	chat: report reason for fallback to CPU	2023-10-05 18:16:19 -04:00
Adam Treat	12f943e966	Fix regenerate button to be deterministic and bump the llama version to latest we have for gguf.	2023-10-05 18:16:19 -04:00
Cebtenzzre	a49a1dcdf4	chatllm: grammar fix	2023-10-05 18:16:19 -04:00
Cebtenzzre	8f3abb37ca	fix references to removed model types	2023-10-05 18:16:19 -04:00
Adam Treat	d90d003a1d	Latest rebase on llama.cpp with gguf support.	2023-10-05 18:16:19 -04:00
Adam Treat	045f6e6cdc	Link against ggml in bin so we can get the available devices without loading a model.	2023-09-15 14:45:25 -04:00
Adam Treat	aa33419c6e	Fallback to CPU more robustly.	2023-09-14 16:53:11 -04:00
Adam Treat	3076e0bf26	Only show GPU when we're actually using it.	2023-09-14 09:59:19 -04:00
Adam Treat	1fa67a585c	Report the actual device we're using.	2023-09-14 08:25:37 -04:00
Adam Treat	21a3244645	Fix a bug where we're not properly falling back to CPU.	2023-09-13 19:30:27 -04:00
Aaron Miller	6f038c136b	init at most one vulkan device, submodule update fixes issues w/ multiple of the same gpu	2023-09-13 12:49:53 -07:00
Adam Treat	891ddafc33	When device is Auto (the default) then we will only consider discrete GPU's otherwise fallback to CPU.	2023-09-13 11:59:36 -04:00
Adam Treat	8f99dca70f	Bring the vulkan backend to the GUI.	2023-09-13 11:26:10 -04:00
Adam Treat	987546c63b	Nomic vulkan backend licensed under the Software for Open Models License (SOM), version 1.0.	2023-08-31 15:29:54 -04:00
Adam Treat	6d03b3e500	Add starcoder support.	2023-07-27 09:15:16 -04:00
Adam Treat	0efdbfcffe	Bert	2023-07-13 14:21:46 -04:00
Adam Treat	315a1f2aa2	Move it back as internal class.	2023-07-13 14:21:46 -04:00
Adam Treat	1f749d7633	Clean up backend code a bit and hide impl. details.	2023-07-13 14:21:46 -04:00
Adam Treat	8eb0844277	Check if the trimmed version is empty.	2023-07-12 14:31:43 -04:00
Adam Treat	be395c12cc	Make all system prompts empty by default if model does not include in training data.	2023-07-12 14:31:43 -04:00
Adam Treat	34a3b9c857	Don't block on exit when not connected.	2023-07-11 12:37:21 -04:00
Adam Treat	88bbe30952	Provide a guardrail for OOM errors.	2023-07-11 12:09:33 -04:00
Adam Treat	99cd555743	Provide some guardrails for thread count.	2023-07-10 17:29:51 -04:00

91 Commits