Commit Graph

1048 Commits

Author SHA1 Message Date
aaron miller
e6fd0a240d backend: fix buffer overrun in repeat penalty code
Caught with AddressSanitizer running a basic prompt test against llmodel
standalone. This fix allows ASan builds to complete a simple prompt
without illegal accesses but there are still notably several leaks.
2023-05-17 07:54:10 -04:00
kuvaus
26cb31c4e6 Bugfix on llmodel_model_create function
Fixes the bug where llmodel_model_create prints "Invalid model file" even though the model is loaded correctly. Credits and thanks to @serendipity for the fix.
2023-05-17 07:49:32 -04:00
drbh
d4861030b7
adds a simple cli chat repl (#566)
* adds a simple cli chat repl

* add n thread support and append assistant response
2023-05-16 16:47:54 -04:00
Andriy Mulyar
95a4516844
Update README.md with Docs (#607)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-16 16:15:00 -04:00
Richard Guo
e659ef5b2a
Improvements to documentation (#606) 2023-05-16 15:29:27 -04:00
Andriy Mulyar
bc481f2ab7
Chat doc typo (#605)
* Added modal labs example to documentation

* Added gpt4all chat

* Typo

* Andriy can't spell
2023-05-16 14:33:34 -04:00
Andriy Mulyar
5528e37660
Chat doc fixes (#604)
* Added modal labs example to documentation

* Added gpt4all chat

* Typo

---------

Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-16 14:23:37 -04:00
Andriy Mulyar
96cedc2558
Added better documentation to web server example in docs (#603)
* Added modal labs example to documentation

* Added gpt4all chat
2023-05-16 14:17:35 -04:00
Andriy Mulyar
3b407a3bd1
Update gpt4all_chat.md (#601)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-16 13:15:00 -04:00
Andriy Mulyar
9b70bb0714
Typo (#600)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-16 13:07:27 -04:00
Adam Treat
a86a1aea6b Fix for windows/mac installers. 2023-05-16 13:06:42 -04:00
Adam Treat
7221169eff Remove davinci-003 option. 2023-05-16 13:06:42 -04:00
Adam Treat
86b6d40d08 Bump the version. 2023-05-16 13:06:42 -04:00
Andriy Mulyar
8e3728f60e
Update gpt4all_chat.md (#598)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-16 13:02:19 -04:00
Andriy Mulyar
e424f9bd5a
Update gpt4all_chat.md (#597)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-16 12:58:05 -04:00
Andriy Mulyar
17de7f0529
Chat Client Documentation (#596)
* GPT4All Chat Client Documentation

* Updated documentation wording
2023-05-16 12:46:31 -04:00
kuvaus
3cb6dd7a66
gpt4all-backend: Add llmodel create and destroy functions (#554)
* Add llmodel create and destroy functions

* Fix capitalization

* Fix capitalization

* Fix capitalization

* Update CMakeLists.txt

---------

Co-authored-by: kuvaus <kuvaus@users.noreply.github.com>
2023-05-16 11:36:46 -04:00
kuvaus
507e913faf
gpt4all-backend: Add MSVC support to backend (#595)
* Add MSVC compatibility

* Add _MSC_VER macro

---------

Co-authored-by: kuvaus <kuvaus@users.noreply.github.com>
2023-05-16 11:35:33 -04:00
Yaroslav Halchenko
c942780f5a [DATALAD RUNCMD] run codespell throughout
=== Do not change lines below ===
{
 "chain": [],
 "cmd": "codespell -w",
 "exit": 0,
 "extra_inputs": [],
 "inputs": [],
 "outputs": [],
 "pwd": "."
}
^^^ Do not change lines above ^^^
2023-05-16 11:33:59 -04:00
Yaroslav Halchenko
e4bc9c0c3b fix ambigous typo 2023-05-16 11:33:59 -04:00
Yaroslav Halchenko
2195bfabba Add rudimentary codespell config 2023-05-16 11:33:59 -04:00
Yaroslav Halchenko
0f16aebbca Add github action to codespell main on push and PRs 2023-05-16 11:33:59 -04:00
Adam Treat
79d6243fe1 Use the default for max_tokens to avoid errors. 2023-05-16 10:31:55 -04:00
Adam Treat
b24ace372b Add ability to remove models. 2023-05-16 10:31:55 -04:00
Adam Treat
47059ad9f1 Send the prompt template with conversations to the datalake. 2023-05-16 10:31:55 -04:00
Adam Treat
748e7977ca Generate the new prompt/response pair before model loading in server mode. 2023-05-16 10:31:55 -04:00
Adam Treat
f931de21c5 Add save/restore to chatgpt chats and allow serialize/deseralize from disk. 2023-05-16 10:31:55 -04:00
Adam Treat
0cd509d530 Add large network icon background for chatgpt and server modes. 2023-05-16 10:31:55 -04:00
Adam Treat
dd27c10f54 Preliminary support for chatgpt models. 2023-05-16 10:31:55 -04:00
Adam Treat
da3828af89 Use compact json format. 2023-05-16 10:31:55 -04:00
Aaron Miller
d14936bfd6 backend: dedupe tokenizing code in mpt/gptj 2023-05-16 10:30:19 -04:00
Aaron Miller
6182026c70 backend: dedupe tokenizing code in gptj/mpt 2023-05-16 10:30:19 -04:00
Aaron Miller
4cd8bdf9a1 backend: make initial buf_size const in model impls
more unifying mpt and gptj code - this one's never written so also
changing the name to be clearer
2023-05-16 10:30:19 -04:00
Aaron Miller
08402a1b64 mpt: use buf in model struct (thread safety) 2023-05-16 10:30:19 -04:00
AT
3e0ef4c2c5
Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-15 19:00:11 -04:00
Andriy Mulyar
a07237e54f
Golang Bindings in README (#586)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-15 12:59:32 -04:00
Ettore Di Giacinto
3f63cc6b47
Golang bindings initial working version(#534)
* WIP

* Fix includes

* Try to fix linking issues

* Refinements

* allow to load MPT and llama models too

* cleanup, add example, add README
2023-05-15 12:45:56 -04:00
Richard Guo
2433902460
label tag warning and moved PR template out of subdir (#584)
Co-authored-by: Richard Guo <richardguo@Richards-MacBook-Pro.local>
2023-05-15 10:15:17 -04:00
Andriy Mulyar
fbe89214b6
Update path for build and run demo (#578)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-14 17:26:28 -04:00
AT
171eee48e2
Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-14 15:48:17 -04:00
AT
8f3c88440f
Create README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-14 15:43:18 -04:00
AT
4920816c90
Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-14 15:26:00 -04:00
Andriy Mulyar
eb84856344
Rename README to README.md (#575)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-14 14:12:38 -04:00
Andriy Mulyar
782e731e66
Typescript Bindings README (#574)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-14 13:59:03 -04:00
Andriy Mulyar
85d5d138de
Updated Bindings Links (#573)
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
2023-05-14 13:57:00 -04:00
AT
23f433dd18
Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-14 07:59:50 -04:00
AT
31452769a4
Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-14 07:41:45 -04:00
Adam Treat
b71c0ac3bd The server has different lifetime mgmt than the other chats. 2023-05-13 19:34:54 -04:00
Adam Treat
ddc24acf33 Much better memory mgmt for multi-threaded model loading/unloading. 2023-05-13 19:10:56 -04:00
Adam Treat
2989b74d43 httpserver 2023-05-13 19:07:06 -04:00