Commit Graph

853 Commits

Author SHA1 Message Date
AT
d9725c24be Revert "Improved implementation management code"
This reverts commit bc079a8fb1.
2023-05-30 13:12:51 -04:00
Adam Treat
57f1db9cfc Fix warnings in backend. 2023-05-30 12:37:18 -04:00
niansa
bc079a8fb1 Improved implementation management code 2023-05-30 08:09:51 -04:00
niansa
a77a4a7e20 Fixed clblast/cuda build in llama.cpp.cmake 2023-05-26 19:39:45 +02:00
Adam Treat
ff5be8865b Mark these as const 2023-05-26 13:27:15 -04:00
niansa
0bec24980f Initial build variant auto detection 2023-05-26 13:27:15 -04:00
niansa/tuxifan
0788fe0483 Reverted llama_sample_top_p_top_k back to being static
Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>
2023-05-26 13:27:15 -04:00
niansa
4dc1df4719 Unified llamamodel{.cpp,_impl.h} for all llama.cpp versions 2023-05-26 13:27:15 -04:00
niansa/tuxifan
75c787de7f Support new 230519 llama.cpp 2023-05-26 13:27:15 -04:00
niansa
6fad98e28f Aligned llama implementation style to other implementations 2023-05-26 13:27:15 -04:00
niansa
5d723221b7 Fix BOS never getting added in llamamodel 2023-05-26 13:27:15 -04:00
niansa
7c3dd3cdad Properly check magic 2023-05-26 13:27:15 -04:00
niansa
e8a83be3e1 Re-add llamamodel.cpp based on upstream llama.cpp 2023-05-26 13:27:15 -04:00
niansa
c73b8ef9d4 Rename llama.cpp fork submodule to llama.cpp-old, add the upstream one as llama.cpp 2023-05-26 13:27:15 -04:00
niansa/tuxifan
af9ea34bd0 Fixed compilation due to dlhandle name change
Seems like I've overlooked an `#include` when renaming the file

Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>
2023-05-26 13:27:15 -04:00
niansa
350033a23a Fixed llama capitalization 2023-05-26 13:27:15 -04:00
niansa
3607456cc3 Reverted line break changes 2023-05-26 13:27:15 -04:00
niansa
a342acfc73 Removed operator ""_MiB 2023-05-26 13:27:15 -04:00
niansa
5aac0fa04a Renamed dlhandle.hpp -> dlhandle.h; added file to CMake 2023-05-26 13:27:15 -04:00
niansa
c7f78fe7f9 Fixed oversight in last commit in llmodel_c.cpp 2023-05-26 13:27:15 -04:00
niansa
7878244170 Implemented basic dynamic model implementation loading 2023-05-26 13:27:15 -04:00
Richard Guo
73db20ba85 hotfix default verbose optioin 2023-05-26 12:49:32 -04:00
Konstantin Gukov
a6f3e94458 one funcion to append .bin suffix 2023-05-26 09:24:03 -04:00
Konstantin Gukov
659244f0a2 Correct indentation of the multiline error message 2023-05-26 09:24:03 -04:00
Konstantin Gukov
5e61008424 Add optional verbosity 2023-05-26 09:24:03 -04:00
Konstantin Gukov
e05ee9466a Correct return type 2023-05-26 09:24:03 -04:00
Konstantin Gukov
100c809f1e Do not ignore explicitly passed 4 threads 2023-05-26 09:24:03 -04:00
Konstantin Gukov
dcbdd369ad Redundant else 2023-05-26 09:24:03 -04:00
Konstantin Gukov
ace34afef2 1. Cleanup the interrupted download
2. with-syntax
2023-05-26 09:24:03 -04:00
Konstantin Gukov
8053dc014b less magic number 2023-05-26 09:24:03 -04:00
Konstantin Gukov
e98cfd97b3 convert to f-strings 2023-05-26 09:24:03 -04:00
Konstantin Gukov
2b6fb7b95e reduce nesting, better error reporting 2023-05-26 09:24:03 -04:00
Konstantin Gukov
a067f38544 Concise model matching 2023-05-26 09:24:03 -04:00
Konstantin Gukov
c1f3dd310c Log where the model was found 2023-05-26 09:24:03 -04:00
Konstantin Gukov
f96300534b Nicer handling of missing model directory.
Correct exception message.
2023-05-26 09:24:03 -04:00
Konstantin Gukov
59d7db9aad More precise condition 2023-05-26 09:24:03 -04:00
Konstantin Gukov
adc599b0a6 rm redundant json 2023-05-26 09:24:03 -04:00
Adam Treat
810a3b12cc This time remember to bump the version right after a release. 2023-05-25 18:26:33 -04:00
Adam Treat
d1ff7132c5 Bump the version number. 2023-05-25 17:08:50 -04:00
Adam Treat
afe3870b7a Libraries named differently on msvc. 2023-05-25 16:27:09 -04:00
Adam Treat
474c5387f9 Get the backend as well as the client building/working with msvc. 2023-05-25 15:22:45 -04:00
redthing1
63f57635d8 make sample print usage and cleaner 2023-05-25 11:34:21 -04:00
redthing1
dec8546abe create test project and basic model loading tests 2023-05-25 11:34:07 -04:00
redthing1
0cc86d19be ignore rider and vscode dirs 2023-05-25 11:34:07 -04:00
Adam Treat
265488e54a Add a newline 2023-05-25 11:28:06 -04:00
Adam Treat
98201540a2 Various fixes to remove unnecessary warnings. 2023-05-25 11:28:06 -04:00
Adam Treat
0403a122ca Don't use the full path in reference text. 2023-05-25 11:28:06 -04:00
Adam Treat
9b0629db8b Add context link to references. 2023-05-25 11:28:06 -04:00
Adam Treat
db9eecdce4 Store the references separately so they are not sent to datalake. 2023-05-25 11:28:06 -04:00
Adam Treat
b5380c9b7f Adds the collections to serialize and implement references for localdocs. 2023-05-25 11:28:06 -04:00