AT
|
d9725c24be
|
Revert "Improved implementation management code"
This reverts commit bc079a8fb1 .
|
2023-05-30 13:12:51 -04:00 |
|
Adam Treat
|
57f1db9cfc
|
Fix warnings in backend.
|
2023-05-30 12:37:18 -04:00 |
|
niansa
|
bc079a8fb1
|
Improved implementation management code
|
2023-05-30 08:09:51 -04:00 |
|
niansa
|
a77a4a7e20
|
Fixed clblast/cuda build in llama.cpp.cmake
|
2023-05-26 19:39:45 +02:00 |
|
Adam Treat
|
ff5be8865b
|
Mark these as const
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
0bec24980f
|
Initial build variant auto detection
|
2023-05-26 13:27:15 -04:00 |
|
niansa/tuxifan
|
0788fe0483
|
Reverted llama_sample_top_p_top_k back to being static
Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
4dc1df4719
|
Unified llamamodel{.cpp,_impl.h} for all llama.cpp versions
|
2023-05-26 13:27:15 -04:00 |
|
niansa/tuxifan
|
75c787de7f
|
Support new 230519 llama.cpp
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
6fad98e28f
|
Aligned llama implementation style to other implementations
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
5d723221b7
|
Fix BOS never getting added in llamamodel
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
7c3dd3cdad
|
Properly check magic
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
e8a83be3e1
|
Re-add llamamodel.cpp based on upstream llama.cpp
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
c73b8ef9d4
|
Rename llama.cpp fork submodule to llama.cpp-old, add the upstream one as llama.cpp
|
2023-05-26 13:27:15 -04:00 |
|
niansa/tuxifan
|
af9ea34bd0
|
Fixed compilation due to dlhandle name change
Seems like I've overlooked an `#include` when renaming the file
Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
350033a23a
|
Fixed llama capitalization
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
3607456cc3
|
Reverted line break changes
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
a342acfc73
|
Removed operator ""_MiB
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
5aac0fa04a
|
Renamed dlhandle.hpp -> dlhandle.h; added file to CMake
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
c7f78fe7f9
|
Fixed oversight in last commit in llmodel_c.cpp
|
2023-05-26 13:27:15 -04:00 |
|
niansa
|
7878244170
|
Implemented basic dynamic model implementation loading
|
2023-05-26 13:27:15 -04:00 |
|
Richard Guo
|
73db20ba85
|
hotfix default verbose optioin
|
2023-05-26 12:49:32 -04:00 |
|
Konstantin Gukov
|
a6f3e94458
|
one funcion to append .bin suffix
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
659244f0a2
|
Correct indentation of the multiline error message
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
5e61008424
|
Add optional verbosity
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
e05ee9466a
|
Correct return type
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
100c809f1e
|
Do not ignore explicitly passed 4 threads
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
dcbdd369ad
|
Redundant else
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
ace34afef2
|
1. Cleanup the interrupted download
2. with-syntax
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
8053dc014b
|
less magic number
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
e98cfd97b3
|
convert to f-strings
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
2b6fb7b95e
|
reduce nesting, better error reporting
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
a067f38544
|
Concise model matching
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
c1f3dd310c
|
Log where the model was found
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
f96300534b
|
Nicer handling of missing model directory.
Correct exception message.
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
59d7db9aad
|
More precise condition
|
2023-05-26 09:24:03 -04:00 |
|
Konstantin Gukov
|
adc599b0a6
|
rm redundant json
|
2023-05-26 09:24:03 -04:00 |
|
Adam Treat
|
810a3b12cc
|
This time remember to bump the version right after a release.
|
2023-05-25 18:26:33 -04:00 |
|
Adam Treat
|
d1ff7132c5
|
Bump the version number.
|
2023-05-25 17:08:50 -04:00 |
|
Adam Treat
|
afe3870b7a
|
Libraries named differently on msvc.
|
2023-05-25 16:27:09 -04:00 |
|
Adam Treat
|
474c5387f9
|
Get the backend as well as the client building/working with msvc.
|
2023-05-25 15:22:45 -04:00 |
|
redthing1
|
63f57635d8
|
make sample print usage and cleaner
|
2023-05-25 11:34:21 -04:00 |
|
redthing1
|
dec8546abe
|
create test project and basic model loading tests
|
2023-05-25 11:34:07 -04:00 |
|
redthing1
|
0cc86d19be
|
ignore rider and vscode dirs
|
2023-05-25 11:34:07 -04:00 |
|
Adam Treat
|
265488e54a
|
Add a newline
|
2023-05-25 11:28:06 -04:00 |
|
Adam Treat
|
98201540a2
|
Various fixes to remove unnecessary warnings.
|
2023-05-25 11:28:06 -04:00 |
|
Adam Treat
|
0403a122ca
|
Don't use the full path in reference text.
|
2023-05-25 11:28:06 -04:00 |
|
Adam Treat
|
9b0629db8b
|
Add context link to references.
|
2023-05-25 11:28:06 -04:00 |
|
Adam Treat
|
db9eecdce4
|
Store the references separately so they are not sent to datalake.
|
2023-05-25 11:28:06 -04:00 |
|
Adam Treat
|
b5380c9b7f
|
Adds the collections to serialize and implement references for localdocs.
|
2023-05-25 11:28:06 -04:00 |
|