commit     | author           | date                       | message
-----------+------------------+----------------------------+------------------------------------------------
549f106879 | oobabooga        | 2024-02-14 21:57:48 -08:00 | Bump ExLlamaV2 to v0.0.13.2
7123ac3f77 | oobabooga        | 2024-02-14 23:34:30 -03:00 | Remove "Maximum UI updates/second" parameter (#5507)
33c4ce0720 | DominikKowalczyk | 2024-02-14 23:28:26 -03:00 | Bump gradio to 4.19 (#5419) (Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>)
04d8bdf929 | oobabooga        | 2024-02-14 06:31:20 -08:00 | Fix ExLlamaV2 requirement on Windows
b16958575f | oobabooga        | 2024-02-13 19:48:32 -08:00 | Minor bug fix
d47182d9d1 | oobabooga        | 2024-02-14 00:28:51 -03:00 | llamacpp_HF: do not use oobabooga/llama-tokenizer (#5499)
3a9ce3cfa6 | oobabooga        | 2024-02-13 19:06:32 -08:00 | Update stalebot message
93dd31fc0f | oobabooga        | 2024-02-13 16:07:33 -08:00 | Increase stalebot timeout
069ed7c6ef | oobabooga        | 2024-02-13 16:05:41 -08:00 | Lint
193548edce | oobabooga        | 2024-02-13 16:00:06 -08:00 | Minor fix to ExLlamaV2 requirements
25b655faeb | oobabooga        | 2024-02-13 15:49:53 -08:00 | Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
f99f1fc68e | oobabooga        | 2024-02-13 15:49:20 -08:00 | Bump llama-cpp-python to 0.2.42
d8081e85ec | dependabot[bot]  | 2024-02-13 16:27:18 -03:00 | Update peft requirement from ==0.7.* to ==0.8.* (#5446)
653b195b1e | dependabot[bot]  | 2024-02-13 16:26:35 -03:00 | Update numpy requirement from ==1.24.* to ==1.26.* (#5490)
147b4cf3e0 | dependabot[bot]  | 2024-02-13 16:25:02 -03:00 | Bump hqq from 0.1.2.post1 to 0.1.3 (#5489)
512933fa44 | Steven K         | 2024-02-13 16:24:30 -03:00 | Update main.css to allow scrolling in code blocks (#5495)
e9fea353c5 | oobabooga        | 2024-02-13 11:22:34 -08:00 | Bump llama-cpp-python to 0.2.40
7342afaf19 | oobabooga        | 2024-02-08 20:36:11 -08:00 | Update the PyTorch installation instructions
86c320ab5a | oobabooga        | 2024-02-07 21:56:10 -08:00 | llama.cpp: add a progress bar for prompt evaluation
acea6a6669 | oobabooga        | 2024-02-07 08:24:29 -08:00 | Add more exllamav2 wheels
35537ad3d1 | oobabooga        | 2024-02-07 13:17:04 -03:00 | Bump exllamav2 to 0.0.13.1 (#5463)
b8e25e8678 | oobabooga        | 2024-02-07 06:50:47 -08:00 | Bump llama-cpp-python to 0.2.39
c55b8ce932 | oobabooga        | 2024-02-06 08:51:52 -08:00 | Improved random preset generation
4e34ae0587 | oobabooga        | 2024-02-06 08:22:08 -08:00 | Minor logging improvements
3add2376cd | oobabooga        | 2024-02-06 07:09:21 -08:00 | Better warpers logging
494cc3c5b0 | oobabooga        | 2024-02-06 07:05:32 -08:00 | Handle empty sampler priority field, use default values
775902c1f2 | oobabooga        | 2024-02-06 06:49:22 -08:00 | Sampler priority: better logging, always save to presets
acfbe6b3b3 | oobabooga        | 2024-02-06 06:35:01 -08:00 | Minor doc changes
8ee3cea7cb | oobabooga        | 2024-02-06 06:31:27 -08:00 | Improve some log messages
8a6d9abb41 | oobabooga        | 2024-02-06 06:26:27 -08:00 | Small fixes
2a1063eff5 | oobabooga        | 2024-02-06 06:21:36 -08:00 | Revert "Remove non-HF ExLlamaV2 loader (#5431)" (reverts commit cde000d478)
8c35fefb3b | oobabooga        | 2024-02-06 11:20:10 -03:00 | Add custom sampler order support (#5443)
7301c7618f | oobabooga        | 2024-02-04 21:49:58 -08:00 | Minor change to Models tab
f234fbe83f | oobabooga        | 2024-02-04 21:44:53 -08:00 | Improve a log message after previous commit
7073665a10 | oobabooga        | 2024-02-05 02:31:24 -03:00 | Truncate long chat completions inputs (#5439)
9033fa5eee | oobabooga        | 2024-02-04 19:30:22 -08:00 | Organize the Model tab
cd4ffd3dd4 | oobabooga        | 2024-02-04 18:48:04 -08:00 | Update docs
92d0617bce | oobabooga        | 2024-02-04 18:40:46 -08:00 | Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
a210999255 | oobabooga        | 2024-02-04 18:40:25 -08:00 | Bump safetensors version
9fdee65cf5 | Badis Ghoubali   | 2024-02-04 23:39:15 -03:00 | Improve ChatML template (#5411)
2a45620c85 | Forkoz           | 2024-02-04 23:36:40 -03:00 | Split by rows instead of layers for llama.cpp multi-gpu (#5435)
3df7e151f7 | Badis Ghoubali   | 2024-02-04 18:15:30 -03:00 | fix the n_batch slider (#5436)
4e188eeb80 | oobabooga        | 2024-02-03 20:40:10 -08:00 | Lint
cde000d478 | oobabooga        | 2024-02-04 01:15:51 -03:00 | Remove non-HF ExLlamaV2 loader (#5431)
b6077b02e4 | kalomaze         | 2024-02-04 00:20:02 -03:00 | Quadratic sampling (#5403) (Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>)
e98d1086f5 | oobabooga        | 2024-02-01 20:09:30 -03:00 | Bump llama-cpp-python to 0.2.38 (#5420)
167ee72d4e | oobabooga        | 2024-01-30 09:16:23 -08:00 | Lint
ee65f4f014 | oobabooga        | 2024-01-30 09:14:11 -08:00 | Downloader: don't assume that huggingface_hub is installed
89f6036e98 | oobabooga        | 2024-01-30 13:19:20 -03:00 | Bump llama-cpp-python, remove python 3.8/3.9, cuda 11.7 (#5397)
528318b700 | Forkoz           | 2024-01-28 21:42:03 -03:00 | API: Remove tiktoken from logit bias (#5391)