Commit Graph

382 Commits

Author SHA1 Message Date
oobabooga
ac30b004ef Pin fastapi/pydantic requirement versions 2024-09-06 18:45:15 -07:00
dependabot[bot]
64e16e9a46
Update accelerate requirement from ==0.32.* to ==0.33.* (#6291) 2024-08-19 23:34:10 -03:00
dependabot[bot]
68f928b5e0
Update peft requirement from ==0.8.* to ==0.12.* (#6292) 2024-08-19 23:33:56 -03:00
oobabooga
4d8c1801c2 Bump llama-cpp-python to 0.2.89 2024-08-19 17:45:01 -07:00
oobabooga
bf8187124d Bump llama-cpp-python to 0.2.88 2024-08-13 12:40:18 -07:00
oobabooga
089d5a9415 Bump llama-cpp-python to 0.2.87 2024-08-07 20:36:28 -07:00
oobabooga
81773f7f36 Bump transformers to 4.44 2024-08-06 20:07:05 -07:00
oobabooga
608545d282 Bump llama-cpp-python to 0.2.85 2024-07-31 18:44:46 -07:00
oobabooga
92ab3a9a6a Bump llama-cpp-python to 0.2.84 2024-07-28 15:13:06 -07:00
oobabooga
e4624fbc68
Merge branch 'main' into dev 2024-07-25 12:03:45 -03:00
oobabooga
3b2c23dfb5 Add AutoAWQ 0.2.6 wheels for PyTorch 2.2.2 2024-07-24 11:15:00 -07:00
oobabooga
8a5f110c14 Bump ExLlamaV2 to 0.1.8 2024-07-24 09:22:48 -07:00
oobabooga
af839d20ac Remove the AutoAWQ requirement 2024-07-23 19:38:39 -07:00
oobabooga
9d5513fda0 Remove the AutoAWQ requirement 2024-07-23 19:38:04 -07:00
oobabooga
f66ab63d64 Bump transformers to 4.43 2024-07-23 14:06:34 -07:00
oobabooga
3ee682208c Revert "Bump hqq from 0.1.7.post3 to 0.1.8 (#6238)"
This reverts commit 1c3671699c.
2024-07-22 19:53:56 -07:00
oobabooga
aa809e420e Bump llama-cpp-python to 0.2.83, add back tensorcore wheels
Also add back the progress bar patch
2024-07-22 18:05:11 -07:00
oobabooga
11bbf71aa5
Bump back llama-cpp-python (#6257) 2024-07-22 16:19:41 -03:00
oobabooga
0f53a736c1 Revert the llama-cpp-python update 2024-07-22 12:02:25 -07:00
oobabooga
a687f950ba Remove the tensorcores llama.cpp wheels
They are not faster than the default wheels anymore and they use a lot of space.
2024-07-22 11:54:35 -07:00
oobabooga
7d2449f8b0 Bump llama-cpp-python to 0.2.82.3 (unofficial build) 2024-07-22 11:49:20 -07:00
dependabot[bot]
1c3671699c
Bump hqq from 0.1.7.post3 to 0.1.8 (#6238) 2024-07-20 18:20:26 -03:00
oobabooga
b19d239a60 Bump flash-attention to 2.6.1 2024-07-12 20:16:11 -07:00
dependabot[bot]
063d2047dd
Update accelerate requirement from ==0.31.* to ==0.32.* (#6217) 2024-07-11 19:56:42 -03:00
oobabooga
01e4721da7 Bump ExLlamaV2 to 0.1.7 2024-07-11 12:33:46 -07:00
oobabooga
fa075e41f4 Bump llama-cpp-python to 0.2.82 2024-07-10 06:03:24 -07:00
oobabooga
7e22eaa36c Bump llama-cpp-python to 0.2.81 2024-07-02 20:29:35 -07:00
dependabot[bot]
9660f6f10e
Bump aqlm[cpu,gpu] from 1.1.5 to 1.1.6 (#6157) 2024-06-27 21:13:02 -03:00
dependabot[bot]
a5df8f4e3c
Bump jinja2 from 3.1.2 to 3.1.4 (#6172) 2024-06-27 21:12:39 -03:00
dependabot[bot]
c6cec0588c
Update accelerate requirement from ==0.30.* to ==0.31.* (#6156) 2024-06-27 21:12:02 -03:00
oobabooga
66090758df Bump transformers to 4.42 (for gemma support) 2024-06-27 11:26:02 -07:00
oobabooga
602b455507 Bump llama-cpp-python to 0.2.79 2024-06-24 20:26:38 -07:00
oobabooga
7db8b3b532 Bump ExLlamaV2 to 0.1.6 2024-06-24 05:38:11 -07:00
oobabooga
125bb7b03b Revert "Bump llama-cpp-python to 0.2.78"
This reverts commit b6eaf7923e.
2024-06-23 19:54:28 -07:00
oobabooga
b6eaf7923e Bump llama-cpp-python to 0.2.78 2024-06-14 21:22:09 -07:00
oobabooga
9420973b62
Downgrade PyTorch to 2.2.2 (#6124) 2024-06-14 16:42:03 -03:00
dependabot[bot]
fdd8fab9cf
Bump hqq from 0.1.7.post2 to 0.1.7.post3 (#6090) 2024-06-14 13:46:35 -03:00
oobabooga
8930bfc5f4
Bump PyTorch, ExLlamaV2, flash-attention (#6122) 2024-06-13 20:38:31 -03:00
oobabooga
bd7cc4234d
Backend cleanup (#6025) 2024-05-21 13:32:02 -03:00
dependabot[bot]
2de586f586
Update accelerate requirement from ==0.27.* to ==0.30.* (#5989) 2024-05-19 20:03:18 -03:00
oobabooga
0d90b3a25c Bump llama-cpp-python to 0.2.75 2024-05-18 05:26:26 -07:00
oobabooga
9557f49f2f Bump llama-cpp-python to 0.2.73 2024-05-11 10:53:19 -07:00
oobabooga
e61055253c Bump llama-cpp-python to 0.2.69, add --flash-attn option 2024-05-03 04:31:22 -07:00
oobabooga
0476f9fe70 Bump ExLlamaV2 to 0.0.20 2024-05-01 16:20:50 -07:00
oobabooga
ae0f28530c Bump llama-cpp-python to 0.2.68 2024-05-01 08:40:50 -07:00
oobabooga
51fb766bea
Add back my llama-cpp-python wheels, bump to 0.2.65 (#5964) 2024-04-30 09:11:31 -03:00
oobabooga
9b623b8a78
Bump llama-cpp-python to 0.2.64, use official wheels (#5921) 2024-04-23 23:17:05 -03:00
Ashley Kleynhans
0877741b03
Bumped ExLlamaV2 to version 0.0.19 to resolve #5851 (#5880) 2024-04-19 19:04:40 -03:00
oobabooga
b30bce3b2f Bump transformers to 4.40 2024-04-18 16:19:31 -07:00
Philipp Emanuel Weidmann
a0c69749e6
Revert sse-starlette version bump because it breaks API request cancellation (#5873) 2024-04-18 15:05:00 -03:00