Eric Wang
|
8955a9c5a1
|
bos, eos in generate.py
|
2023-03-23 13:44:45 -07:00 |
|
Eric J. Wang
|
1384a4d24c
|
Update README.md for multi-GPU training
|
2023-03-22 22:05:36 -07:00 |
|
bofeng huang
|
c7eabb86e2
|
Add french version "vigogne" (#127)
|
2023-03-22 15:59:14 -07:00 |
|
Eric J. Wang
|
a74793c571
|
Rearrange resources on README, add 13B-30B models
|
2023-03-22 14:17:31 -07:00 |
|
Eric Wang
|
b12c3b90f8
|
Unwind input masking to avoid confusion
|
2023-03-22 13:52:27 -07:00 |
|
Eric Wang
|
e04897baae
|
fix fp16 inference
|
2023-03-21 14:31:30 -07:00 |
|
Eric J. Wang
|
052da42cbb
|
Replace Colab with HF in README
|
2023-03-21 13:59:44 -07:00 |
|
Eric Wang
|
7fb06c6c22
|
Revert "Mask out prompt tokens for real"
This reverts commit 4a712d4d8e .
|
2023-03-21 12:42:06 -07:00 |
|
Eric Wang
|
2204a71505
|
set EPOCHS back to 3
|
2023-03-21 11:52:28 -07:00 |
|
Eric Wang
|
4a712d4d8e
|
Mask out prompt tokens for real
|
2023-03-21 11:24:38 -07:00 |
|
Eric Wang
|
fac53721a2
|
masking bugfix
|
2023-03-20 21:37:39 -07:00 |
|
Eric J. Wang
|
3cdbfe5b0c
|
Update README.md
|
2023-03-20 14:32:55 -07:00 |
|
Eric J. Wang
|
c08c34eabb
|
mention chatbot project in README.md
|
2023-03-20 14:26:56 -07:00 |
|
Eric J. Wang
|
f0082d8e8b
|
Link to resources more prominently
|
2023-03-20 11:30:42 -07:00 |
|
Eric J. Wang
|
d38802e843
|
Point volunteers to Open Assistant
|
2023-03-20 10:52:39 -07:00 |
|
Kohaku-Blueleaf
|
b5a1a0bca7
|
Add support for valid set size 0 (#83)
* Add support for valid set size 0
* Make param about valid to default when 0
|
2023-03-19 22:02:14 -07:00 |
|
Kohaku-Blueleaf
|
0af44f0262
|
Add option for output dir (#84)
|
2023-03-19 22:01:24 -07:00 |
|
Kohaku-Blueleaf
|
450206caaf
|
Fix torch.compile call on windows (#81)
* Windows not support compile
* Fix code style
|
2023-03-19 20:16:02 -07:00 |
|
Karun
|
81eb72f707
|
cleans up alphabetical prompts (#76)
|
2023-03-19 15:55:02 -07:00 |
|
Eric Wang
|
997f6cd81f
|
slider for tokens generated
|
2023-03-19 15:53:21 -07:00 |
|
Eric Wang
|
cfad895aa1
|
mask prompt in loss
|
2023-03-19 15:53:21 -07:00 |
|
Eric J. Wang
|
d66908c0ca
|
Remove messy test code
|
2023-03-19 11:22:02 -07:00 |
|
Yaqub Mahmoud
|
0e752ea5f3
|
Update requirements.txt (#67)
Added appdirs package to requirements.txt
|
2023-03-19 11:15:07 -07:00 |
|
Eric Wang
|
c83e30ab78
|
generate.py tweaks
|
2023-03-18 23:00:18 -07:00 |
|
Eric Wang
|
80fd9833db
|
don't share publicly
|
2023-03-18 16:43:53 -07:00 |
|
Eric Wang
|
6ced8d9907
|
fix HF export script
|
2023-03-18 16:42:58 -07:00 |
|
Eric J. Wang
|
8dc0f614c6
|
Update README.md
|
2023-03-18 13:24:42 -07:00 |
|
Eric J. Wang
|
d9c19ff34e
|
Update README.md
|
2023-03-17 22:27:58 -07:00 |
|
Kakigōri Maker
|
9dab7ba438
|
add multi-gpu support (ddp) (#54)
* add multi-gpu support (ddp)
* Update finetune.py
|
2023-03-17 22:27:33 -07:00 |
|
Eric Wang
|
a0295813b0
|
normalize cleaned data row with missing output
|
2023-03-17 20:52:14 -07:00 |
|
Eric Wang
|
3b160d745b
|
HF export script
|
2023-03-17 17:56:10 -07:00 |
|
Eric Wang
|
8aecde83cd
|
construciton
|
2023-03-17 15:11:35 -07:00 |
|
Eric Wang
|
cb046d647e
|
min beams = 1
|
2023-03-17 15:07:08 -07:00 |
|
Eric Wang
|
f7044049ab
|
dataset cleaning, visualizations
|
2023-03-17 15:04:25 -07:00 |
|
Peter Marelas
|
db4af6a7ff
|
Enable inference on CPU and Mac GPU using pytorch support for MPS (#48)
|
2023-03-17 13:53:21 -07:00 |
|
Eric J. Wang
|
9bff21cc68
|
huggingface -> Hugging Face
Update README.md
|
2023-03-17 11:08:01 -07:00 |
|
Ikko Eltociear Ashimine
|
65299df970
|
Update README.md
huggingface -> Hugging Face
|
2023-03-17 16:34:56 +09:00 |
|
Eric J. Wang
|
daf13eea40
|
Add notes about dataset and model updates
|
2023-03-16 21:17:55 -07:00 |
|
Eric J. Wang
|
d60701b895
|
Merge pull request #35 from T-Atlas/patch-1
Update generate.py
|
2023-03-16 19:39:43 -07:00 |
|
Lian Junhong
|
3a47bd18e8
|
Update generate.py
Adapting to the input function, a text box for inputting content has been added.
|
2023-03-17 10:30:27 +08:00 |
|
Eric Wang
|
c39da83e2b
|
add Gradio interface to generate.py
|
2023-03-16 16:04:06 -07:00 |
|
Eric Wang
|
35029da078
|
Validation set
|
2023-03-16 15:05:17 -07:00 |
|
Eric Wang
|
5f6614e6fc
|
Catch outdated installs
|
2023-03-16 12:11:47 -07:00 |
|
Eric Wang
|
6adce2f6c5
|
Fix req.txt specification to install from github
|
2023-03-16 12:11:47 -07:00 |
|
Eric Wang
|
060e7f4a50
|
README and requirements.txt updates
|
2023-03-16 12:11:46 -07:00 |
|
andreas.echavez
|
1862976b33
|
Update alpaca-lora to use transformers main branch
|
2023-03-16 12:11:29 -07:00 |
|
Eric Wang
|
c3d94707ec
|
Expand sampling in generate.py for new test
|
2023-03-16 09:59:10 -07:00 |
|
Eric Wang
|
9c1933721d
|
Update README to reflect new weights
|
2023-03-16 09:59:10 -07:00 |
|
Eric J. Wang
|
6681523bbe
|
Merge pull request #19 from antimatter15/patch-1
Fix LoRa weight merging in export
It can't hurt
|
2023-03-16 01:42:48 -07:00 |
|
Kevin Kwok
|
dde89950f3
|
Fix LoRa weight merging
|
2023-03-16 00:50:24 -07:00 |
|