Commit Graph

92 Commits

Author SHA1 Message Date
Eric Wang
8955a9c5a1 bos, eos in generate.py 2023-03-23 13:44:45 -07:00
Eric J. Wang
1384a4d24c
Update README.md for multi-GPU training 2023-03-22 22:05:36 -07:00
bofeng huang
c7eabb86e2
Add french version "vigogne" (#127) 2023-03-22 15:59:14 -07:00
Eric J. Wang
a74793c571
Rearrange resources on README, add 13B-30B models 2023-03-22 14:17:31 -07:00
Eric Wang
b12c3b90f8 Unwind input masking to avoid confusion 2023-03-22 13:52:27 -07:00
Eric Wang
e04897baae fix fp16 inference 2023-03-21 14:31:30 -07:00
Eric J. Wang
052da42cbb
Replace Colab with HF in README 2023-03-21 13:59:44 -07:00
Eric Wang
7fb06c6c22 Revert "Mask out prompt tokens for real"
This reverts commit 4a712d4d8e.
2023-03-21 12:42:06 -07:00
Eric Wang
2204a71505 set EPOCHS back to 3 2023-03-21 11:52:28 -07:00
Eric Wang
4a712d4d8e Mask out prompt tokens for real 2023-03-21 11:24:38 -07:00
Eric Wang
fac53721a2 masking bugfix 2023-03-20 21:37:39 -07:00
Eric J. Wang
3cdbfe5b0c
Update README.md 2023-03-20 14:32:55 -07:00
Eric J. Wang
c08c34eabb
mention chatbot project in README.md 2023-03-20 14:26:56 -07:00
Eric J. Wang
f0082d8e8b
Link to resources more prominently 2023-03-20 11:30:42 -07:00
Eric J. Wang
d38802e843
Point volunteers to Open Assistant 2023-03-20 10:52:39 -07:00
Kohaku-Blueleaf
b5a1a0bca7
Add support for valid set size 0 (#83)
* Add support for valid set size 0

* Make param about valid to default when 0
2023-03-19 22:02:14 -07:00
Kohaku-Blueleaf
0af44f0262
Add option for output dir (#84) 2023-03-19 22:01:24 -07:00
Kohaku-Blueleaf
450206caaf
Fix torch.compile call on windows (#81)
* Windows not support compile

* Fix code style
2023-03-19 20:16:02 -07:00
Karun
81eb72f707
cleans up alphabetical prompts (#76) 2023-03-19 15:55:02 -07:00
Eric Wang
997f6cd81f slider for tokens generated 2023-03-19 15:53:21 -07:00
Eric Wang
cfad895aa1 mask prompt in loss 2023-03-19 15:53:21 -07:00
Eric J. Wang
d66908c0ca
Remove messy test code 2023-03-19 11:22:02 -07:00
Yaqub Mahmoud
0e752ea5f3
Update requirements.txt (#67)
Added appdirs package to requirements.txt
2023-03-19 11:15:07 -07:00
Eric Wang
c83e30ab78 generate.py tweaks 2023-03-18 23:00:18 -07:00
Eric Wang
80fd9833db don't share publicly 2023-03-18 16:43:53 -07:00
Eric Wang
6ced8d9907 fix HF export script 2023-03-18 16:42:58 -07:00
Eric J. Wang
8dc0f614c6
Update README.md 2023-03-18 13:24:42 -07:00
Eric J. Wang
d9c19ff34e
Update README.md 2023-03-17 22:27:58 -07:00
Kakigōri Maker
9dab7ba438
add multi-gpu support (ddp) (#54)
* add multi-gpu support (ddp)

* Update finetune.py
2023-03-17 22:27:33 -07:00
Eric Wang
a0295813b0 normalize cleaned data row with missing output 2023-03-17 20:52:14 -07:00
Eric Wang
3b160d745b HF export script 2023-03-17 17:56:10 -07:00
Eric Wang
8aecde83cd construciton 2023-03-17 15:11:35 -07:00
Eric Wang
cb046d647e min beams = 1 2023-03-17 15:07:08 -07:00
Eric Wang
f7044049ab dataset cleaning, visualizations 2023-03-17 15:04:25 -07:00
Peter Marelas
db4af6a7ff
Enable inference on CPU and Mac GPU using pytorch support for MPS (#48) 2023-03-17 13:53:21 -07:00
Eric J. Wang
9bff21cc68
huggingface -> Hugging Face
Update README.md
2023-03-17 11:08:01 -07:00
Ikko Eltociear Ashimine
65299df970
Update README.md
huggingface -> Hugging Face
2023-03-17 16:34:56 +09:00
Eric J. Wang
daf13eea40
Add notes about dataset and model updates 2023-03-16 21:17:55 -07:00
Eric J. Wang
d60701b895
Merge pull request #35 from T-Atlas/patch-1
Update generate.py
2023-03-16 19:39:43 -07:00
Lian Junhong
3a47bd18e8
Update generate.py
Adapting to the input function, a text box for inputting content has been added.
2023-03-17 10:30:27 +08:00
Eric Wang
c39da83e2b add Gradio interface to generate.py 2023-03-16 16:04:06 -07:00
Eric Wang
35029da078 Validation set 2023-03-16 15:05:17 -07:00
Eric Wang
5f6614e6fc Catch outdated installs 2023-03-16 12:11:47 -07:00
Eric Wang
6adce2f6c5 Fix req.txt specification to install from github 2023-03-16 12:11:47 -07:00
Eric Wang
060e7f4a50 README and requirements.txt updates 2023-03-16 12:11:46 -07:00
andreas.echavez
1862976b33 Update alpaca-lora to use transformers main branch 2023-03-16 12:11:29 -07:00
Eric Wang
c3d94707ec Expand sampling in generate.py for new test 2023-03-16 09:59:10 -07:00
Eric Wang
9c1933721d Update README to reflect new weights 2023-03-16 09:59:10 -07:00
Eric J. Wang
6681523bbe
Merge pull request #19 from antimatter15/patch-1
Fix LoRa weight merging in export

It can't hurt
2023-03-16 01:42:48 -07:00
Kevin Kwok
dde89950f3
Fix LoRa weight merging 2023-03-16 00:50:24 -07:00