Eric Wang
|
056b81177a
|
Add script for converting weights from HF
|
2023-03-15 17:17:32 -07:00 |
|
Eric Wang
|
07f5b68e0f
|
torch.no_grad
|
2023-03-15 11:11:26 -07:00 |
|
Eric Wang
|
956dea5d28
|
update length notebook
|
2023-03-15 11:11:01 -07:00 |
|
Eric Wang
|
a2607faff0
|
fix finetuning code :(
|
2023-03-14 21:45:12 -07:00 |
|
Eric Wang
|
6149706680
|
add text-davinci-003 to comparisons
|
2023-03-14 21:41:02 -07:00 |
|
Eric Wang
|
d714a73e8c
|
Update README.md with new checkpoint details
|
2023-03-14 21:33:12 -07:00 |
|
Eric J. Wang
|
6a8b163f3e
|
Link to HuggingFace Hub
|
2023-03-14 20:53:03 -07:00 |
|
Eric J. Wang
|
19af668cb4
|
Add CoLab demo
|
2023-03-14 20:47:10 -07:00 |
|
Eric Wang
|
ec98533876
|
Update README.md; clean up hyperparameters
|
2023-03-14 16:30:38 -07:00 |
|
Eric Wang
|
46ddd2ca85
|
Ready to go
|
2023-03-14 15:10:33 -07:00 |
|
Eric Wang
|
648af26073
|
update hyperparams
|
2023-03-14 08:51:30 -07:00 |
|
Eric Wang
|
5cd474bcc0
|
lr=2e-5
|
2023-03-14 08:47:49 -07:00 |
|
Eric J. Wang
|
1193c63833
|
Merge pull request #6 from janmaltel/janmaltel/input-bug
Fix bug in generate promp using 'instruction' instead of 'input'
|
2023-03-14 08:42:13 -07:00 |
|
Jan Malte Lichtenberg
|
a3b80fdbd5
|
Fix bug in generate promp using 'instruction' instead of 'input'
|
2023-03-14 15:14:37 +01:00 |
|
Eric J. Wang
|
6f465812d8
|
Update README.md
|
2023-03-13 23:20:11 -07:00 |
|
Eric Wang
|
29336ecdd1
|
typos
|
2023-03-13 23:13:05 -07:00 |
|
Eric Wang
|
c978ee6f71
|
fix zphang commit in place
|
2023-03-13 23:10:41 -07:00 |
|
Eric Wang
|
9aefbd6fe1
|
fix alpaca citation
|
2023-03-13 23:07:24 -07:00 |
|
Eric Wang
|
01cb7f73c8
|
README copy
|
2023-03-13 23:06:36 -07:00 |
|
Eric Wang
|
33ce1ed54e
|
elaborate on inference, training
|
2023-03-13 22:49:33 -07:00 |
|
Eric Wang
|
b5538e1df5
|
clean up dependencies
|
2023-03-13 22:47:26 -07:00 |
|
Eric Wang
|
bf4ca26b21
|
clean up unused files
|
2023-03-13 22:45:34 -07:00 |
|
Eric Wang
|
13d55f437e
|
update lengths notebook
|
2023-03-13 22:38:22 -07:00 |
|
Eric Wang
|
b6ee217aa9
|
Add header removal to TODOs
|
2023-03-13 22:25:47 -07:00 |
|
Eric J. Wang
|
c95ebb07de
|
Update README.md
|
2023-03-13 21:53:51 -07:00 |
|
Eric Wang
|
15fb2b178b
|
lengths cleanup
|
2023-03-13 21:53:19 -07:00 |
|
Eric Wang
|
41e0ff6c78
|
tokenizer changes
|
2023-03-13 21:53:19 -07:00 |
|
Eric J. Wang
|
0b8b0e0f90
|
Update README.md
|
2023-03-13 21:03:36 -07:00 |
|
Eric Wang
|
a7ccc3603e
|
To-dos, etc
|
2023-03-13 17:50:38 -07:00 |
|
Eric Wang
|
6707775517
|
README formatting
|
2023-03-13 17:44:21 -07:00 |
|
Eric Wang
|
8f7447ea01
|
document generate.py
|
2023-03-13 17:42:46 -07:00 |
|
Eric Wang
|
df2a5dc4be
|
cleanup notebooks
|
2023-03-13 17:33:27 -07:00 |
|
Eric Wang
|
357ec81a17
|
decapoda
|
2023-03-13 17:23:29 -07:00 |
|
Eric Wang
|
63121244c8
|
Licenses and whatnot
|
2023-03-13 15:00:05 -07:00 |
|
Eric Wang
|
26f64780ad
|
initial commit
|
2023-03-13 14:34:26 -07:00 |
|