Commit Graph

142 Commits

Author SHA1 Message Date
Eric Wang
b8c32be806 Add counting test 2023-03-16 00:05:32 -07:00
Eric J. Wang
88bfa8f8b0
Add Windows install instructions
Resolves #17
2023-03-16 00:01:01 -07:00
Eric Wang
2fa1c66388 repair tokenization logic, again 2023-03-15 23:58:44 -07:00
Eric Wang
024dde7dab Revert "fix <eos> tokenization"
This reverts commit 6b69ea8665.
2023-03-15 22:52:54 -07:00
Eric Wang
6b69ea8665 fix <eos> tokenization 2023-03-15 18:21:06 -07:00
Eric Wang
6f218218f4 Add checkpoint export to README 2023-03-15 17:24:59 -07:00
Eric Wang
d68ff15ceb generate.py memory, perf updates 2023-03-15 17:22:22 -07:00
Eric Wang
056b81177a Add script for converting weights from HF 2023-03-15 17:17:32 -07:00
Eric Wang
07f5b68e0f torch.no_grad 2023-03-15 11:11:26 -07:00
Eric Wang
956dea5d28 update length notebook 2023-03-15 11:11:01 -07:00
Eric Wang
a2607faff0 fix finetuning code :( 2023-03-14 21:45:12 -07:00
Eric Wang
6149706680 add text-davinci-003 to comparisons 2023-03-14 21:41:02 -07:00
Eric Wang
d714a73e8c Update README.md with new checkpoint details 2023-03-14 21:33:12 -07:00
Eric J. Wang
6a8b163f3e
Link to HuggingFace Hub 2023-03-14 20:53:03 -07:00
Eric J. Wang
19af668cb4
Add CoLab demo 2023-03-14 20:47:10 -07:00
Eric Wang
ec98533876 Update README.md; clean up hyperparameters 2023-03-14 16:30:38 -07:00
Eric Wang
46ddd2ca85 Ready to go 2023-03-14 15:10:33 -07:00
Eric Wang
648af26073 update hyperparams 2023-03-14 08:51:30 -07:00
Eric Wang
5cd474bcc0 lr=2e-5 2023-03-14 08:47:49 -07:00
Eric J. Wang
1193c63833
Merge pull request #6 from janmaltel/janmaltel/input-bug
Fix bug in generate promp using 'instruction' instead of 'input'
2023-03-14 08:42:13 -07:00
Jan Malte Lichtenberg
a3b80fdbd5 Fix bug in generate promp using 'instruction' instead of 'input' 2023-03-14 15:14:37 +01:00
Eric J. Wang
6f465812d8
Update README.md 2023-03-13 23:20:11 -07:00
Eric Wang
29336ecdd1 typos 2023-03-13 23:13:05 -07:00
Eric Wang
c978ee6f71 fix zphang commit in place 2023-03-13 23:10:41 -07:00
Eric Wang
9aefbd6fe1 fix alpaca citation 2023-03-13 23:07:24 -07:00
Eric Wang
01cb7f73c8 README copy 2023-03-13 23:06:36 -07:00
Eric Wang
33ce1ed54e elaborate on inference, training 2023-03-13 22:49:33 -07:00
Eric Wang
b5538e1df5 clean up dependencies 2023-03-13 22:47:26 -07:00
Eric Wang
bf4ca26b21 clean up unused files 2023-03-13 22:45:34 -07:00
Eric Wang
13d55f437e update lengths notebook 2023-03-13 22:38:22 -07:00
Eric Wang
b6ee217aa9 Add header removal to TODOs 2023-03-13 22:25:47 -07:00
Eric J. Wang
c95ebb07de
Update README.md 2023-03-13 21:53:51 -07:00
Eric Wang
15fb2b178b lengths cleanup 2023-03-13 21:53:19 -07:00
Eric Wang
41e0ff6c78 tokenizer changes 2023-03-13 21:53:19 -07:00
Eric J. Wang
0b8b0e0f90
Update README.md 2023-03-13 21:03:36 -07:00
Eric Wang
a7ccc3603e To-dos, etc 2023-03-13 17:50:38 -07:00
Eric Wang
6707775517 README formatting 2023-03-13 17:44:21 -07:00
Eric Wang
8f7447ea01 document generate.py 2023-03-13 17:42:46 -07:00
Eric Wang
df2a5dc4be cleanup notebooks 2023-03-13 17:33:27 -07:00
Eric Wang
357ec81a17 decapoda 2023-03-13 17:23:29 -07:00
Eric Wang
63121244c8 Licenses and whatnot 2023-03-13 15:00:05 -07:00
Eric Wang
26f64780ad initial commit 2023-03-13 14:34:26 -07:00