Added reference to Dolly 15k Dataset (#330)

* Update README.md

* Feature HF dataset more prominently

---------

Co-authored-by: Eric Wang <eric.james.wang@gmail.com>
Chris Alexiuk 2023-04-13 16:46:48 -04:00 committed by GitHub
parent 148cc78d26
commit ea0e96b017
GPG Key ID: 4AEE18F83AFDEB23

@@ -155,6 +155,7 @@ docker-compose down --volumes --rmi all
- [Alpaca-LoRA-Serve](https://github.com/deep-diver/Alpaca-LoRA-Serve), a ChatGPT-style interface for Alpaca models
- [AlpacaDataCleaned](https://github.com/gururise/AlpacaDataCleaned), a project to improve the quality of the Alpaca dataset
- [GPT-4 Alpaca Data](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM) a project to port synthetic data creation to GPT-4
- [dolly-15k-instruction-alpaca-format](https://huggingface.co/datasets/c-s-ale/dolly-15k-instruction-alpaca-format), an Alpaca-compatible version of [Databricks' Dolly 15k human-generated instruct dataset](https://github.com/databrickslabs/dolly/tree/master/data) (see [blog](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm))
- Various adapter weights (download at own risk):
  - 7B:
    - <https://huggingface.co/tloen/alpaca-lora-7b>
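
For readers wondering what "Alpaca-compatible" means for the Dolly 15k entry added in this hunk, here is a minimal sketch of the field mapping. It assumes the published Dolly 15k record fields (`instruction`, `context`, `response`, `category`) and the Alpaca fields (`instruction`, `input`, `output`). The helper name `dolly_to_alpaca` is hypothetical, and this commit does not show how the linked dataset was actually produced.

```python
# Hypothetical helper, not part of this commit or of either dataset's tooling.
# Assumed Dolly 15k fields: instruction, context, response, category.
# Assumed Alpaca fields: instruction, input, output.
def dolly_to_alpaca(record: dict) -> dict:
    """Map one Dolly-style record onto the three Alpaca-style fields."""
    return {
        "instruction": record["instruction"],
        # Dolly's optional reference passage plays the role of Alpaca's "input".
        "input": record.get("context", ""),
        "output": record["response"],
        # "category" has no Alpaca counterpart and is dropped in this sketch.
    }


# Illustrative record written for this example, not taken from the dataset.
sample = {
    "instruction": "Summarize the passage in one sentence.",
    "context": "LoRA adds small trainable matrices to a frozen model.",
    "response": "LoRA fine-tunes a frozen model through small added matrices.",
    "category": "summarization",
}
converted = dolly_to_alpaca(sample)
```

Records without a `context` value end up with an empty `input`, which matches how Alpaca-style data represents instructions that need no extra input.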