Added reference to Dolly 15k Dataset (#330)

* Update README.md * Feature HF dataset more prominently --------- Co-authored-by: Eric Wang <eric.james.wang@gmail.com>
2024-10-01 01:05:56 -04:00 · 2023-04-13 16:46:48 -04:00 · 2023-04-13 16:46:48 -04:00 · ea0e96b017
commit ea0e96b017
parent 148cc78d26
1 changed files with 1 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -155,6 +155,7 @@ docker-compose down --volumes --rmi all
 - [Alpaca-LoRA-Serve](https://github.com/deep-diver/Alpaca-LoRA-Serve), a ChatGPT-style interface for Alpaca models
 - [AlpacaDataCleaned](https://github.com/gururise/AlpacaDataCleaned), a project to improve the quality of the Alpaca dataset
 - [GPT-4 Alpaca Data](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM) a project to port synthetic data creation to GPT-4
+- [dolly-15k-instruction-alpaca-format](https://huggingface.co/datasets/c-s-ale/dolly-15k-instruction-alpaca-format), an Alpaca-compatible version of [Databricks' Dolly 15k human-generated instruct dataset](https://github.com/databrickslabs/dolly/tree/master/data) (see [blog](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm))
 - Various adapter weights (download at own risk):
  - 7B:
    - <https://huggingface.co/tloen/alpaca-lora-7b>