gpt4all/README.md

<h1 align="center">GPT4All</h1>
<p align="center">Demo, data and code to train an assistant-style large language model with ~800k GPT-3.5-Turbo Generations based on LLaMa</p>

<p align="center">
<a href="https://s3.amazonaws.com/static.nomic.ai/gpt4all/2023_GPT4All_Technical_Report.pdf">:green_book: Technical Report</a>
</p>


![gpt4all-lora-demo](https://user-images.githubusercontent.com/13879686/228352356-de66ca7a-df70-474e-b929-2e3656165051.gif)

Run on M1 Mac (not sped up!)

# Try it yourself

Clone this repository down and download the CPU quantized gpt4all model.
- [gpt4all-quantized](https://s3.amazonaws.com/static.nomic.ai/gpt4all/models/gpt4all-lora-quantized.bin)

Place the quantized model in the `chat` directory and start chatting by running:

- `./chat/gpt4all-lora-quantized-OSX-m1` on M1 Mac/OSX
- `./chat/gpt4all-lora-quantized-linux-x86` on Windows/Linux

To compile for custom hardware, see our fork of the [Alpaca C++](https://github.com/zanussbaum/gpt4all.cpp) repo.

Note: the full model on GPU (16GB of RAM required) performs much better in our qualitative evaluations.

# Reproducibility

Trained LoRa Weights:
- gpt4all-lora:  https://huggingface.co/nomic-ai/gpt4all-lora
- gpt4all-lora-epoch-2 https://huggingface.co/nomic-ai/gpt4all-lora-epoch-2

Raw Data:
- [Training Data Without P3](https://s3.amazonaws.com/static.nomic.ai/gpt4all/2022_03_27/gpt4all_curated_data_without_p3_2022_03_27.tar.gz)
  - Explorer: https://atlas.nomic.ai/map/gpt4all_data_clean_without_p3
- [Full Dataset with P3](https://s3.amazonaws.com/static.nomic.ai/gpt4all/2022_03_27/gpt4all_curated_data_full_2022_03_27.tar.gz)
  - Explorer: https://atlas.nomic.ai/map/gpt4all_data_clean

We are not distributing a LLaMa 7B checkpoint.

You can reproduce our trained model by doing the following:

## Setup

Clone the repo

`git clone --recurse-submodules git@github.com:nomic-ai/gpt4all.git`

`git submodule configure && git submodule update`

Setup the environment

```
python -m pip install -r requirements.txt

cd transformers
pip install -e . 

cd ../peft
pip install -e .
```

## Training

```bash
accelerate launch --dynamo_backend=inductor --num_processes=8 --num_machines=1 --machine_rank=0 --deepspeed_multinode_launcher standard --mixed_precision=bf16  --use_deepspeed --deepspeed_config_file=configs/deepspeed/ds_config.json train.py --config configs/train/finetune-7b.yaml
```

## Generate

```bash
python generate.py --config configs/generate/generate.yaml --prompt "Write a script to reverse a string in Python
```

# Sample Generations

### Provide instructions for the given exercise. Leg Raises
>Leg Raises
>1. Stand with your feet shoulder-width apart and your knees slightly bent.
>2. Raise your arms straight out in front of you.
>3. Slowly bend your knees and raise your heels off the ground.
>4. Hold this position for a count of 3, then slowly lower your heels back to the ground.
>5. Repeat the exercise 10 times.
>6. Do not let your knees go beyond your toes.
>7. Keep your back straight and your abdominal muscles engaged.
>8. Do not let your hips sag or your back arch.
>9. Keep your arms straight and your elbows locked.
>10. Do not let your head drop or your shoulders rise.
>11. Breathe normally throughout the exercise.
>12. Rest for 30 seconds before repeating the exercise.
>13. Gradually increase the number of repetitions as you become stronger.


### A color description has been provided. Find the CSS code associated with that color. A light red color with a medium light shade of pink

>The CSS code associated with the color provided is: #FF6347


### Come up with an interesting idea for a new movie plot. Your plot should be described with a title and a summary.

>1. The title of your movie plot should be "The Last Stand". 2. The summary of your movie plot should be: A group of mercenaries are hired to protect a small town from a dangerous criminal who is on the run. However, the town's sheriff is not willing to let them take over and decides to take matters into his own hands.


If you utilize this reposistory, models or data in a downstream project, please consider citing it with:
```
@misc{gpt4all,
  author = {Yuvanesh Anand and Zach Nussbaum and Brandon Duderstadt and Benjamin Schmidt and Andriy Mulyar},
  title = {GPT4All: Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/nomic-ai/gpt4all}},
}
```
Initial readme update. 2023-03-28 00:20:59 +00:00			`<h1 align="center">GPT4All</h1>`
Update README.md 2023-03-28 21:06:53 +00:00			`<p align="center">Demo, data and code to train an assistant-style large language model with ~800k GPT-3.5-Turbo Generations based on LLaMa</p>`
Update README.md 2023-03-28 20:04:18 +00:00
			`<p align="center">`
Update README.md 2023-03-28 20:12:30 +00:00			`<a href="https://s3.amazonaws.com/static.nomic.ai/gpt4all/2023_GPT4All_Technical_Report.pdf">:green_book: Technical Report</a>`
Update README.md 2023-03-28 20:04:18 +00:00			`</p>`


Update README.md 2023-03-28 19:55:45 +00:00
			`![gpt4all-lora-demo](https://user-images.githubusercontent.com/13879686/228352356-de66ca7a-df70-474e-b929-2e3656165051.gif)`
Update README.md 2023-03-28 21:25:06 +00:00
Update README.md 2023-03-28 21:06:28 +00:00			`Run on M1 Mac (not sped up!)`
feat: generation works 2023-03-25 16:43:27 +00:00
Initial readme update. 2023-03-28 00:20:59 +00:00			`# Try it yourself`
Update README.md 2023-03-28 19:55:45 +00:00
Update README.md 2023-03-28 20:38:45 +00:00			`Clone this repository down and download the CPU quantized gpt4all model.`
Update README.md 2023-03-28 20:21:09 +00:00			`- [gpt4all-quantized](https://s3.amazonaws.com/static.nomic.ai/gpt4all/models/gpt4all-lora-quantized.bin)`
Initial readme update. 2023-03-28 00:20:59 +00:00
Update README.md 2023-03-28 20:38:17 +00:00			Place the quantized model in the `chat` directory and start chatting by running:

Update README.md 2023-03-28 21:02:54 +00:00			- `./chat/gpt4all-lora-quantized-OSX-m1` on M1 Mac/OSX
Update README.md 2023-03-28 20:42:30 +00:00			- `./chat/gpt4all-lora-quantized-linux-x86` on Windows/Linux
feat: generation works 2023-03-25 16:43:27 +00:00
Update README.md 2023-03-28 20:31:53 +00:00			`To compile for custom hardware, see our fork of the [Alpaca C++](https://github.com/zanussbaum/gpt4all.cpp) repo.`
feat: generation works 2023-03-25 16:43:27 +00:00
Update README.md 2023-03-28 21:05:03 +00:00			`Note: the full model on GPU (16GB of RAM required) performs much better in our qualitative evaluations.`
Update README.md 2023-03-28 15:56:16 +00:00
			`# Reproducibility`
Update README.md 2023-03-28 16:26:23 +00:00
Update README.md 2023-03-28 20:21:09 +00:00			`Trained LoRa Weights:`
			`- gpt4all-lora: https://huggingface.co/nomic-ai/gpt4all-lora`
			`- gpt4all-lora-epoch-2 https://huggingface.co/nomic-ai/gpt4all-lora-epoch-2`

			`Raw Data:`
			`- [Training Data Without P3](https://s3.amazonaws.com/static.nomic.ai/gpt4all/2022_03_27/gpt4all_curated_data_without_p3_2022_03_27.tar.gz)`
Update README.md 2023-03-28 21:24:46 +00:00			`- Explorer: https://atlas.nomic.ai/map/gpt4all_data_clean_without_p3`
Update README.md 2023-03-28 20:21:09 +00:00			`- [Full Dataset with P3](https://s3.amazonaws.com/static.nomic.ai/gpt4all/2022_03_27/gpt4all_curated_data_full_2022_03_27.tar.gz)`
Update README.md 2023-03-28 21:24:46 +00:00			`- Explorer: https://atlas.nomic.ai/map/gpt4all_data_clean`
Update README.md 2023-03-28 16:26:23 +00:00
Update README.md 2023-03-28 19:39:03 +00:00			`We are not distributing a LLaMa 7B checkpoint.`
Update README.md 2023-03-28 16:26:23 +00:00
Update README.md 2023-03-28 19:32:48 +00:00			`You can reproduce our trained model by doing the following:`
Update README.md 2023-03-28 15:56:16 +00:00
			`## Setup`
feat: generation works 2023-03-25 16:43:27 +00:00
			`Clone the repo`

			`git clone --recurse-submodules git@github.com:nomic-ai/gpt4all.git`

Update README.md 2023-03-28 00:46:14 +00:00			`git submodule configure && git submodule update`
Update README.md 2023-03-28 00:46:24 +00:00
feat: generation works 2023-03-25 16:43:27 +00:00			`Setup the environment`

			```
			`python -m pip install -r requirements.txt`

			`cd transformers`
			`pip install -e .`

			`cd ../peft`
			`pip install -e .`
			```

Update README.md 2023-03-28 20:11:43 +00:00			`## Training`
feat: generation works 2023-03-25 16:43:27 +00:00
Update README.md 2023-03-28 18:52:27 +00:00			```bash
Update README.md 2023-03-28 20:11:43 +00:00			`accelerate launch --dynamo_backend=inductor --num_processes=8 --num_machines=1 --machine_rank=0 --deepspeed_multinode_launcher standard --mixed_precision=bf16 --use_deepspeed --deepspeed_config_file=configs/deepspeed/ds_config.json train.py --config configs/train/finetune-7b.yaml`
Update README.md 2023-03-28 18:52:27 +00:00			```
fix: update to how to train 2023-03-25 21:57:01 +00:00
Update README.md 2023-03-28 20:11:43 +00:00			`## Generate`
fix: update to how to train 2023-03-25 21:57:01 +00:00
Update README.md 2023-03-28 18:52:27 +00:00			```bash
Update README.md 2023-03-28 20:11:43 +00:00			`python generate.py --config configs/generate/generate.yaml --prompt "Write a script to reverse a string in Python`
Update README.md 2023-03-28 18:52:27 +00:00			```
Update README.md 2023-03-28 16:00:25 +00:00
Update README.md 2023-03-28 21:24:46 +00:00			`# Sample Generations`

Update README.md 2023-03-28 21:22:11 +00:00			`### Provide instructions for the given exercise. Leg Raises`
			`>Leg Raises`
			`>1. Stand with your feet shoulder-width apart and your knees slightly bent.`
			`>2. Raise your arms straight out in front of you.`
			`>3. Slowly bend your knees and raise your heels off the ground.`
			`>4. Hold this position for a count of 3, then slowly lower your heels back to the ground.`
			`>5. Repeat the exercise 10 times.`
			`>6. Do not let your knees go beyond your toes.`
			`>7. Keep your back straight and your abdominal muscles engaged.`
			`>8. Do not let your hips sag or your back arch.`
			`>9. Keep your arms straight and your elbows locked.`
			`>10. Do not let your head drop or your shoulders rise.`
			`>11. Breathe normally throughout the exercise.`
			`>12. Rest for 30 seconds before repeating the exercise.`
			`>13. Gradually increase the number of repetitions as you become stronger.`


			`### A color description has been provided. Find the CSS code associated with that color. A light red color with a medium light shade of pink`

			`>The CSS code associated with the color provided is: #FF6347`


			`### Come up with an interesting idea for a new movie plot. Your plot should be described with a title and a summary.`

			`>1. The title of your movie plot should be "The Last Stand". 2. The summary of your movie plot should be: A group of mercenaries are hired to protect a small town from a dangerous criminal who is on the run. However, the town's sheriff is not willing to let them take over and decides to take matters into his own hands.`
Update README.md 2023-03-28 16:00:25 +00:00

			`If you utilize this reposistory, models or data in a downstream project, please consider citing it with:`
			```
			`@misc{gpt4all,`
Update README.md 2023-03-28 18:50:27 +00:00			`author = {Yuvanesh Anand and Zach Nussbaum and Brandon Duderstadt and Benjamin Schmidt and Andriy Mulyar},`
Update README.md 2023-03-28 16:00:25 +00:00			`title = {GPT4All: Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo},`
			`year = {2023},`
			`publisher = {GitHub},`
			`journal = {GitHub repository},`
			`howpublished = {\url{https://github.com/nomic-ai/gpt4all}},`
			`}`
			```