turbopilot/README.md
James Ravenscroft f8713c089b add readme
2023-04-09 17:56:27 +01:00

1.0 KiB

TurboPilot

TurboPilot is a super fast fauxpilot clone which uses the library behind llama.cpp to run huge 6 Billion Parameter Salesforce Codegen models in 2GiB of RAM.

Getting Started

git clone https://github.com/ravenscroftj/turbopilot
git submodule init
cd ggml
mkdir build
cd build
cmake ..
make codegen codegen-quantize

Getting The Models

Start by downloading either the 2B or 6B GPT-J versions of CodeGen.

Convert The Model

python convert-codegen-to-ggml.py ./codegen-6B-multi-gptj 0

Quantize the model

./bin/codegen-quantize ../../codegen-6B-multi-gptj/ggml-model-f32.bin ../../codegen-6B-multi-gptj/ggml-model-quant.bin 2

Run the model

./bin/codegen -t 6 -m ../../codegen-6B-multi-gptj/ggml-model-quant.bin -p "def main("