TurboPilot is an open-source, large-language-model-based code completion engine that runs locally on CPU.
TurboPilot
TurboPilot is a fast FauxPilot clone that uses ggml, the library behind llama.cpp, to run the 6-billion-parameter Salesforce CodeGen models in 2GiB of RAM.
Getting Started
git clone https://github.com/ravenscroftj/turbopilot
cd turbopilot
git submodule init
git submodule update
cd ggml
mkdir build
cd build
cmake ..
make codegen codegen-quantize
Getting The Models
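Start by downloading either the 2B or the 6B GPT-J version of CodeGen. The commands in the following steps assume the 6B model, unpacked into a codegen-6B-multi-gptj directory at the repository root.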
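Convert The Model
Run the conversion script from the repository root. The trailing 0 selects float32 output, which is written to ggml-model-f32.bin inside the model directory.
python convert-codegen-to-ggml.py ./codegen-6B-multi-gptj 0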
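Quantize The Model
Run the quantizer from ggml/build. The trailing 2 selects the quantization type; in the ggml example tools this command is based on, 2 appears to correspond to the 4-bit q4_0 format.
./bin/codegen-quantize ../../codegen-6B-multi-gptj/ggml-model-f32.bin ../../codegen-6B-multi-gptj/ggml-model-quant.bin 2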
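To give a feel for what the quantize step does, here is a minimal sketch of block-wise 4-bit quantization in plain Python. It is an illustration only, not ggml's actual code: the block size of 32, the symmetric rounding scheme, and the names quantize_block and dequantize_block are assumptions chosen for clarity.

```python
# Illustrative sketch only: a simplified symmetric 4-bit block quantizer.
# BLOCK_SIZE and both function names are hypothetical, not taken from ggml.
from typing import List, Tuple

BLOCK_SIZE = 32  # assumed number of weights that share one scale


def quantize_block(weights: List[float]) -> Tuple[float, List[int]]:
    """Map floats to integers in [-7, 7] plus one float scale per block."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    # Avoid dividing by zero when every weight in the block is zero.
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    quants = [max(-7, min(7, round(w / scale))) for w in weights]
    return scale, quants


def dequantize_block(scale: float, quants: List[int]) -> List[float]:
    """Recover approximate floats from the scale and the small integers."""
    return [q * scale for q in quants]


if __name__ == "__main__":
    block = [0.01 * (i - 16) for i in range(BLOCK_SIZE)]
    scale, quants = quantize_block(block)
    restored = dequantize_block(scale, quants)
    worst = max(abs(a - b) for a, b in zip(block, restored))
    print(f"scale={scale:.5f} worst-case error={worst:.5f}")
```

In a scheme like this, each weight costs 4 bits plus its share of the per-block scale, which is where the size reduction relative to 32-bit floats comes from. The price is a rounding error of at most half the scale per weight.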
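Run The Model
Still in ggml/build, run the quantized model. Here -t sets the number of threads, -m the model file, and -p the prompt to complete.
./bin/codegen -t 6 -m ../../codegen-6B-multi-gptj/ggml-model-quant.bin -p "def main("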
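The convert, quantize, and run commands above can also be scripted. The sketch below only assembles the argument lists used in this README; the function names, parameter names, and defaults are assumptions for illustration, and nothing here is part of TurboPilot itself.

```python
# Hypothetical helpers that build the argument lists for the README commands.
# Function names, parameter names, and defaults are illustrative only.
from typing import List


def convert_cmd(model_dir: str, ftype: int = 0) -> List[str]:
    """Conversion command, meant to be run from the repository root."""
    return ["python", "convert-codegen-to-ggml.py", model_dir, str(ftype)]


def quantize_cmd(model_dir: str, qtype: int = 2) -> List[str]:
    """Quantization command, meant to be run from ggml/build."""
    return [
        "./bin/codegen-quantize",
        f"{model_dir}/ggml-model-f32.bin",
        f"{model_dir}/ggml-model-quant.bin",
        str(qtype),
    ]


def run_cmd(model_dir: str, prompt: str, threads: int = 6) -> List[str]:
    """Inference command, meant to be run from ggml/build."""
    return [
        "./bin/codegen",
        "-t", str(threads),
        "-m", f"{model_dir}/ggml-model-quant.bin",
        "-p", prompt,
    ]


if __name__ == "__main__":
    # Print the run command exactly as it would be passed to the binary.
    print(run_cmd("../../codegen-6B-multi-gptj", "def main("))
```

Each list can be passed to subprocess.run(cmd, check=True, cwd=...) with cwd set to the directory noted in the docstring. Passing the prompt as a list element avoids shell quoting problems with characters such as the opening parenthesis.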