
TurboPilot

TurboPilot is a self-hosted copilot clone which uses the library behind llama.cpp to run the 6-billion-parameter Salesforce CodeGen models locally on CPU in 4 GiB of RAM. It is heavily based on and inspired by the fauxpilot project.

[Screen recording of TurboPilot running through the fauxpilot plugin]

Getting Started

Clone the repository, fetch the ggml submodule and build the codegen and codegen-quantize tools:

git clone https://github.com/ravenscroftj/turbopilot
cd turbopilot
git submodule init
git submodule update
cd ggml
mkdir build
cd build
cmake ..
make codegen codegen-quantize
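
If the build succeeds, the codegen and codegen-quantize binaries end up in the build's bin directory, which is the path the later commands assume. A quick check from the build directory:

ls ./bin/
# codegen and codegen-quantize should both be listed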

Getting The Models

Start by downloading either the 2B or 6B GPT-J-converted version of CodeGen. The commands below assume the 6B multi-language variant, unpacked into a codegen-6B-multi-gptj directory at the repository root.
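
A minimal download sketch, assuming the converted checkpoint is published as a Git LFS repository; the actual download location is not given here, so substitute it for the placeholder URL:

# hypothetical download of the GPT-J-converted 6B checkpoint into the directory the later commands expect
git lfs install
git clone <url-of-the-gptj-converted-codegen-checkpoint> ./codegen-6B-multi-gptj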

Convert The Model

From the repository root, convert the downloaded checkpoint to ggml format. The trailing 0 makes the script write the weights as 32-bit floats, which is why the next step reads ggml-model-f32.bin:

python convert-codegen-to-ggml.py ./codegen-6B-multi-gptj 0

Quantize the model

From the ggml/build directory, quantize the f32 model down to 4 bits; the final argument 2 selects 4-bit quantization (q4_0 in ggml's numbering), which is what brings the 6B model down to around 4 GiB:

./bin/codegen-quantize ../../codegen-6B-multi-gptj/ggml-model-f32.bin ../../codegen-6B-multi-gptj/ggml-model-quant.bin 2
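
As a rough sanity check (file paths taken from the commands above): a 6B-parameter model is about 24 GB at 32-bit precision, so the 4-bit quantized file should come out on the order of 4 GB:

# compare the sizes of the full-precision and quantized models (run from ggml/build)
ls -lh ../../codegen-6B-multi-gptj/ggml-model-f32.bin ../../codegen-6B-multi-gptj/ggml-model-quant.bin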

Run the model

Run the codegen binary against the quantized model: -t sets the number of CPU threads, -m points at the quantized model file and -p is the prompt to complete:

./bin/codegen -t 6 -m ../../codegen-6B-multi-gptj/ggml-model-quant.bin -p "def main("
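
The same pipeline works for the smaller 2B model. A minimal sketch, assuming the GPT-J-converted 2B checkpoint has been downloaded into a codegen-2B-multi-gptj directory at the repository root (that directory name is an assumption, by analogy with the 6B commands above):

# convert, quantize and run the 2B model (directory name assumed by analogy with the 6B example)
python convert-codegen-to-ggml.py ./codegen-2B-multi-gptj 0
cd ggml/build
./bin/codegen-quantize ../../codegen-2B-multi-gptj/ggml-model-f32.bin ../../codegen-2B-multi-gptj/ggml-model-quant.bin 2
./bin/codegen -t 6 -m ../../codegen-2B-multi-gptj/ggml-model-quant.bin -p "def main("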