turbopilot/BUILD.md

# Build TurboPilot

TurboPilot is a C++ program that uses the [GGML](https://github.com/ggerganov/ggml) project to parse and run language models.

### Dependencies

To build turbopilot you will need CMake, Libboost, a C++ toolchain and GNU Make.

#### Ubuntu

On Ubuntu you can install these things with:

```bash
sudo apt-get update
sudo apt-get install libboost-dev cmake build-essential
```

#### MacOS

If you use [brew](https://brew.sh/) you can simply add these dependencies by running:

```bash
brew install cmake boost
```

### Checkout Submodules

Make sure the ggml subproject is checked out with `git submodule init` and `git submodule update`

### Prepare and Build

Configure cmake to build the project with the following:

```bash
mkdir build
cd build
cmake ..
```

If you are running on linux you can optionally compile a static build with `cmake -D CMAKE_EXE_LINKER_FLAGS="-static" ..` which should make your binary portable across different flavours of the OS.

From here you can now build the components that make up turbopilot by running:

```bash
make
```

### Building with OpenBLAS


[BLAS](https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms) libraries accelerate mathematical operations. You can use the OpenBLAS implementation with Turbopilot to make generation faster - particularly for longer prompts.

When you run cmake, you can additionally set `-D GGML_OPENBLAS=On` to enable BLAS support.

E.g. `cmake .. -D GGML_OPENBLAS=On`

### Building with CuBLAS

CuBLAS is the [BLAS](https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms) library provided by nvidia that runs linear algebra code on your GPU. This can speed up the application significantly, especially when working with long prompts.

#### Install Cuda SDK for your Operating System

You will need `nvcc` and the `libcublas-dev` dependencies as a bare minimum. Follow the guide from nvidia [here](https://docs.nvidia.com/cuda/cuda-installation-guide-linux/) for more detailed installation instructions.

#### Configuring Cmake with CuBLAS

You will need to set `-DGGML_CUBLAS=ON` and also pass the path to your `nvcc` executable with `-DCMAKE_CUDA_COMPILER=/path/to/nvcc`.

Full example: `cmake -DGGML_CUBLAS=ON -DCMAKE_CUDA_COMPILER=/usr/local/cuda/bin/nvcc ..`
add build markdown 2023-04-10 05:19:46 -04:00			`# Build TurboPilot`

			`TurboPilot is a C++ program that uses the [GGML](https://github.com/ggerganov/ggml) project to parse and run language models.`

			`### Dependencies`

			`To build turbopilot you will need CMake, Libboost, a C++ toolchain and GNU Make.`

added some guidance for building on mac to build.md 2023-04-12 02:36:03 -04:00			`#### Ubuntu`

add build markdown 2023-04-10 05:19:46 -04:00			`On Ubuntu you can install these things with:`

			```bash
			`sudo apt-get update`
			`sudo apt-get install libboost-dev cmake build-essential`
			```

added some guidance for building on mac to build.md 2023-04-12 02:36:03 -04:00			`#### MacOS`

			`If you use [brew](https://brew.sh/) you can simply add these dependencies by running:`

			```bash
			`brew install cmake boost`
			```

add build markdown 2023-04-10 05:19:46 -04:00			`### Checkout Submodules`

			Make sure the ggml subproject is checked out with `git submodule init` and `git submodule update`

			`### Prepare and Build`

			`Configure cmake to build the project with the following:`

			```bash
update build instructions 2023-08-05 04:22:15 -04:00			`mkdir build`
			`cd build`
added some guidance for building on mac to build.md 2023-04-12 02:36:03 -04:00			`cmake ..`
add build markdown 2023-04-10 05:19:46 -04:00			```

added some guidance for building on mac to build.md 2023-04-12 02:36:03 -04:00			If you are running on linux you can optionally compile a static build with `cmake -D CMAKE_EXE_LINKER_FLAGS="-static" ..` which should make your binary portable across different flavours of the OS.

update build instructions 2023-08-05 04:22:15 -04:00			`From here you can now build the components that make up turbopilot by running:`
add build markdown 2023-04-10 05:19:46 -04:00
			```bash
update build instructions 2023-08-05 04:22:15 -04:00			`make`
add build markdown 2023-04-10 05:19:46 -04:00			```

add build instructions for blas and cublas 2023-04-23 03:38:17 -04:00			`### Building with OpenBLAS`


			`[BLAS](https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms) libraries accelerate mathematical operations. You can use the OpenBLAS implementation with Turbopilot to make generation faster - particularly for longer prompts.`

			When you run cmake, you can additionally set `-D GGML_OPENBLAS=On` to enable BLAS support.

			E.g. `cmake .. -D GGML_OPENBLAS=On`

			`### Building with CuBLAS`

			`CuBLAS is the [BLAS](https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms) library provided by nvidia that runs linear algebra code on your GPU. This can speed up the application significantly, especially when working with long prompts.`

			`#### Install Cuda SDK for your Operating System`

			You will need `nvcc` and the `libcublas-dev` dependencies as a bare minimum. Follow the guide from nvidia [here](https://docs.nvidia.com/cuda/cuda-installation-guide-linux/) for more detailed installation instructions.

			`#### Configuring Cmake with CuBLAS`

			You will need to set `-DGGML_CUBLAS=ON` and also pass the path to your `nvcc` executable with `-DCMAKE_CUDA_COMPILER=/path/to/nvcc`.

			Full example: `cmake -DGGML_CUBLAS=ON -DCMAKE_CUDA_COMPILER=/usr/local/cuda/bin/nvcc ..`