gpt4all: an ecosystem of open-source chatbots trained on a massive collections of clean assistant data including code, stories and dialogue
Go to file
2023-05-30 12:59:00 -04:00
.circleci comment out pip job 2023-05-18 12:02:11 -04:00
.github Paginate through all issues for close_issues workflow (#630) 2023-05-18 14:30:47 -04:00
gpt4all-api mono repo structure 2023-05-01 15:45:23 -04:00
gpt4all-backend Revert "New tokenizer implementation for MPT and GPT-J" 2023-05-30 12:59:00 -04:00
gpt4all-bindings Improved localdocs documentation (#762) 2023-05-30 11:26:34 -04:00
gpt4all-chat This time remember to bump the version right after a release. 2023-05-25 18:26:33 -04:00
gpt4all-docker mono repo structure 2023-05-01 15:45:23 -04:00
gpt4all-training fix(training instructions): model repo name (#728) 2023-05-28 19:56:24 -04:00
.codespellrc Revert "New tokenizer implementation for MPT and GPT-J" 2023-05-30 12:59:00 -04:00
.gitignore Improved documentation landing page (#665) 2023-05-21 23:14:18 -04:00
.gitmodules Move the llmodel C API to new top-level directory and version it. 2023-05-10 11:46:40 -04:00
CONTRIBUTING.md [DATALAD RUNCMD] run codespell throughout 2023-05-16 11:33:59 -04:00
gpt4all-lora-demo.gif GIF 2023-03-28 15:54:44 -04:00
LICENSE.txt Add MIT license. 2023-04-06 11:28:59 -04:00
monorepo_plan.md Update monorepo_plan.md 2023-05-05 09:32:45 -04:00
README.md Update README.md - very minor typo (#688) 2023-05-22 17:14:52 -07:00

GPT4All

Open-source assistant-style large language models that run locally on your CPU

GPT4All Website

GPT4All Documentation

Discord

🦜🔗 Official Langchain Backend

GPT4All is made possible by our compute partner Paperspace.

Run on an M1 Mac (not sped up!)

GPT4All: An ecosystem of open-source on-edge large language models.

GPT4All is an ecosystem to train and deploy powerful and customized large language models that run locally on consumer grade CPUs.

Learn more in the documentation.

The goal is simple - be the best instruction tuned assistant-style language model that any person or enterprise can freely use, distribute and build on.

A GPT4All model is a 3GB - 8GB file that you can download and plug into the GPT4All open-source ecosystem software. Nomic AI supports and maintains this software ecosystem to enforce quality and security alongside spearheading the effort to allow any person or enterprise to easily train and deploy their own on-edge large language models.

Chat Client

Run any GPT4All model natively on your home desktop with the auto-updating desktop chat client. See GPT4All Website for a full list of open-source models you can run with this powerful desktop application.

Direct Installer Links:

If you have older hardware that only supports avx and not avx2 you can use these.

Find the most up-to-date information on the GPT4All Website

Chat Client building and running

  • Follow the visual instructions on the chat client build_and_run page

Bindings

Contributing

GPT4All welcomes contributions, involvement, and discussion from the open source community! Please see CONTRIBUTING.md and follow the issues, bug reports, and PR markdown templates.

Check project discord, with project owners, or through existing issues/PRs to avoid duplicate work. Please make sure to tag all of the above with relevant project identifiers or your contribution could potentially get lost. Example tags: backend, bindings, python-bindings, documentation, etc.

Technical Reports

📗 Technical Report 3: GPT4All Snoozy and Groovy

📗 Technical Report 2: GPT4All-J

📗 Technical Report 1: GPT4All

Citation

If you utilize this repository, models or data in a downstream project, please consider citing it with:

@misc{gpt4all,
  author = {Yuvanesh Anand and Zach Nussbaum and Brandon Duderstadt and Benjamin Schmidt and Andriy Mulyar},
  title = {GPT4All: Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/nomic-ai/gpt4all}},
}