a602f7fde7
* remove outdated comments Signed-off-by: limez <limez@protonmail.com> * simpler build from source Signed-off-by: limez <limez@protonmail.com> * update unix build script to create .so runtimes correctly Signed-off-by: limez <limez@protonmail.com> * configure ci build type, use RelWithDebInfo for dev build script Signed-off-by: limez <limez@protonmail.com> * add clean script Signed-off-by: limez <limez@protonmail.com> * fix streamed token decoding / emoji Signed-off-by: limez <limez@protonmail.com> * remove deprecated nCtx Signed-off-by: limez <limez@protonmail.com> * update typings Signed-off-by: jacob <jacoobes@sern.dev> update typings Signed-off-by: jacob <jacoobes@sern.dev> * readme,mspell Signed-off-by: jacob <jacoobes@sern.dev> * cuda/backend logic changes + name napi methods like their js counterparts Signed-off-by: limez <limez@protonmail.com> * convert llmodel example into a test, separate test suite that can run in ci Signed-off-by: limez <limez@protonmail.com> * update examples / naming Signed-off-by: limez <limez@protonmail.com> * update deps, remove the need for binding.ci.gyp, make node-gyp-build fallback easier testable Signed-off-by: limez <limez@protonmail.com> * make sure the assert-backend-sources.js script is published, but not the others Signed-off-by: limez <limez@protonmail.com> * build correctly on windows (regression on node-gyp-build) Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * codespell Signed-off-by: limez <limez@protonmail.com> * make sure dlhandle.cpp gets linked correctly Signed-off-by: limez <limez@protonmail.com> * add include for check_cxx_compiler_flag call during aarch64 builds Signed-off-by: limez <limez@protonmail.com> * x86 > arm64 cross compilation of runtimes and bindings Signed-off-by: limez <limez@protonmail.com> * default to cpu instead of kompute on arm64 Signed-off-by: limez <limez@protonmail.com> * formatting, more minimal example Signed-off-by: limez <limez@protonmail.com> --------- Signed-off-by: limez <limez@protonmail.com> Signed-off-by: jacob <jacoobes@sern.dev> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> Co-authored-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> Co-authored-by: jacob <jacoobes@sern.dev> |
||
---|---|---|
.circleci | ||
.github | ||
gpt4all-backend | ||
gpt4all-bindings | ||
gpt4all-chat | ||
gpt4all-training | ||
.codespellrc | ||
.gitignore | ||
.gitmodules | ||
CONTRIBUTING.md | ||
gpt4all-lora-demo.gif | ||
LICENSE.txt | ||
README.md |
GPT4All
Privacy-oriented software for chatting with large language models that run on your own computer.
Official Website • Documentation • Discord
Official Download Links: Windows — macOS — Ubuntu
NEW: Subscribe to our mailing list for updates and news!
GPT4All is made possible by our compute partner Paperspace.
Run on an M2 MacBook Pro (not sped up!)
About GPT4All
GPT4All is an ecosystem to run powerful and customized large language models that work locally on consumer grade CPUs and NVIDIA and AMD GPUs. Note that your CPU needs to support AVX instructions.
Learn more in the documentation.
A GPT4All model is a 3GB - 8GB file that you can download and plug into the GPT4All software. Nomic AI supports and maintains this software ecosystem to enforce quality and security alongside spearheading the effort to allow any person or enterprise to easily deploy their own on-edge large language models.
Installation
The recommended way to install GPT4All is to use one of the online installers linked above in this README, which are also available at the GPT4All website. These require an internet connection at install time, are slightly easier to use on macOS due to code signing, and provide a version of GPT4All that can check for updates.
An alternative way to install GPT4All is to use one of the offline installers available on the Releases page. These do not require an internet connection at install time, and can be used to install an older version of GPT4All if so desired. But using these requires acknowledging a security warning on macOS, and they provide a version of GPT4All that is unable to notify you of updates, so you should enable notifications for Releases on this repository (Watch > Custom > Releases) or sign up for announcements in our Discord server.
What's New
- October 19th, 2023: GGUF Support Launches with Support for:
- Mistral 7b base model, an updated model gallery on gpt4all.io, several new local code models including Rift Coder v1.5
- Nomic Vulkan support for Q4_0 and Q4_1 quantizations in GGUF.
- Offline build support for running old versions of the GPT4All Local LLM Chat Client.
- September 18th, 2023: Nomic Vulkan launches supporting local LLM inference on NVIDIA and AMD GPUs.
- July 2023: Stable support for LocalDocs, a feature that allows you to privately and locally chat with your data.
- June 28th, 2023: Docker-based API server launches allowing inference of local LLMs from an OpenAI-compatible HTTP endpoint.
Building From Source
- Follow the instructions here to build the GPT4All Chat UI from source.
Bindings
Integrations
Contributing
GPT4All welcomes contributions, involvement, and discussion from the open source community! Please see CONTRIBUTING.md and follow the issues, bug reports, and PR markdown templates.
Check project discord, with project owners, or through existing issues/PRs to avoid duplicate work.
Please make sure to tag all of the above with relevant project identifiers or your contribution could potentially get lost.
Example tags: backend
, bindings
, python-bindings
, documentation
, etc.
GPT4All 2024 Roadmap
To contribute to the development of any of the below roadmap items, make or find the corresponding issue and cross-reference the in-progress task.
Each item should have an issue link below.
-
Chat UI Language Localization (localize UI into the native languages of users)
- Chinese
- German
- French
- Portuguese
- Your native language here.
-
UI Redesign: an internal effort at Nomic to improve the UI/UX of gpt4all for all users.
- Design new user interface and gather community feedback
- Implement the new user interface and experience.
-
Installer and Update Improvements
- Seamless native installation and update process on OSX
- Seamless native installation and update process on Windows
- Seamless native installation and update process on Linux
-
Model discoverability improvements:
- Support huggingface model discoverability
- Support Nomic hosted model discoverability
-
LocalDocs (towards a local perplexity)
- Multilingual LocalDocs Support
- Create a multilingual experience
- Incorporate a multilingual embedding model
- Specify a preferred multilingual LLM for localdocs
- Improved RAG techniques
- Query augmentation and re-writing
- Improved chunking and text extraction from arbitrary modalities
- Custom PDF extractor past the QT default (charts, tables, text)
- Faster indexing and local exact search with v1.5 hamming embeddings and reranking (skip ANN index construction!)
- Support queries like 'summarize X document'
- Multimodal LocalDocs support with Nomic Embed
- Nomic Dataset Integration with real-time LocalDocs
- Include an option to allow the export of private LocalDocs collections to Nomic Atlas for debugging data/chat quality
- Allow optional sharing of LocalDocs collections between users.
- Allow the import of a LocalDocs collection from an Atlas Datasets
- Chat with live version of Wikipedia, Chat with Pubmed, chat with the latest snapshot of world news.
- Multilingual LocalDocs Support
-
First class Multilingual LLM Support
- Recommend and set a default LLM for German
- Recommend and set a default LLM for English
- Recommend and set a default LLM for Chinese
- Recommend and set a default LLM for Spanish
-
Server Mode improvements
- Improved UI and new requested features:
- Fix outstanding bugs and feature requests around networking configurations.
- Support Nomic Embed inferencing
- First class documentation
- Improving developer use and quality of server mode (e.g. support larger batches)
- Improved UI and new requested features:
Technical Reports
📗 Technical Report 3: GPT4All Snoozy and Groovy
📗 Technical Report 2: GPT4All-J
Citation
If you utilize this repository, models or data in a downstream project, please consider citing it with:
@misc{gpt4all,
author = {Yuvanesh Anand and Zach Nussbaum and Brandon Duderstadt and Benjamin Schmidt and Andriy Mulyar},
title = {GPT4All: Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo},
year = {2023},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/nomic-ai/gpt4all}},
}