AI/gpt4all

mirror of https://github.com/nomic-ai/gpt4all.git synced 2024-09-19 15:25:53 +00:00

Jared Van Bortel 6518b33697

llamamodel: use greedy sampling when temp=0 (#2854)

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

2024-08-13 17:04:50 -04:00

8.2 KiB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog.

Unreleased

Added

Use greedy sampling when temperature is set to zero (#2854)
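
The greedy-sampling entry above can be illustrated with a minimal sketch. This is not the project's llamamodel code; `sample_token` and its parameters are hypothetical names chosen for illustration. It assumes the usual convention that logits are divided by the temperature before the softmax, which is undefined at zero, so a temperature of zero is treated as "always pick the most likely token".

```python
import math
import random


def sample_token(logits, temperature, rng=random):
    """Pick a token index from raw logits (hypothetical helper).

    temperature == 0 selects the arg-max (greedy); any positive value
    samples from the temperature-scaled softmax distribution.
    """
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if temperature == 0:
        # Greedy: scaling by 1/0 is undefined, so take the highest logit.
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]
```

With a temperature of zero the result is deterministic, so repeated calls on the same logits return the same index; with a positive temperature, passing a seeded `random.Random` instance as `rng` makes runs reproducible.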

3.2.1 - 2024-08-13

Fixed

Do not initialize Vulkan driver when only using CPU (#2843)
Fix a potential crash on exit when using only CPU on Linux with NVIDIA (does not affect X11) (#2843)
Fix default CUDA architecture list after #2802 (#2855)

3.2.0 - 2024-08-12

Added

Add Qwen2-1.5B-Instruct to models3.json (by @ThiloteE in #2759)
Enable translation feature for seven languages: English, Spanish, Italian, Portuguese, Chinese Simplified, Chinese Traditional, Romanian (#2830)

Changed

Add missing entries to Italian translation (by @Harvester62 in #2783)
Use llama_kv_cache ops to shift context faster (#2781)
Don't stop generating at end of context (#2781)

Fixed

Case-insensitive LocalDocs source icon detection (by @cosmic-snow in #2761)
Fix comparison of pre- and post-release versions for update check and models3.json (#2762, #2772)
Fix several backend issues (#2778)
- Restore leading space removal logic that was incorrectly removed in #2694
- CUDA: Cherry-pick the llama.cpp fix for the DMMV cols requirement, which had caused a crash with long conversations since #2694
Make reverse prompt detection work more reliably and prevent it from breaking output (#2781)
Disallow context shift for chat name and follow-up generation to prevent bugs (#2781)
Explicitly target macOS 12.6 in CI to fix Metal compatibility on older macOS (#2846)
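
The version-comparison fix listed above concerns ordering versions that carry pre- and post-release suffixes. The changelog does not state the suffix format, so the sketch below assumes a hypothetical scheme in which `-rcN` marks a pre-release (sorting before the plain version) and `-devN` marks a post-release build (sorting after it). `version_key` and `is_newer` are illustrative names, not functions from the project.

```python
import re

# Hypothetical suffix scheme, assumed for illustration only:
#   "-rcN"  -> pre-release, sorts before the plain version
#   "-devN" -> post-release, sorts after the plain version
_VERSION_RE = re.compile(r"^(\d+)\.(\d+)\.(\d+)(?:-(rc|dev)(\d+))?$")
_SUFFIX_RANK = {"rc": -1, None: 0, "dev": 1}


def version_key(version):
    """Turn a version string into a tuple that sorts in release order."""
    match = _VERSION_RE.match(version)
    if match is None:
        raise ValueError(f"unrecognized version: {version!r}")
    major, minor, patch, kind, number = match.groups()
    # Compare numerically, then by suffix kind, then by suffix number.
    return (int(major), int(minor), int(patch),
            _SUFFIX_RANK[kind], int(number or 0))


def is_newer(candidate, installed):
    """Return True if candidate should be offered as an update."""
    return version_key(candidate) > version_key(installed)
```

Comparing tuples of integers avoids the pitfalls of plain string comparison, where `3.10.0` would sort before `3.9.0` and a suffixed version would never compare equal to or below its base release.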

3.1.1 - 2024-07-27

Added

Add Llama 3.1 8B Instruct to models3.json (by @3Simplex in #2731 and #2732)
Portuguese (BR) translation (by thiagojramos in #2733)
Support adding arbitrary OpenAI-compatible models by URL (by @supersonictw in #2683)
Support Llama 3.1 RoPE scaling (#2758)

Changed

Add missing entries to Chinese (Simplified) translation (by wuodoo in #2716 and #2749)
Update translation files and add missing paths to CMakeLists.txt (#2735)

3.1.0 - 2024-07-24

Added

Generate suggested follow-up questions (#2634, #2723)
- Also add options for the chat name and follow-up question prompt templates
Scaffolding for translations (#2612)
Spanish (MX) translation (by @jstayco in #2654)
Chinese (Simplified) translation (by mikage in #2657)
Dynamic changes of language and locale at runtime (#2659, #2677)
Romanian translation (by @SINAPSA_IC in #2662)
Chinese (Traditional) translation (by @supersonictw in #2661)
Italian translation (by @Harvester62 in #2700)

Changed

Customize combo boxes and context menus to fit the new style (#2535)
Improve view bar scaling and Model Settings layout (#2520)
Make the logo spin while the model is generating (#2557)
Server: Reply to wrong GET/POST method with HTTP 405 instead of 404 (by @cosmic-snow in #2615)
Update theme for menus (by @3Simplex in #2578)
Move the "stop" button to the message box (#2561)
Build with CUDA 11.8 for better compatibility (#2639)
Make links in latest news section clickable (#2643)
Support translation of settings choices (#2667, #2690)
Improve LocalDocs view's error message (by @cosmic-snow in #2679)
Ignore case of LocalDocs file extensions (#2642, #2684)
Update llama.cpp to commit 87e397d00 from July 19th (#2694, #2702)
- Add support for GPT-NeoX, Gemma 2, OpenELM, ChatGLM, and Jais architectures (all with Vulkan support)
- Add support for DeepSeek-V2 architecture (no Vulkan support)
- Enable Vulkan support for StarCoder2, XVERSE, Command R, and OLMo
Show scrollbar in chat collections list as needed (by @cosmic-snow in #2691)
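
The server entry above distinguishes an unknown path (404) from a known path requested with the wrong method (405). A minimal sketch of that decision follows. It does not reproduce the project's server code: the route table, its paths, and the `status_for` helper are hypothetical. The `Allow` header on the 405 response follows the HTTP specification, which requires it.

```python
from http import HTTPStatus

# Hypothetical route table for illustration: path -> allowed methods.
ROUTES = {
    "/v1/models": {"GET"},
    "/v1/chat/completions": {"POST"},
}


def status_for(method, path, routes=ROUTES):
    """Return (status, extra_headers) for a request line."""
    allowed = routes.get(path)
    if allowed is None:
        # The path itself is unknown.
        return HTTPStatus.NOT_FOUND, {}
    if method.upper() not in allowed:
        # The path exists, but not for this method: tell the client
        # which methods would work.
        headers = {"Allow": ", ".join(sorted(allowed))}
        return HTTPStatus.METHOD_NOT_ALLOWED, headers
    return HTTPStatus.OK, {}
```

Checking the path before the method is what separates the two cases: a client that sends GET to a POST-only endpoint learns that the endpoint exists and how to call it, instead of being told it is missing.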

Removed

Remove support for GPT-J models (#2676, #2693)

Fixed

Fix placement of thumbs-down and datalake opt-in dialogs (#2540)
Select the correct folder with the Linux fallback folder dialog (#2541)
Fix clone button sometimes producing blank model info (#2545)
Fix jerky chat view scrolling (#2555)
Fix "reload" showing for chats with missing models (#2520)
Fix property binding loop warning (#2601)
Fix UI hang with certain chat view content (#2543)
Fix crash when Kompute falls back to CPU (#2640)
Fix several Vulkan resource management issues (#2694)
Fix crash/hang when some models stop generating, by showing special tokens (#2701)
