llmodel: add FIXMEs to recalculateContext

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
2024-10-01 01:06:10 -04:00 · 2024-07-31 17:24:16 -04:00 · 3832d140c9
commit 3832d140c9
parent 75bd250845
1 changed file with 3 additions and 0 deletions
--- a/gpt4all-backend/llmodel_shared.cpp
+++ b/gpt4all-backend/llmodel_shared.cpp
@@ -15,6 +15,9 @@
 #include <vector>

 // TODO(cebtenzzre): replace this with llama_kv_cache_seq_shift for llamamodel (GPT-J needs this as-is)
+// FIXME(jared): if recalculate returns false, we leave n_past<tokens.size() and do not tell the caller to stop
+// FIXME(jared): if we get here during chat name or follow-up generation, bad things will happen when we try to restore
+// the old prompt context afterwards
 void LLModel::recalculateContext(PromptContext &promptCtx, std::function<bool(bool)> recalculate)
 {
    int n_keep = shouldAddBOS();
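
Note on the first FIXME: the sketch below is a simplified, hypothetical model of the failure mode it describes, not the code in llmodel_shared.cpp. `SketchContext`, `recalculateSketch`, and `abortedRunIsReported` are made-up names, and token evaluation is left out. It assumes that the callback returning false means "stop", and shows one possible way to surface that to the caller: return a bool so that an aborted rebuild (n_past < tokens.size()) can be told apart from a complete one.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical stand-in for PromptContext; not the real gpt4all type.
struct SketchContext {
    std::vector<int32_t> tokens;
    int32_t n_past  = 0;
    int32_t n_batch = 2;
};

// Re-processes tokens in batches, starting after the first n_keep tokens.
// onBatch plays the role of the `recalculate` callback: it is called with true
// after each batch and may return false to request a stop.
// Returns true only if every token was re-processed; on false, the caller
// knows that ctx.n_past < ctx.tokens.size() and should stop generating.
bool recalculateSketch(SketchContext &ctx, int32_t n_keep,
                       const std::function<bool(bool)> &onBatch)
{
    assert(ctx.n_batch > 0);
    assert(n_keep >= 0 && std::size_t(n_keep) <= ctx.tokens.size());

    ctx.n_past = n_keep;
    std::size_t i = std::size_t(n_keep);
    bool completed = true;

    while (i < ctx.tokens.size()) {
        std::size_t batch_end = std::min(i + std::size_t(ctx.n_batch), ctx.tokens.size());
        // A real backend would evaluate tokens [i, batch_end) here.
        ctx.n_past += int32_t(batch_end - i);
        if (!onBatch(true)) {
            completed = false; // aborted: n_past may lag behind tokens.size()
            break;
        }
        i = batch_end;
    }

    onBatch(false); // signal that recalculation has ended, finished or not
    return completed;
}

// Usage example: the callback asks to stop after the second batch.
bool abortedRunIsReported()
{
    SketchContext ctx;
    ctx.tokens = {1, 2, 3, 4, 5, 6, 7};
    int calls = 0;
    bool ok = recalculateSketch(ctx, 1, [&](bool isRecalc) {
        return !isRecalc || ++calls < 2;
    });
    return !ok && ctx.n_past < int32_t(ctx.tokens.size());
}
```

With a void return type, as in the signature shown in the diff, the caller has no such signal. The second FIXME (restoring the old prompt context after chat name or follow-up generation) is a separate problem that this sketch does not address.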