2023-06-28 14:28:52 -04:00
|
|
|
"""
|
|
|
|
Use the OpenAI python API to test gpt4all models.
|
|
|
|
"""
|
2023-08-10 10:43:07 -04:00
|
|
|
from typing import List, get_args
|
2023-11-11 23:27:32 -05:00
|
|
|
import os
|
|
|
|
from dotenv import load_dotenv
|
2023-08-10 10:43:07 -04:00
|
|
|
|
2023-06-28 14:28:52 -04:00
|
|
|
import openai
|
2023-07-21 15:13:29 -04:00
|
|
|
|
2023-06-28 14:28:52 -04:00
|
|
|
# Point the OpenAI client at the local gpt4all API server.  The local server
# does not authenticate, but the client library requires some key to be set.
openai.api_base = "http://localhost:4891/v1"
openai.api_key = "not needed for a local LLM"

# Load the .env file
env_path = 'gpt4all-api/gpt4all_api/.env'
load_dotenv(dotenv_path=env_path)

# Fetch MODEL_ID from .env file
model_id = os.getenv('MODEL_BIN', 'default_model_id')
embedding = os.getenv('EMBEDDING', 'default_embedding_model_id')

# Echo the resolved model names so test logs show which models were exercised.
print(model_id)
print(embedding)
|
2023-06-28 14:28:52 -04:00
|
|
|
|
|
|
|
def test_completion():
    """A non-streaming completion must return more text than the echoed prompt."""
    question = "Who is Michael Jordan?"
    result = openai.Completion.create(
        model=model_id,
        prompt=question,
        max_tokens=50,
        temperature=0.28,
        top_p=0.95,
        n=1,
        echo=True,
        stream=False,
    )
    # echo=True includes the prompt in the output, so any generated text
    # makes the returned string strictly longer than the prompt itself.
    assert len(result['choices'][0]['text']) > len(question)
|
2023-07-05 23:17:30 -04:00
|
|
|
|
|
|
|
def test_streaming_completion():
    """A streaming completion must yield chunks whose joined text exceeds the prompt."""
    question = "Who is Michael Jordan?"
    stream = openai.Completion.create(
        model=model_id,
        prompt=question,
        max_tokens=50,
        temperature=0.28,
        top_p=0.95,
        n=1,
        echo=True,
        stream=True,
    )
    # Collect the text fragment carried by each streamed response chunk.
    pieces = [chunk.choices[0].text for chunk in stream]

    assert (len(pieces) > 0)
    # echo=True means the prompt is streamed back too, so the concatenation
    # of all fragments must be strictly longer than the prompt.
    assert (len("".join(pieces)) > len(question))
|
|
|
|
|
2023-11-11 23:27:32 -05:00
|
|
|
# Modified test batch, problems with keyerror in response
|
2023-07-21 15:13:29 -04:00
|
|
|
# Modified test batch, problems with keyerror in response
def test_batched_completion():
    """Three sequential completions must each echo the prompt plus extra text."""
    question = "Who is Michael Jordan?"

    # Issue the requests one at a time rather than as a single batched call
    # (batching previously triggered KeyErrors in the response handling).
    replies = [
        openai.Completion.create(
            model=model_id,
            prompt=question,
            max_tokens=50,
            temperature=0.28,
            top_p=0.95,
            n=1,
            echo=True,
            stream=False,
        )
        for _ in range(3)
    ]

    # Every individual response must contain the echoed prompt plus output.
    for reply in replies:
        assert len(reply['choices'][0]['text']) > len(question)

    assert len(replies) == 3
|
2023-08-10 10:43:07 -04:00
|
|
|
|
|
|
|
def test_embedding():
    """The embedding endpoint must return a list of floats for the configured model."""
    text = "Who is Michael Jordan?"
    result = openai.Embedding.create(model=embedding, input=text)
    vector = result["data"][0]["embedding"]
    # get_args(List[float]) evaluates to (float,), which isinstance accepts
    # directly as a tuple of allowed types.
    float_types = get_args(List[float])

    # The server must report back the model that was actually requested.
    assert result["model"] == embedding
    assert isinstance(vector, list)
    assert all(isinstance(value, float_types) for value in vector)
|