test(sdk, api): fix finish reason, chat, audio and completion tests #1038

justinthelaw · 2024-09-17T01:07:06Z

Description

Creates and enables vLLM E2E tests that touch each SDK gRPC endpoint at least once with a "happy path" test. Standardizes the new Completions test from vLLM's E2E tests to the LLaMA-CPP-Python E2E tests. Fixes the FinishReason enum typing issues across Completions and ChatCompletions.

BREAKING CHANGES

fixes FinishReason to be an enum in both Completions and ChatCompletions protobufs
- modifies API gRPC handler, typing utils, and helper utils to use a Enum class to define and transform the stub responses
fixes object and created field for Completions, using Literal text_completion as defined in the OpenAI API specification

CHANGES

adds and enables simple ("happy path") E2E vLLM testing for local environments only (GPU runner is broken, see commit history)
adds Completions E2E test to llama-cpp-python and vLLM to catch more potential issues, like FinishReason being the wrong type
adds Audio and Completions unit testing
condenses tests into a single test file with an ENV parser (default, warning and helper text included) for model name
adds more comprehensive Make target for test and typing artifact clean-up
improves testing documentation and Makefile targets

Related Issue

Fixes #1037

Relates to #854

Checklist before merging

Tests, documentation, ADR added or updated as needed
Followed the Contributor Guide Steps

netlify · 2024-09-17T01:07:22Z

✅ Deploy Preview for leapfrogai-docs canceled.

Name	Link
🔨 Latest commit	`cac0fdb`
🔍 Latest deploy log	https://app.netlify.com/sites/leapfrogai-docs/deploys/66eca5b299eeee000876a768

justinthelaw · 2024-09-17T01:08:55Z

There seems to be an issue with our GPU runner configurations that is blocking the vLLM E2E tests from running: https://github.com/defenseunicorns/leapfrogai/actions/runs/10894219888/job/30230952202

The upstream UDS Common action is unable to install UDS CLI into the bin directory. It seems like it needs sudo or higher permissions, even though that is not usually required in our CPU (large or regular) runners. As a side note, the vLLM E2E tests run locally.

…nt-e2e-testing-for-vllm

…//github.com/defenseunicorns/leapfrogai into 1037-testvllm-implement-e2e-testing-for-vllm

This reverts commit ace4db1.

fix FinishReason, add vLLM E2E

79272d1

justinthelaw added tech-debt Not a feature, but still necessary blocked 🛑 Something needs to happen before this issues is worked labels Sep 17, 2024

justinthelaw added this to the Current - RAG UX Enhancements | Model Directory | API Odds and Ends milestone Sep 17, 2024

justinthelaw self-assigned this Sep 17, 2024

justinthelaw requested a review from a team as a code owner September 17, 2024 01:07

justinthelaw linked an issue Sep 17, 2024 that may be closed by this pull request

chore(vllm): implement e2e testing for vllm #1037

Open

justinthelaw added 4 commits September 16, 2024 21:37

llama completion test, add CompleteStreamChoice

927ad25

condense e2e to 1 file, add max_new_tokens

e9e434f

formatting fix

d8c6767

max_tokens for OpenAI client

29a9785

justinthelaw mentioned this pull request Sep 17, 2024

feat: upgrade vllm backend and refactor deployment #854

Draft

justinthelaw and others added 10 commits September 16, 2024 22:43

fix singular model_name arg

a166c93

isolate model_name to single test

1c63741

fix e2e-llama-cpp-python.yaml

2e82a9f

Update e2e-vllm.yaml

807128e

model_name fixture

e48331f

Merge remote-tracking branch 'origin/main' into 1037-testvllm-impleme…

e88b29f

…nt-e2e-testing-for-vllm

workaround GPU runner issue

8552ce0

workaround GPU runner issue, pt.2

af4e4ca

workaround GPU runner issue, pt.3

5b1532a

workaround GPU runner issue, pt.4

a8551e5

justinthelaw marked this pull request as draft September 17, 2024 18:00

justinthelaw and others added 5 commits September 17, 2024 14:01

temp turn on e2e vllm, add nvidia-smi

5f1b3c1

add nvidia setp

1e7e98c

fix cluster cmd, play with prompt

c46731a

k3d permissions

161fb3a

Update e2e-vllm.yaml

84a0388

justinthelaw added 3 commits September 18, 2024 16:57

revert vllm e2e GPU runner changes

664709b

revert formatting changes

f896e59

e2e tests made easier

ef75a70

justinthelaw changed the title ~~test(vllm): fix FinishReason, standardize and enable vLLM E2E tests~~ fix(test): fix finish reason, chat and completion e2e tests Sep 18, 2024

justinthelaw changed the title ~~fix(test): fix finish reason, chat and completion e2e tests~~ fix(sdk, test): fix finish reason, chat and completion e2e tests Sep 18, 2024

Merge branch 'main' into 1037-testvllm-implement-e2e-testing-for-vllm

2fcac88

justinthelaw marked this pull request as ready for review September 18, 2024 21:36

justinthelaw removed a link to an issue Sep 18, 2024

chore(vllm): implement e2e testing for vllm #1037

Open

justinthelaw added 2 commits September 18, 2024 17:40

e2e test Make target typo

d1d6540

Merge branch '1037-testvllm-implement-e2e-testing-for-vllm' of https:…

2cfd164

…//github.com/defenseunicorns/leapfrogai into 1037-testvllm-implement-e2e-testing-for-vllm

justinthelaw requested review from a team and jalling97 September 18, 2024 22:13

justinthelaw and others added 5 commits September 18, 2024 18:14

revert format e2e-llama-cpp-python.yaml

0568232

fixed Makefile typo

cc7ac6c

attempt merge with main

f335be7

better clean-up

e0c0ac7

add FinishReason enum back in

c90d820

justinthelaw changed the title ~~fix(sdk, test): fix finish reason, chat and completion e2e tests~~ fix(sdk, test): fix finish reason, chat, audsio and completion tests Sep 19, 2024

justinthelaw changed the title ~~fix(sdk, test): fix finish reason, chat, audsio and completion tests~~ fix(sdk, test): fix finish reason, chat, audio and completion tests Sep 19, 2024

justinthelaw changed the title ~~fix(sdk, test): fix finish reason, chat, audio and completion tests~~ test(sdk, api): fix finish reason, chat, audio and completion tests Sep 19, 2024

justinthelaw marked this pull request as draft September 19, 2024 15:03

passing unit tests

a1a03c1

justinthelaw force-pushed the 1037-testvllm-implement-e2e-testing-for-vllm branch from 70f58ad to a1a03c1 Compare September 19, 2024 20:07

justinthelaw and others added 4 commits September 19, 2024 16:07

Merge branch 'main' into 1037-testvllm-implement-e2e-testing-for-vllm

3387974

refactor watchdog

b8799b4

refactor watchdog, pt.1

968da1e

refactor watchdog, pt.2

ace4db1

justinthelaw marked this pull request as ready for review September 19, 2024 22:11

CollectiveUnicorn added 2 commits September 19, 2024 15:27

Revert "refactor watchdog, pt.2"

18f4d4c

This reverts commit ace4db1.

Returns logging

cac0fdb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(sdk, api): fix finish reason, chat, audio and completion tests #1038

test(sdk, api): fix finish reason, chat, audio and completion tests #1038

justinthelaw commented Sep 17, 2024 •

edited

Loading

netlify bot commented Sep 17, 2024 •

edited

Loading

justinthelaw commented Sep 17, 2024

test(sdk, api): fix finish reason, chat, audio and completion tests #1038

Are you sure you want to change the base?

test(sdk, api): fix finish reason, chat, audio and completion tests #1038

Conversation

justinthelaw commented Sep 17, 2024 • edited Loading

Description

BREAKING CHANGES

CHANGES

Related Issue

Checklist before merging

netlify bot commented Sep 17, 2024 • edited Loading

✅ Deploy Preview for leapfrogai-docs canceled.

justinthelaw commented Sep 17, 2024

justinthelaw commented Sep 17, 2024 •

edited

Loading

netlify bot commented Sep 17, 2024 •

edited

Loading