-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
test(sdk, api): fix finish reason, chat, audio and completion tests #1038
base: main
Are you sure you want to change the base?
Conversation
✅ Deploy Preview for leapfrogai-docs canceled.
|
There seems to be an issue with our GPU runner configurations that is blocking the vLLM E2E tests from running: https://github.com/defenseunicorns/leapfrogai/actions/runs/10894219888/job/30230952202 The upstream UDS Common action is unable to install UDS CLI into the bin directory. It seems like it needs |
…nt-e2e-testing-for-vllm
…//github.com/defenseunicorns/leapfrogai into 1037-testvllm-implement-e2e-testing-for-vllm
70f58ad
to
a1a03c1
Compare
This reverts commit ace4db1.
Description
Creates and enables vLLM E2E tests that touch each SDK gRPC endpoint at least once with a "happy path" test. Standardizes the new
Completions
test from vLLM's E2E tests to the LLaMA-CPP-Python E2E tests. Fixes theFinishReason
enum
typing issues acrossCompletions
andChatCompletions
.BREAKING CHANGES
FinishReason
to be an enum in both Completions and ChatCompletions protobufsEnum
class to define and transform the stub responsesobject
andcreated
field for Completions, using Literaltext_completion
as defined in the OpenAI API specificationCHANGES
FinishReason
being the wrong typeRelated Issue
Fixes #1037
Relates to #854
Checklist before merging