vScheduler

A Kubernetes scheduler designed for smart scheduling with llmaz.

Plugins

vScheduler maintains multiple plugins for llm workloads scheduling.

A llama2-7B model can be run on 1xA100 GPU, can also be run on 1xA10 GPU, this is what we called fungibility.

With resourceFungibility plugin, we can simply achieve this with at most 8 alternative GPU types.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github		.github
api/config/scheme		api/config/scheme
cmd		cmd
docs/plugins		docs/plugins
hack		hack
pkg/plugins/resource_fungibility		pkg/plugins/resource_fungibility
.dockerignore		.dockerignore
.gitignore		.gitignore
.golangci.yml		.golangci.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Makefile		Makefile
OWNERS		OWNERS
README.md		README.md
go.mod		go.mod
go.sum		go.sum