- lab/inference/ollama-qwen-moe/: StatefulSet pinned to a Region A Spark
(label.unbounded.cloud/region=a, hardware-class=dgx-spark-gb10),
basic-auth ingress on /ollama/ path prefix, local-path PVC,
Qwen3 30B-A3B MoE.
- lab/inference/openwebui/: Open WebUI Deployment on AKS amd64 system pool
(kubernetes.azure.com/mode=system + arch=amd64), Azure Disk PVC,
ingress at host root, talks to Ollama via cluster DNS.
- Makefile: lab-w1.1-up aggregates ollama-up + openwebui-up; same for
-down/-status. Per-component targets retained as
lab-w1.1-{ollama,openwebui}-{up,down,status}.
- Public hostnames in both bundles are operator-supplied via
configMapGenerator. Defaults are placeholders (ollama.lab.example.com,
chat.lab.example.com); set your real DNS via:
    kustomize edit set configmap ollama-host --from-literal=host=...
    kustomize edit set configmap open-webui-host --from-literal=host=...
- secret.local.yaml in both bundles is generated locally and gitignored.
Note: Ollama API moved from / to /ollama/ to share a single DNS label
with Open WebUI at /.
Make targets accept LAB_HOST=<fqdn> on the command line, e.g.
    make LAB_HOST=mychat.example.com lab-w1.1-up
The helper backs up each kustomization.yaml, runs
'kustomize edit set configmap <ollama-host|open-webui-host>
--from-literal=host=$LAB_HOST', applies the bundle, then restores the
file (a trap restores it even on Ctrl-C). Without LAB_HOST, the targets
warn and apply the placeholder defaults (ollama.lab.example.com /
chat.lab.example.com), which are fine for kustomize-render dry runs but
produce no working public endpoint.
READMEs updated: lab/README.md (top-level public-hostname section),
lab/inference/ollama-qwen-moe/README.md, lab/inference/openwebui/README.md.
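The backup-and-restore behaviour described for the LAB_HOST helper can be sketched roughly as follows. This is an illustration only, not the repository's actual Makefile code: the function name `with_restored_file` and the `.orig.$$` backup suffix are hypothetical.

```shell
# Hypothetical helper (not the repository's actual code): back up a file,
# run a command that may edit it, and always put the original back.
# The body runs in a subshell, so the traps do not leak into the caller.
with_restored_file() (
    file=$1
    shift
    backup="$file.orig.$$"
    cp -p "$file" "$backup" || exit 1
    # EXIT fires on normal completion and after the INT/TERM handlers below,
    # so the original is restored on success, failure, and Ctrl-C alike.
    trap 'mv -f "$backup" "$file"' EXIT
    trap 'exit 130' INT
    trap 'exit 143' TERM
    # Run the wrapped command; its exit status becomes the function's status.
    "$@"
)
```

In this sketch the wrapped command would typically edit the host literal and then run `kubectl apply -k` on the bundle directory; the file is back to its committed state once the function returns.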
Use sed to rewrite the 'host=' literal in kustomization.yaml instead of
'kustomize edit set configmap'. The standalone kustomize CLI is not
installed by default, and 'kubectl kustomize' has no edit subcommand.
The trap-based backup/restore is unchanged.
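A minimal sketch of a sed-based rewrite of the 'host=' literal is shown here. It assumes the literal sits on its own list line under configMapGenerator (for example `      - host=chat.lab.example.com`); the real file layout is not shown in this conversation, and the function name `set_host_literal` is hypothetical.

```shell
# Hypothetical helper: replace the value of the "host=" literal in a
# kustomization.yaml. Assumes the literal is an unquoted list item on its
# own line, e.g. "      - host=ollama.lab.example.com".
set_host_literal() {
    file=$1
    host=$2
    # Accept only hostname characters so the value is safe inside the sed script.
    case $host in
        '' | *[!A-Za-z0-9.-]*)
            echo "invalid host: $host" >&2
            return 1
            ;;
    esac
    tmp="$file.tmp.$$"
    # Write to a temp file and rename it; this avoids 'sed -i', whose
    # syntax differs between GNU and BSD sed.
    sed "s|^\([[:space:]]*-[[:space:]]*host=\).*|\1$host|" "$file" > "$tmp" &&
        mv "$tmp" "$file"
}
```

Called as `set_host_literal kustomization.yaml "$LAB_HOST"`, this would leave every other line of the file untouched. A quoted literal such as `- "host=..."` would not match the pattern and would need a different expression.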