Add SLO checks with a SQLAlchemy read/write workload by vgvoleg · Pull Request #116 · ydb-platform/ydb-sqlalchemy

vgvoleg · 2026-06-17T09:45:22Z

What

Adds SLO (Service Level Objective) testing on top of ydb-platform/ydb-slo-action, following the ydb-python-sdk SLO example but expressed entirely in terms of SQLAlchemy.

Workload (`tests/slo/`)

A parallel read/write load generator driving the ydb_sqlalchemy dialect:

read — SELECT ... WHERE object_id = :id for a random id;
write — UPSERT INTO ... VALUES (...) for a fresh id;
dedicated reader/writer thread pools plus a metrics thread;
every operation is wrapped in an idempotent retry loop, so transient errors injected by the action's chaos layer become latency rather than availability drops.

Two modes, selected by WORKLOAD_NAME / --mode:

mode	read	write
`core`	`Connection.execute(select())`	`Connection.execute(upsert())`
`orm`	`Session.get(KeyValueRow, id)`	`Session.execute(upsert())` + commit

Metrics are emitted via OTLP with names matching the action's default metrics.yaml (sdk_operations_total, sdk_operation_latency_p{50,95,99}_seconds, sdk_retry_attempts_total, ...).

Workflow (`.github/workflows/slo.yml`)

Runs on PRs labelled SLO:

builds current (PR) and baseline (merge-base) workload images;
runs ydb-slo-action/init@v2 for the core and orm workloads in parallel;
publishes a current-vs-baseline comparison with ydb-slo-action/report@v2 and gates the PR on regressions.

The cluster is trimmed to fit a GitHub-hosted runner via disable_compose_profiles: extra-nodes (chaos and telemetry stay enabled).

How to run

Label this PR with SLO to trigger the checks. Locally:

python ./tests/slo/src create grpc://localhost:2136 /local --mode core
python ./tests/slo/src run    grpc://localhost:2136 /local --mode core --time 60

Notes

The in-run report job needs pull-requests: write, which same-repo PRs have. For fork PRs the report can be moved to a separate workflow_run-triggered workflow.
The workload source under tests/slo/ is outside the existing test/ lint scope, so it doesn't affect the style/tests workflows.

Introduce a parallel read/write SLO workload built on the ydb_sqlalchemy dialect (SQLAlchemy Core and ORM modes) and wire it into ydb-slo-action via a label-gated GitHub workflow. - tests/slo: workload runner, Dockerfile, entrypoint, requirements, README - .github/workflows/slo.yml: build current+baseline images, run init@v2 and publish report@v2 on PRs labelled "SLO"

The dialect integration tests now live in tests/integration/ alongside tests/slo/, so the repo no longer has both a test/ and a tests/ directory. tox.ini (lint + dialect pytest paths) and setup.cfg (profile_file) are updated accordingly.

github-actions · 2026-06-17T11:26:15Z

🌋 SLO Test Results

🟢 2 workload(s) tested — All thresholds passed

Commit: 86a7458 · View run

Workload	Thresholds	Duration	Report
orm	🟢 OK	2m 5s	📄 Report
core	🟢 OK	2m 5s	📄 Report

Generated by ydb-slo-action

The dialect runs in AUTOCOMMIT, so each single-statement read/write already goes through the YDB SDK's retry_operation_sync inside ydb-dbapi. The workload now performs one attempt per operation and records any surfaced exception as a real SLO failure, instead of a broad app-level retry loop that masked non-retryable errors. Removes the now-unused timeout/max-retries flags.

Align workload_duration and read/write RPS with ydb-python-sdk's tests/slo workflow. extra-nodes stays disabled to fit a GitHub-hosted runner.

…cluster, 600s, 1000/100 rps) Run the workload job on the large-runner-sqlalchemy self-hosted runner with the full YDB cluster (all compose profiles), and align workload_duration and read/write RPS with ydb-python-sdk's tests/slo workflow. The report job stays on ubuntu-latest.

vgvoleg added the SLO Run SLO checks label Jun 17, 2026

vgvoleg added 2 commits June 17, 2026 12:48

SLO: create baseline tests/ parent dir before copying runner

5e90c6b

vgvoleg added 3 commits June 17, 2026 14:44

SLO: match the python-sdk reference config (600s, 1000/100 rps)

86f8e29

Align workload_duration and read/write RPS with ydb-python-sdk's tests/slo workflow. extra-nodes stays disabled to fit a GitHub-hosted runner.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SLO checks with a SQLAlchemy read/write workload#116

Add SLO checks with a SQLAlchemy read/write workload#116
vgvoleg wants to merge 6 commits into
mainfrom
add-slo-checks

vgvoleg commented Jun 17, 2026

Uh oh!

github-actions Bot commented Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

vgvoleg commented Jun 17, 2026

What

Workload (tests/slo/)

Workflow (.github/workflows/slo.yml)

How to run

Notes

Uh oh!

github-actions Bot commented Jun 17, 2026

🌋 SLO Test Results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Workload (`tests/slo/`)

Workflow (`.github/workflows/slo.yml`)