
Add baseline Grafana observability and stage timing telemetry #439

Draft
anth-volk wants to merge 4 commits into main from feat/grafana-observability

@anth-volk
Contributor

Fixes #438

Summary

  • add baseline OTLP/Grafana-capable observability in shadow mode for the simulation gateway and worker
  • instrument gateway routing and worker lifecycle with coarse logs, traces, and metrics
  • add stage-level timing telemetry for routing, queue delay, worker phases, completion, and failure paths
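As a rough illustration of the stage-level timing described above, here is a minimal sketch using only the standard library. The names `StageTimings`, `stage`, and `queue_delay_seconds` are hypothetical and are not taken from this PR; the actual change presumably records these values through the OTLP-capable observability layer rather than a plain dictionary.

```python
import time
from contextlib import contextmanager
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class StageTimings:
    """Collects per-stage durations (seconds) for one job. Hypothetical helper."""

    durations: Dict[str, float] = field(default_factory=dict)
    failed_stage: Optional[str] = None

    @contextmanager
    def stage(self, name: str, clock=time.perf_counter):
        """Time one named stage; the duration is recorded even if the stage fails."""
        start = clock()
        try:
            yield
        except Exception:
            # Remember which stage failed, then let the error propagate.
            self.failed_stage = name
            raise
        finally:
            self.durations[name] = clock() - start


def queue_delay_seconds(submitted_at: float, started_at: float) -> float:
    """Delay between gateway submission and worker start, clamped at zero
    so that clock skew between hosts never yields a negative delay."""
    return max(0.0, started_at - submitted_at)
```

A caller would wrap each phase in `with timings.stage("routing"): ...` and emit `timings.durations` as metric values or span attributes once the job completes or fails.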

Testing

  • uv run ruff check projects/policyengine-api-simulation/src/modal/gateway/endpoints.py projects/policyengine-api-simulation/src/modal/observability.py projects/policyengine-api-simulation/src/modal/app.py projects/policyengine-api-simulation/src/modal/simulation.py projects/policyengine-api-simulation/tests/gateway/test_endpoints.py projects/policyengine-api-simulation/tests/test_modal_observability.py libs/policyengine-fastapi/src/policyengine_fastapi/observability/config.py
  • uv run pytest projects/policyengine-api-simulation/tests/gateway/test_endpoints.py projects/policyengine-api-simulation/tests/gateway/test_models.py projects/policyengine-api-simulation/tests/test_modal_observability.py
  • uv run --extra test pytest libs/policyengine-fastapi/tests/test_observability.py

@anth-volk force-pushed the feat/grafana-observability branch from 7fc1e03 to 5278263 on April 14, 2026 20:01

Development

Successfully merging this pull request may close these issues.

Implement baseline Grafana observability and stage timing for simulation gateway and worker

1 participant