Add streaming task log support to BaseExecutor and FileTaskHandler by jason810496 · Pull Request #69299 · apache/airflow

jason810496 · 2026-07-03T06:26:16Z

Part of the streaming task log series.

Merge this first; the other two build on it:
Add streaming task log support to KubernetesExecutor #69300
Add running_pod_log_lines config option to KubernetesExecutor #69301

Why

When the API server reads logs for a RUNNING task, the executor's get_task_log returns the entire log fully materialized. The whole log therefore sits on the heap before the bounded LogStreamAccumulator downstream (5000 messages resident, the rest spilled to disk) can do its job. Large logs spike the API server's anonymous heap and can OOM it.
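
To make the difference concrete, here is a small, self-contained sketch that compares peak allocations when a log is materialized as a list versus consumed lazily from a generator. The `log_lines` source and the helper names are hypothetical stand-ins, not Airflow code, and the numbers it produces are not comparable to the benchmark below.

```python
import tracemalloc
from collections.abc import Iterator


def log_lines(n: int) -> Iterator[str]:
    """Synthetic log source; stands in for an executor reading a task log."""
    for i in range(n):
        yield f'{{"event": "line {i}", "level": "info"}}'


def peak_bytes(consume) -> int:
    """Run consume() and return the peak traced allocation in bytes."""
    tracemalloc.start()
    try:
        consume()
        _current, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return peak


def materialized(n: int = 50_000) -> int:
    lines = list(log_lines(n))  # the whole log is resident at once
    return sum(len(line) for line in lines)


def streamed(n: int = 50_000) -> int:
    # Only one line is resident at a time; each is discarded after use.
    return sum(len(line) for line in log_lines(n))


peak_materialized = peak_bytes(materialized)
peak_streamed = peak_bytes(streamed)
print(f"materialized peak: {peak_materialized} B, streamed peak: {peak_streamed} B")
```

Both functions compute the same result; only the peak amount of memory held while computing it differs.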

What

Add get_streaming_task_log and a supports_streaming_logs class attribute (default False) to BaseExecutor.
FileTaskHandler._read prefers the streaming method when the executor advertises supports_streaming_logs, and falls back to the legacy get_task_log otherwise, so provider and custom executors that haven't implemented it keep working unchanged.
No executor in this PR advertises support yet; the KubernetesExecutor family implementation is the follow-up PR.
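
A minimal sketch of how the capability flag and the fallback described above might fit together. The names `supports_streaming_logs`, `get_streaming_task_log` and `get_task_log` come from this PR; the streaming method's signature and return type, the `read_log_lines` helper, and the two demo executors are assumptions for illustration, not the actual Airflow implementation.

```python
from collections.abc import Iterator


class BaseExecutor:
    """Stand-in for the real base class; the streaming signature is assumed."""

    # Executors opt in by overriding this class attribute.
    supports_streaming_logs: bool = False

    def get_task_log(self, ti, try_number: int) -> tuple[list[str], list[str]]:
        """Legacy path: return (messages, logs) fully materialized."""
        return [], []

    def get_streaming_task_log(self, ti, try_number: int) -> Iterator[str]:
        """Streaming path: yield log lines lazily. The default yields nothing."""
        return iter(())


class LegacyExecutor(BaseExecutor):
    """Hypothetical executor that only implements the legacy method."""

    def get_task_log(self, ti, try_number):
        return ["legacy read"], ["line 1\nline 2"]


class DemoStreamingExecutor(BaseExecutor):
    """Hypothetical executor that advertises streaming support."""

    supports_streaming_logs = True

    def get_streaming_task_log(self, ti, try_number):
        for i in range(1, 3):
            yield f"line {i}"


def read_log_lines(executor: BaseExecutor, ti, try_number: int) -> Iterator[str]:
    """Hypothetical helper mirroring the preference order in FileTaskHandler._read."""
    if executor.supports_streaming_logs:
        yield from executor.get_streaming_task_log(ti, try_number)
        return
    # Fallback: executors that never implemented streaming keep working.
    _messages, logs = executor.get_task_log(ti, try_number)
    for chunk in logs:
        yield from chunk.splitlines()


legacy_lines = list(read_log_lines(LegacyExecutor(), None, 1))
streaming_lines = list(read_log_lines(DemoStreamingExecutor(), None, 1))
```

Because the flag defaults to False on the base class, an executor that overrides nothing takes the legacy branch, which is the backward-compatibility property the description relies on.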

Benchmark

A/B measurement of API server memory serving the same ~415 MB ndjson log (1M lines) of a RUNNING KubernetesExecutor task through GET .../logs/{try_number} with Accept: application/x-ndjson, sampling the API server cgroup. A = materializing read (before this series), B = streaming read (this PR + the KubernetesExecutor follow-up).

| Metric (API server cgroup) | A: materializing | B: streaming |
|---|---|---|
| Peak anonymous heap growth | +2093.9 MiB | +179.9 MiB (~11.6x lower) |
| Peak RSS (`memory.current`) | 2964.4 MiB | 1193.4 MiB |
| Elapsed | 33.2 s | 19.4 s |

Without streaming, the full log lives on the heap at once (~2.1 GiB of anonymous, non-reclaimable memory, which is what OOMs the API server). With streaming, the executor yields into LogStreamAccumulator; B's remaining RSS growth is mostly reclaimable page cache from the accumulator's disk spill.
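
The "bounded in memory, spill to disk" behaviour can be sketched as below. This is a toy stand-in written for illustration, not the real LogStreamAccumulator: the class name `BoundedAccumulator`, its methods, and the flush-everything-at-threshold policy are all assumptions, and it assumes individual lines contain no embedded newlines.

```python
import tempfile
from collections.abc import Iterable, Iterator


class BoundedAccumulator:
    """Toy accumulator: keep at most `threshold` lines in memory, spill to disk."""

    def __init__(self, threshold: int = 5000) -> None:
        self.threshold = threshold
        self._buffer: list[str] = []
        self._spill = None  # temporary file, created on first overflow
        self.total = 0

    def add(self, lines: Iterable[str]) -> None:
        """Consume a lazy stream of lines without holding them all in memory."""
        for line in lines:
            self._buffer.append(line)
            self.total += 1
            if len(self._buffer) >= self.threshold:
                self._flush()

    def _flush(self) -> None:
        if self._spill is None:
            self._spill = tempfile.TemporaryFile(mode="w+", encoding="utf-8")
        self._spill.writelines(line + "\n" for line in self._buffer)
        self._buffer.clear()

    def __iter__(self) -> Iterator[str]:
        """Replay spilled lines first, then the lines still in memory."""
        if self._spill is not None:
            self._spill.seek(0)
            for line in self._spill:
                yield line.rstrip("\n")
        yield from self._buffer

    def close(self) -> None:
        if self._spill is not None:
            self._spill.close()


acc = BoundedAccumulator(threshold=3)
acc.add(f"line {i}" for i in range(8))  # a generator, so input is never materialized
replayed = list(acc)
resident = len(acc._buffer)
acc.close()
```

Data written to the temporary file lands in the page cache, which the kernel can reclaim under pressure, whereas a fully materialized list is anonymous memory that cannot be dropped.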

Interpretation caveat: KubernetesExecutor.RUNNING_POD_LOG_LINES = 100 normally caps the executor read to the last 100 lines, which would hide the difference; the benchmark lifted the cap to 100,000,000 in both builds. Production keeps the cap at 100, so this measures the mechanism's headroom for executors returning large logs, not a shipped memory reduction today (the running_pod_log_lines config PR is what makes the cap tunable). Single run per build, and only the ndjson path streams end to end (Accept: application/json buffers the whole response regardless).

Was generative AI tooling used to co-author this PR?

Yes, with the help of Claude Code Fable 5, following the guidelines

Reading a running task's log through an executor materializes the whole log in the API server before the bounded LogStreamAccumulator can bound memory, so large logs spike the API server heap. This adds an interface executors can implement to stream log lines lazily instead.

boring-cyborg Bot added area:Executors-core LocalExecutor & SequentialExecutor area:logging labels Jul 3, 2026

jason810496 self-assigned this Jul 3, 2026

This was referenced Jul 3, 2026

Add streaming task log support to KubernetesExecutor #69300

Draft

Add running_pod_log_lines config option to KubernetesExecutor #69301

Draft

jason810496 added the area:API Airflow's REST/HTTP API label Jul 3, 2026

jason810496 requested review from Lee-W, amoghrajesh and uranusjr July 3, 2026 06:49

jason810496 marked this pull request as ready for review July 3, 2026 10:34

jason810496 requested review from XD-DENG, ashb, dheerajturaga, hussein-awala, o-nikolas and pierrejeambrun as code owners July 3, 2026 10:34

jason810496 wants to merge 1 commit into apache:main from jason810496:refactor/logging/add-stream-method-for-base-executor
