Add s3-graphs-zip workflow for zipping CE graphs #999

Open
rsaksida wants to merge 3 commits into master from feature/s3-graphs-zip
Conversation

@rsaksida
Member

@rsaksida rsaksida commented Mar 6, 2026

  • Add s3-graphs-zip workflow for archiving CE graphs
  • Hook up legacy CER API to Argo workflows

Issue 17 of the new repo

@rohit-joy
Contributor

Go ahead and deploy it in Sandbox. Then let @JWaltuch or @mparsons-ce know to test it. Then go to production. :)

resources:
requests:
cpu: "1000m"
memory: "2Gi"
Contributor

@rohit-joy rohit-joy Mar 9, 2026


With the streaming ZIP operation, why does it need a 2GB memory minimum? I would think at most the memory consumption would be the size of the ZIP metadata header, which is about 50MB for 500K files. So it should not be 2GB. The reason this is concerning is that by allocating 2GB up front, Kubernetes is going to assume it needs to auto-scale more VMs to serve other apps. I recommend reducing this number to 200MB or lower to see where it breaks, then increasing it in 100MB increments until it doesn't break.

Also, add a # comment here indicating the reason for the low memory allocation.

Same question for the CPU. 1000m is probably too high for this workflow.

Note that these workflows are very lightweight, so we should keep memory allocations to the minimum required to reduce costs as we start running unrelated workflows in parallel.
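For illustration, a sketch of the kind of trimmed-down request block being suggested here (the 200m/256Mi figures are starting points to probe, not measured values):

```yaml
resources:
  requests:
    # Streaming ZIP holds little in memory beyond the central-directory
    # metadata, so we deliberately request far less than the old template did.
    cpu: "200m"
    memory: "256Mi"
```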

Member Author


This configuration doesn't mean Kubernetes allocates 2GB up front. It means the node where the pod gets deployed has 2GB reserved. Anyhow, fair point; this was brought over from the previous template. I'll check with Ariel if there are special reasons for the node configuration here and reduce it accordingly.

Contributor

@rohit-joy rohit-joy Mar 9, 2026


Reservation == allocation. No? :) Reminds me of the Seinfeld car reservation scene.

@rohit-joy rohit-joy self-requested a review March 10, 2026 01:05
rsaksida and others added 3 commits March 10, 2026 19:44
- Add script to package CE graph JSON files into zip files in S3
  - Streams zip files directly to S3 with multipart upload support
  - Calls preconfigured webhook for notifications
- Add Dockerfile for Argo / container orchestration
- Add docker-compose.yml with LocalStack for integration test setup
- Add workflow template
- Hook up legacy CER API to Argo workflows (WIP)
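The streaming described in the first commit could look roughly like the sketch below, which buffers zip output and flushes fixed-size chunks suitable for S3 multipart upload. The `upload_part` callback, `S3PartBuffer`, and `stream_zip` names are all illustrative stand-ins, not the actual script; in practice the callback would wrap boto3's `upload_part` call.

```python
import io
import zipfile

PART_SIZE = 5 * 1024 * 1024  # S3 multipart uploads require >= 5 MiB parts (except the last)


class S3PartBuffer(io.RawIOBase):
    """File-like sink that accumulates bytes and flushes fixed-size parts.

    Unseekable on purpose: zipfile then writes in streaming mode with data
    descriptors, so the whole archive never has to sit in memory.
    """

    def __init__(self, upload_part):
        self._buf = bytearray()
        self._upload_part = upload_part  # callable(part_number, part_bytes)
        self._part_number = 1

    def writable(self):
        return True

    def write(self, data):
        self._buf.extend(data)
        while len(self._buf) >= PART_SIZE:
            self._upload_part(self._part_number, bytes(self._buf[:PART_SIZE]))
            del self._buf[:PART_SIZE]
            self._part_number += 1
        return len(data)

    def flush_remainder(self):
        """Send whatever is left as the final (possibly short) part."""
        if self._buf:
            self._upload_part(self._part_number, bytes(self._buf))
            self._buf.clear()


def stream_zip(files, upload_part):
    """Write (name, payload) pairs into a zip streamed out via upload_part."""
    sink = S3PartBuffer(upload_part)
    with zipfile.ZipFile(sink, "w", zipfile.ZIP_DEFLATED) as zf:
        for name, payload in files:
            zf.writestr(name, payload)
    sink.flush_remainder()
```

Because the sink reports itself as unseekable, peak memory stays around one part's worth of buffered bytes plus the central-directory metadata, which is the property the resource-request discussion above hinges on.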
@rsaksida rsaksida force-pushed the feature/s3-graphs-zip branch from b5b5001 to 6330e6d March 10, 2026 22:45
@rsaksida rsaksida marked this pull request as ready for review March 10, 2026 22:46
memory: "256Mi"
limits:
cpu: "2000m"
memory: "4Gi"
Contributor

@rohit-joy rohit-joy Mar 11, 2026


Make this 2x the requested, so 512Mi. It's better to see the Job fail than have it take up 16x more resources than intended.

Similarly please check the impact of the CPU limit as well.
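Concretely, the requests/limits pairing being asked for would look something like this (the memory values mirror the diff and the 2x suggestion; the CPU figures are illustrative and should be validated):

```yaml
resources:
  requests:
    cpu: "200m"       # illustrative; measure before settling on a value
    memory: "256Mi"
  limits:
    # ~2x the request: better to let the Job fail fast than consume 16x.
    cpu: "400m"
    memory: "512Mi"
```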

- name: destination-bucket
- name: destination-prefix
- name: max-uncompressed-zip-size-bytes
value: "209715200"
Contributor


Why this number?

Member Author


We're limiting zip file size either by the number of files in the zip (default 25k) or by max uncompressed size (default 200MB), whichever is hit first. The rationale for the numbers is simply developer convenience; they're configurable via those parameters.
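For what it's worth, 209715200 bytes is exactly 200 * 1024 * 1024, i.e. 200MiB. The whichever-comes-first batching could be sketched like this (`plan_zip_batches` is a hypothetical helper, not the actual script's code):

```python
def plan_zip_batches(entries, max_files=25_000,
                     max_uncompressed_bytes=200 * 1024 * 1024):
    """Group (key, size) pairs into zip batches.

    A new batch starts whenever adding the next file would exceed either the
    file-count cap or the uncompressed-size cap, whichever is hit first.
    """
    batches, current, current_bytes = [], [], 0
    for key, size in entries:
        if current and (len(current) >= max_files
                        or current_bytes + size > max_uncompressed_bytes):
            batches.append(current)
            current, current_bytes = [], 0
        current.append(key)
        current_bytes += size
    if current:
        batches.append(current)
    return batches
```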

@JWaltuch

Is the intention that if a user tries to run post multiple times with the same auth header it will still only make one download request? That seems like the behavior I'm seeing so just wanted to confirm. Either that or it keeps returning data from just the first request.

I also just tested running it normally and doing the downloads. Seems to work for me.

Contributor

@mparsons-ce mparsons-ce left a comment


The download process works as designed.

@jeannekitchens

@JWaltuch see the comments from @mparsons-ce and please review the PR.

@rsaksida
Member Author

Is the intention that if a user tries to run post multiple times with the same auth header it will still only make one download request? That seems like the behavior I'm seeing so just wanted to confirm. Either that or it keeps returning data from just the first request.

Yep, that's the intention.
