ETCD-704: cluster-restore.sh: move extra /var/lib/etcd files to backup by apurvanisal5 · Pull Request #1628 · openshift/cluster-etcd-operator

apurvanisal5 · 2026-06-07T19:09:48Z

Summary

Legacy cluster-restore.sh fails with folder /var/lib/etcd is not empty when extra files exist under /var/lib/etcd after member/ is moved.
Add backup_remaining_etcd_data_dir_contents() to move remaining top-level files to /var/lib/etcd-backup instead of exiting.

Jira

Fixes: ETCD-704

Verification

OCP 4.22.0-rc.4, AWS IPI 3-node HA:

Legacy script fails when seed files present in /var/lib/etcd
Patched script moves demo files to /var/lib/etcd-backup and completes SNAPSHOT RESTORE COMPLETED
Full HA restore; cluster healthy; testing-seed-project restored from backup

Test plan

Reproduce legacy failure with extra files in /var/lib/etcd
Patched restore moves extras to /var/lib/etcd-backup
Full 3-node HA restore succeeds
etcd data restored from snapshot

Summary by CodeRabbit

Bug Fixes
- Restore process now preserves and moves leftover etcd data into a backup location before clearing the data directory, preventing hard failures when the data directory is not empty and improving recoverability.

coderabbitai · 2026-06-07T19:09:56Z

Walkthrough

The restore scripts now back up any remaining entries from ETCD_DATA_DIR into ETCD_DATA_DIR_BACKUP/extra-data-dir-contents via a new helper, and both restore initialization paths invoke that helper before wiping or requiring an empty data directory.

Changes

etcd Restore Data Cleanup

Layer / File(s)	Summary
Backup helper and core implementation `bindata/etcd/etcd-common-tools`	Adds `backup_remaining_etcd_data_dir_contents()` which moves all entries from `ETCD_DATA_DIR` to `ETCD_DATA_DIR_BACKUP/extra-data-dir-contents`, handling dotfiles and clearing existing targets first.
Restore script integration `bindata/etcd/cluster-restore.sh`	Replaces prior delete-or-fail logic with a call to `backup_remaining_etcd_data_dir_contents` in the non-`ETCD_ETCDCTL_RESTORE` path after snapshot/revision copy.
TNF restore script integration `bindata/etcd/cluster-restore-tnf.sh`	Calls `backup_remaining_etcd_data_dir_contents` after backing up member data and before removing `ETCD_DATA_DIR`.

🎯 3 (Moderate) | ⏱️ ~20 minutes

🚥 Pre-merge checks | ✅ 12 | ❌ 3

❌ Failed checks (2 warnings, 1 inconclusive)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.
Test Structure And Quality	⚠️ Warning	Ginkgo tests use assertions like `o.Expect(err).NotTo(o.HaveOccurred())` with no message in test/e2e/network_policy.go (lines 44,46) and test/e2e/etcd_verification.go and etcd_disruptive.go.	Update affected assertions to include descriptive messages, e.g. `o.Expect(err).NotTo(o.HaveOccurred(), "failed to <action>")`, and ensure similar checks throughout the Ginkgo specs include context.
Container-Privileges	❓ Inconclusive	Custom check execution failed before a final verdict was produced.	Retry the review run. If this persists, inspect pre-merge custom-check logs for infrastructure or agent runtime failures.

✅ Passed checks (12 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Stable And Deterministic Test Names	✅ Passed	Scanned ginkgo-style test titles in changed *_test.go / e2e files for dynamic constructs (fmt.Sprintf, timestamps/dates, UUIDs, IPs, node/pod/namespace suffixes); none matched the disallowed patterns.
Microshift Test Compatibility	✅ Passed	PR changes only bindata/etcd restore scripts (helper appears only there); no new/modified Ginkgo e2e tests detected in test/e2e, so MicroShift API compatibility check is not applicable.
Single Node Openshift (Sno) Test Compatibility	✅ Passed	PR `#1628` only updates bindata/etcd restore shell scripts (3 files) and adds no Ginkgo e2e tests, so there are no SNO multi-node/HA assumptions to flag.
Topology-Aware Scheduling Compatibility	✅ Passed	PR only changes bindata/etcd/cluster-restore*.sh and etcd-common-tools; inspected contents contain no pod scheduling constraints (affinity, topologySpreadConstraints, node-role selectors).
Ote Binary Stdout Contract	✅ Passed	Checked OTE binaries under cmd/cluster-etcd-operator-tests-ext for stdout writes: no fmt.Print/Printf/Println. klog.Fatal is used and vendored klog defaults logtostderr=true (stderr).
Ipv6 And Disconnected Network Test Compatibility	✅ Passed	PR only updates bindata/etcd restore shell scripts and etcd-common-tools; no new/changed Ginkgo e2e tests (It/Describe/etc) to flag for IPv4 or external connectivity assumptions.
No-Weak-Crypto	✅ Passed	Searched the PR’s updated files (bindata/etcd/cluster-restore*.sh and etcd-common-tools) for MD5/SHA1/DES/RC4/3DES/Blowfish/ECB and crypto/comparison patterns; none found.
No-Sensitive-Data-In-Logs	✅ Passed	New logging added by PR (backup_remaining_etcd_data_dir_contents) only echoes etcd data-dir/backup paths & filenames; no passwords/tokens/PII/session IDs/internal hostnames are logged.
Title check	✅ Passed	The title accurately summarizes the main change: adding functionality to move extra /var/lib/etcd files to a backup directory instead of failing when the directory is not empty.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

apurvanisal5 · 2026-06-07T19:13:16Z

ETCD-704-VERIFICATION-OUTPUTS.txt
Reprod details

apurvanisal5 · 2026-06-08T01:21:49Z

CI analysis for failed required jobs:

Job	Failed test	Notes
e2e-agnostic-ovn	ResourceQuota terminating scopes	sig-api-machinery, unrelated
e2e-aws-ovn-single-node	Pod InPlace Resize	sig-node, unrelated
e2e-gcp-operator-disruptive	TestPeriodicBackupHappyPath (timeout)	backup test, not cluster-restore.sh

Manual verification on OCP 4.22.0-rc.4 (3-node HA): legacy restore fails with extra files in /var/lib/etcd; patched script moves files to /var/lib/etcd-backup and completes SNAPSHOT RESTORE. Full HA restore verified (ETCD-704).

/retest required

apurvanisal5 · 2026-06-08T05:24:14Z

2/3 required jobs now green. Remaining failure is TestRetentionBySize
(backup retention count flake — found 6 groups vs expected 4-5, unrelated to cluster-restore.sh).
TestPeriodicBackupHappyPath and TestBackupScript passed on same run.
Manual ETCD-704 HA restore verified on 4.22.

/test e2e-gcp-operator-disruptive

apurvanisal5 · 2026-06-08T07:40:13Z

Latest e2e-gcp-operator-disruptive run: all operator e2e tests passed (47m),
job failed only in post-step gather-must-gather due to GitHub camgi.tar download
infra flake — unrelated to ETCD-704.

Previous failures were TestRetentionBySize / TestPeriodicBackupHappyPath flakes.
Manual 3-node HA restore verified on 4.22.

/test e2e-gcp-operator-disruptive

apurvanisal5 · 2026-06-08T09:45:44Z

/label merge-review-needed

openshift-ci · 2026-06-08T09:45:55Z

@apurvanisal5: The label(s) /label merge-review-needed cannot be applied. These labels are supported: acknowledge-critical-fixes-only, platform/aws, platform/azure, platform/baremetal, platform/google, platform/libvirt, platform/openstack, ga, tide/merge-method-merge, tide/merge-method-rebase, tide/merge-method-squash, px-approved, docs-approved, qe-approved, ux-approved, no-qe, rebase/manual, cluster-config-api-changed, run-integration-tests, verified, ready-for-human-review, approved, backport-risk-assessed, bugzilla/valid-bug, cherry-pick-approved, jira/skip-dependent-bug-check, jira/valid-bug, ok-to-test, stability-fix-approved, staff-eng-approved. Is this label configured under labels -> additional_labels or labels -> restricted_labels in plugin.yaml?

Details

In response to this:

/label merge-review-needed

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

apurvanisal5 · 2026-06-08T09:48:43Z

/label ready-for-human-review

dusk125 · 2026-06-09T16:01:17Z

@coderabbitai review

coderabbitai · 2026-06-09T16:01:25Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@bindata/etcd/cluster-restore.sh`:
- Around line 70-75: The script currently does rm -rf
"${ETCD_DATA_DIR_BACKUP:?}/${base}" before mv "${entry}"
"${ETCD_DATA_DIR_BACKUP}/", which can clobber previously staged restore
artifacts (snapshot.db, revision.json, member/); instead, stop deleting the
target backup root entry—if "${ETCD_DATA_DIR_BACKUP}/${base}" already exists,
either skip moving the stray "${entry}", or move it to a conflict-safe name
(e.g., append a timestamp or ".orig") so existing staged files are preserved;
apply the same behavior to the separate moves that write snapshot.db and
revision.json (refer to ETCD_DATA_DIR_BACKUP, base, entry, snapshot.db,
revision.json, and member/) so nothing in the backup root gets overwritten.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Repository: openshift/coderabbit/.coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 5b980737-38e5-4d48-b0aa-3efe28c29c3d

📥 Commits

Reviewing files that changed from the base of the PR and between 2674d1c and 4e87179.

📒 Files selected for processing (1)

bindata/etcd/cluster-restore.sh

dusk125 · 2026-06-09T20:18:50Z

@apurvanisal5 can you re-target this change to the main branch please? And we can backport if necessary

apurvanisal5 · 2026-06-10T03:53:51Z

/test e2e-gcp-operator-disruptive

apurvanisal5 · 2026-06-10T06:22:08Z

/test e2e-metal-ipi-ovn-ipv6

apurvanisal5 · 2026-06-10T08:43:08Z

/test e2e-metal-ipi-ovn-ipv6

tjungblu · 2026-06-10T08:46:55Z

you can save yourself the testing @apurvanisal5 - you're targeting the wrong branch. Please follow the backport process in https://docs.google.com/document/d/1FCL6HIUGKhelHKc6dHtZ8AvpFYjmE4_aRz8NOmPsQl8/edit?tab=t.k517pl1uza05#heading=h.wnk379sj5pq7
if you want to target 4.22.

tjungblu · 2026-06-10T08:48:02Z

  done
 }

+function backup_remaining_etcd_data_dir_contents() {


shouldn't this go into common and be shared with cluster-restore-tnf.sh?

tjungblu · 2026-06-10T08:49:28Z

+  mkdir -p "${extras_dir}"
+
+  shopt -s nullglob dotglob
+  for entry in "${ETCD_DATA_DIR}"/*; do


doesn't that move the revision.json out of this folder again?

for revion.json

109 # Copy snapshot to backupdir
110 cp -p "${SNAPSHOT_FILE}" "${ETCD_DATA_DIR_BACKUP}"/snapshot.db
111 # Move the revision.json when it exists
112 [ ! -f "${ETCD_REV_JSON}" ] || mv -f "${ETCD_REV_JSON}" "${ETCD_DATA_DIR_BACKUP}"/revision.json <===== this line already moves revision.json before backup_remaining_etcd_data_dir_contents is called
113 # Move any remaining files (fio perf artifacts, stray snapshots, etc.) out of the data dir.
114 # The restore pod requires /var/lib/etcd to be empty before it runs.
115 backup_remaining_etcd_data_dir_contents

When cluster-restore.sh runs the restore-pod path, it moves member/ and revision.json to /var/lib/etcd-backup, deletes etcd_perf*, then exits if anything remains in /var/lib/etcd. Extra files (perf artifacts, stray snapshots, etc.) cause DR restore to fail before the restore pod starts. Add backup_remaining_etcd_data_dir_contents() to move all remaining top-level entries to /var/lib/etcd-backup instead of failing. Fixes: ETCD-704 Related: https://access.redhat.com/solutions/6958920

Move leftover /var/lib/etcd entries to extra-data-dir-contents/ so staged snapshot.db, revision.json, and member/ are not overwritten. Co-authored-by: Cursor <cursoragent@cursor.com>

Share backup_remaining_etcd_data_dir_contents via etcd-common-tools per review feedback. Behavior unchanged.

apurvanisal5 · 2026-06-11T14:55:12Z

Re-targeted to main.

Moved the helper to etcd-common-tools per @tjungblu review. Same fix as before nothing else changed.

I already tested this on a real 3-node cluster (4.22) as mentioned in #1628 (comment)

Will backport to 4.22 after merge if needed.

apurvanisal5 · 2026-06-11T20:10:11Z

/retest-required

apurvanisal5 · 2026-06-12T03:19:46Z

/test e2e-gcp-operator-disruptive

openshift-ci-robot · 2026-06-12T20:17:43Z

@apurvanisal5: This pull request references ETCD-704 which is a valid jira issue.

Details

In response to this:

Summary

Legacy cluster-restore.sh fails with folder /var/lib/etcd is not empty when extra files exist under /var/lib/etcd after member/ is moved.

Add backup_remaining_etcd_data_dir_contents() to move remaining top-level files to /var/lib/etcd-backup instead of exiting.

Jira

Fixes: ETCD-704

Verification

OCP 4.22.0-rc.4, AWS IPI 3-node HA:

Legacy script fails when seed files present in /var/lib/etcd

Patched script moves demo files to /var/lib/etcd-backup and completes SNAPSHOT RESTORE COMPLETED

Full HA restore; cluster healthy; testing-seed-project restored from backup

Test plan

Reproduce legacy failure with extra files in /var/lib/etcd

Patched restore moves extras to /var/lib/etcd-backup

Full 3-node HA restore succeeds

etcd data restored from snapshot

Summary by CodeRabbit

Bug Fixes

Restore process now preserves and moves leftover etcd data into a backup location before clearing the data directory, preventing hard failures when the data directory is not empty and improving recoverability.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

apurvanisal5 · 2026-06-12T20:17:53Z

/jira refresh

openshift-ci-robot · 2026-06-12T20:17:57Z

@apurvanisal5: This pull request references ETCD-704 which is a valid jira issue.

Details

In response to this:

/jira refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

apurvanisal5 · 2026-06-12T20:19:46Z

/verified by @apurvanisal5

Manual 3-node HA etcd restore on OCP 4.21.18 (cluster anisal-test-m):

Seeded extra files under /var/lib/etcd
Patched cluster-restore.sh moved them to /var/lib/etcd-backup/extra-data-dir-contents/
SNAPSHOT RESTORE COMPLETED
etcd Available=True, all nodes Ready, post-test-for-etcd project restored from backup
ETCD-704-anisal-test-m-VERIFICATION-OUTPUTS.txt

openshift-ci-robot · 2026-06-12T20:19:58Z

@apurvanisal5: This PR has been marked as verified by @apurvanisal5.

Details

In response to this:

/verified by @apurvanisal5

Manual 3-node HA etcd restore on OCP 4.21.18 (cluster anisal-test-m):

Seeded extra files under /var/lib/etcd

Patched cluster-restore.sh moved them to /var/lib/etcd-backup/extra-data-dir-contents/

SNAPSHOT RESTORE COMPLETED

etcd Available=True, all nodes Ready, post-test-for-etcd project restored from backup
ETCD-704-anisal-test-m-VERIFICATION-OUTPUTS.txt

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

apurvanisal5 · 2026-06-12T20:20:21Z

/test e2e-metal-ovn-two-node-fencing

apurvanisal5 · 2026-06-12T20:26:55Z

/retest-required

apurvanisal5 · 2026-06-13T03:27:53Z

/test e2e-gcp-operator-disruptive

apurvanisal5 · 2026-06-13T07:26:37Z

/test e2e-gcp-operator-disruptive

apurvanisal5 · 2026-06-13T10:58:58Z

/test e2e-gcp-operator-disruptive

apurvanisal5 · 2026-06-13T16:03:17Z

/test e2e-gcp-operator-disruptive

openshift-merge-bot · 2026-06-13T22:23:06Z

/retest-required

Remaining retests: 0 against base HEAD 2abd78c and 2 for PR HEAD 38bade7 in total

apurvanisal5 · 2026-06-14T03:55:18Z

/retest-required

apurvanisal5 · 2026-06-14T08:25:32Z

/retest-required

apurvanisal5 · 2026-06-14T11:42:08Z

/retest-required

openshift-ci · 2026-06-14T17:45:23Z

@apurvanisal5: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/e2e-aws-etcd-recovery	`4b715f1`	link	false	`/test e2e-aws-etcd-recovery`

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

apurvanisal5 · 2026-06-15T08:55:18Z

@tjungblu all e2e-gcp-operator-disruptive runs fail on TestEtcdDBScaling only — backendQuotaGiB 32 > max 16 (scaling_dbsize.go:44). Backup tests pass. Unrelated to ETCD-704. Retest keeps hitting the same thing. Any chance we can merge or skip this job? Thanks.

tjungblu · 2026-06-15T10:29:01Z

/override ci/prow/e2e-gcp-operator-disruptive

openshift-ci · 2026-06-15T10:29:18Z

@tjungblu: Overrode contexts on behalf of tjungblu: ci/prow/e2e-gcp-operator-disruptive

Details

In response to this:

/override ci/prow/e2e-gcp-operator-disruptive

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

apurvanisal5 · 2026-06-17T10:00:12Z

/cherry-pick release-4.22
/cherry-pick release-4.21
/cherry-pick release-4.20
/cherry-pick release-4.19
/cherry-pick release-4.18
/cherry-pick release-4.17

openshift-cherrypick-robot · 2026-06-17T10:01:02Z

@apurvanisal5: #1628 failed to apply on top of branch "release-4.17":

Applying: ETCD-704 — cluster-restore.sh: move extra /var/lib/etcd files to backup
Using index info to reconstruct a base tree...
M	bindata/etcd/cluster-restore.sh
Falling back to patching base and 3-way merge...
Auto-merging bindata/etcd/cluster-restore.sh
CONFLICT (content): Merge conflict in bindata/etcd/cluster-restore.sh
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
hint: When you have resolved this problem, run "git am --continue".
hint: If you prefer to skip this patch, run "git am --skip" instead.
hint: To restore the original branch and stop patching, run "git am --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Patch failed at 0001 ETCD-704 — cluster-restore.sh: move extra /var/lib/etcd files to backup

Details

In response to this:

/cherry-pick release-4.22
/cherry-pick release-4.21
/cherry-pick release-4.20
/cherry-pick release-4.19
/cherry-pick release-4.18
/cherry-pick release-4.17

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2026-06-17T10:01:43Z

@apurvanisal5: #1628 failed to apply on top of branch "release-4.18":

Applying: ETCD-704 — cluster-restore.sh: move extra /var/lib/etcd files to backup
Applying: ETCD-704: move extra etcd data dir files to backup subdirectory
Applying: ETCD-704: move backup helper to etcd-common-tools Share backup_remaining_etcd_data_dir_contents via etcd-common-tools per review feedback. Behavior unchanged.
Applying: ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.
Using index info to reconstruct a base tree...
A	bindata/etcd/cluster-restore-tnf.sh
Falling back to patching base and 3-way merge...
CONFLICT (modify/delete): bindata/etcd/cluster-restore-tnf.sh deleted in HEAD and modified in ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted..  Version ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted. of bindata/etcd/cluster-restore-tnf.sh left in tree.
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
hint: When you have resolved this problem, run "git am --continue".
hint: If you prefer to skip this patch, run "git am --skip" instead.
hint: To restore the original branch and stop patching, run "git am --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Patch failed at 0004 ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.

Details

In response to this:

/cherry-pick release-4.22
/cherry-pick release-4.21
/cherry-pick release-4.20
/cherry-pick release-4.19
/cherry-pick release-4.18
/cherry-pick release-4.17

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2026-06-17T10:02:23Z

@apurvanisal5: #1628 failed to apply on top of branch "release-4.19":

Applying: ETCD-704 — cluster-restore.sh: move extra /var/lib/etcd files to backup
Applying: ETCD-704: move extra etcd data dir files to backup subdirectory
Applying: ETCD-704: move backup helper to etcd-common-tools Share backup_remaining_etcd_data_dir_contents via etcd-common-tools per review feedback. Behavior unchanged.
Applying: ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.
Using index info to reconstruct a base tree...
A	bindata/etcd/cluster-restore-tnf.sh
Falling back to patching base and 3-way merge...
CONFLICT (modify/delete): bindata/etcd/cluster-restore-tnf.sh deleted in HEAD and modified in ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted..  Version ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted. of bindata/etcd/cluster-restore-tnf.sh left in tree.
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
hint: When you have resolved this problem, run "git am --continue".
hint: If you prefer to skip this patch, run "git am --skip" instead.
hint: To restore the original branch and stop patching, run "git am --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Patch failed at 0004 ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.

Details

In response to this:

/cherry-pick release-4.22
/cherry-pick release-4.21
/cherry-pick release-4.20
/cherry-pick release-4.19
/cherry-pick release-4.18
/cherry-pick release-4.17

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2026-06-17T10:03:02Z

@apurvanisal5: #1628 failed to apply on top of branch "release-4.20":

Applying: ETCD-704 — cluster-restore.sh: move extra /var/lib/etcd files to backup
Applying: ETCD-704: move extra etcd data dir files to backup subdirectory
Applying: ETCD-704: move backup helper to etcd-common-tools Share backup_remaining_etcd_data_dir_contents via etcd-common-tools per review feedback. Behavior unchanged.
Applying: ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.
Using index info to reconstruct a base tree...
A	bindata/etcd/cluster-restore-tnf.sh
Falling back to patching base and 3-way merge...
CONFLICT (modify/delete): bindata/etcd/cluster-restore-tnf.sh deleted in HEAD and modified in ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted..  Version ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted. of bindata/etcd/cluster-restore-tnf.sh left in tree.
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
hint: When you have resolved this problem, run "git am --continue".
hint: If you prefer to skip this patch, run "git am --skip" instead.
hint: To restore the original branch and stop patching, run "git am --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Patch failed at 0004 ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.

Details

In response to this:

/cherry-pick release-4.22
/cherry-pick release-4.21
/cherry-pick release-4.20
/cherry-pick release-4.19
/cherry-pick release-4.18
/cherry-pick release-4.17

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2026-06-17T10:03:42Z

@apurvanisal5: #1628 failed to apply on top of branch "release-4.21":

Applying: ETCD-704 — cluster-restore.sh: move extra /var/lib/etcd files to backup
Applying: ETCD-704: move extra etcd data dir files to backup subdirectory
Applying: ETCD-704: move backup helper to etcd-common-tools Share backup_remaining_etcd_data_dir_contents via etcd-common-tools per review feedback. Behavior unchanged.
Applying: ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.
Using index info to reconstruct a base tree...
A	bindata/etcd/cluster-restore-tnf.sh
Falling back to patching base and 3-way merge...
CONFLICT (modify/delete): bindata/etcd/cluster-restore-tnf.sh deleted in HEAD and modified in ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted..  Version ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted. of bindata/etcd/cluster-restore-tnf.sh left in tree.
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
hint: When you have resolved this problem, run "git am --continue".
hint: If you prefer to skip this patch, run "git am --skip" instead.
hint: To restore the original branch and stop patching, run "git am --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Patch failed at 0004 ETCD-704: backup extra data dir contents in cluster-restore-tnf.sh Call backup_remaining_etcd_data_dir_contents before rm -rf so leftover files are moved to extra-data-dir-contents/ instead of deleted.

Details

In response to this:

/cherry-pick release-4.22
/cherry-pick release-4.21
/cherry-pick release-4.20
/cherry-pick release-4.19
/cherry-pick release-4.18
/cherry-pick release-4.17

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2026-06-17T10:04:23Z

@apurvanisal5: new pull request created: #1632

Details

In response to this:

/cherry-pick release-4.22
/cherry-pick release-4.21
/cherry-pick release-4.20
/cherry-pick release-4.19
/cherry-pick release-4.18
/cherry-pick release-4.17

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-ci Bot requested review from jubittajohn and tjungblu June 7, 2026 19:10

tjungblu mentioned this pull request Jun 8, 2026

[Auto-Generated] Personal Workflow Brief tjungblu/agent#1

Open

openshift-ci Bot added the ready-for-human-review Indicates a PR has been reviewed by automated tools and is ready for human review label Jun 8, 2026

coderabbitai Bot reviewed Jun 9, 2026

View reviewed changes

Comment thread bindata/etcd/cluster-restore.sh Outdated

tjungblu reviewed Jun 10, 2026

View reviewed changes

Apurva Nisal and others added 3 commits June 11, 2026 19:58

ETCD-704: move extra etcd data dir files to backup subdirectory

5f56b3a

Move leftover /var/lib/etcd entries to extra-data-dir-contents/ so staged snapshot.db, revision.json, and member/ are not overwritten. Co-authored-by: Cursor <cursoragent@cursor.com>

ETCD-704: move backup helper to etcd-common-tools

4b715f1

Share backup_remaining_etcd_data_dir_contents via etcd-common-tools per review feedback. Behavior unchanged.

apurvanisal5 force-pushed the etcd-704-cluster-restore-backup-remaining branch from 367a077 to 4b715f1 Compare June 11, 2026 14:33

apurvanisal5 changed the base branch from release-4.22 to main June 11, 2026 14:33

openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Jun 12, 2026

openshift-ci-robot added the verified Signifies that the PR passed pre-merge verification criteria label Jun 12, 2026

openshift-merge-bot Bot merged commit f38807a into openshift:main Jun 15, 2026
18 checks passed

openshift-cherrypick-robot mentioned this pull request Jun 17, 2026

[release-4.22] OCPBUGS-89322: cluster-restore.sh: move extra /var/lib/etcd files to backup #1632

Open

Conversation

apurvanisal5 commented Jun 7, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Jira

Verification

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Jun 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

❌ Failed checks (2 warnings, 1 inconclusive)

Uh oh!

apurvanisal5 commented Jun 7, 2026

Uh oh!

apurvanisal5 commented Jun 8, 2026 • edited by openshift-ci Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

apurvanisal5 commented Jun 8, 2026 • edited by openshift-ci Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

apurvanisal5 commented Jun 8, 2026 • edited by openshift-ci Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

apurvanisal5 commented Jun 8, 2026

Uh oh!

openshift-ci Bot commented Jun 8, 2026

Uh oh!

apurvanisal5 commented Jun 8, 2026

Uh oh!

dusk125 commented Jun 9, 2026

Uh oh!

coderabbitai Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dusk125 commented Jun 9, 2026

Uh oh!

apurvanisal5 commented Jun 10, 2026

Uh oh!

apurvanisal5 commented Jun 10, 2026

Uh oh!

apurvanisal5 commented Jun 10, 2026

Uh oh!

tjungblu commented Jun 10, 2026

Uh oh!

tjungblu Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

tjungblu Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

apurvanisal5 Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

apurvanisal5 commented Jun 11, 2026

Uh oh!

apurvanisal5 commented Jun 11, 2026

Uh oh!

apurvanisal5 commented Jun 12, 2026

Uh oh!

openshift-ci-robot commented Jun 12, 2026 • edited by openshift-ci Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Jira

Verification

Test plan

Summary by CodeRabbit

Uh oh!

apurvanisal5 commented Jun 12, 2026

Uh oh!

openshift-ci-robot commented Jun 12, 2026 • edited by openshift-ci Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

apurvanisal5 commented Jun 12, 2026

apurvanisal5 commented Jun 7, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 7, 2026 •

edited

Loading

apurvanisal5 commented Jun 8, 2026 •

edited by openshift-ci Bot

Loading

apurvanisal5 commented Jun 8, 2026 •

edited by openshift-ci Bot

Loading

apurvanisal5 commented Jun 8, 2026 •

edited by openshift-ci Bot

Loading

coderabbitai Bot commented Jun 9, 2026 •

edited

Loading

openshift-ci-robot commented Jun 12, 2026 •

edited by openshift-ci Bot

Loading

openshift-ci-robot commented Jun 12, 2026 •

edited by openshift-ci Bot

Loading

openshift-ci Bot commented Jun 14, 2026 •

edited

Loading

apurvanisal5 commented Jun 15, 2026 •

edited by openshift-ci Bot

Loading