serverless-migration: failure reporting flow + Genie Code install docs #72
Open
rrf-DB wants to merge 1 commit into
Two improvements driven by field gaps:

1. Auto-file GitHub issue on unmigratable workloads. The Failure Reporting Protocol now generates a pre-filled issues/new URL (URL-encoded title + body) and an optional gh issue create command alongside the local report path. Decision tree simplified: offer to file for ANY unmigratable case, not just unknown patterns. MUST-NOT-CONTAIN list expanded to cover internal codenames, Slack IDs, go/ links, PROD-* tickets, gdoc IDs. New defensive redaction checklist runs before write.

2. Document Genie Code Agent-mode install. New "Where to Run This Skill" section in SKILL.md and references/install-in-databricks-genie-code.md covering all three install methods (notebook installer, shell installer, workspace-wide Databricks App), per-user vs workspace-wide paths, and the serverless-no-CLI caveat specific to this skill.

Validated by the local migration test bench: 16/16 cases pass (10 existing + 6 new purpose-built unmigratable workloads exercising hbase / custom-jar / py4j / sparkr / cassandra / unknown-pattern paths). Failure-report score 1.0 on every unmigratable case; zero PII leakage in the offered issue bodies.

Co-authored-by: Isaac
Summary
Two improvements to the `databricks-serverless-migration` skill, plus the repo-level glue they need.

1. Auto-file GitHub issue when migration can't complete. The Failure Reporting Protocol now produces a pre-filled `issues/new` URL (URL-encoded title + body) and an optional `gh issue create` command. The agent presents both alongside the local report path and never auto-submits. The MUST-NOT-CONTAIN list is extended (codenames, Slack IDs, `go/` links, internal ticket IDs, gdoc IDs) and a defensive redaction checklist runs before write.
2. Document Genie Code Agent-mode install. A new reference doc covers the three install methods (notebook installer, shell installer, app-based workspace-wide deploy), the per-user vs workspace-wide paths, and the important serverless-no-CLI caveat for this specific skill. `SKILL.md` now surfaces the option at the top and instructs the agent to mention it after each migration analysis.
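
The pre-filled URL and `gh` command described in item 1 could be assembled roughly as follows. This is a minimal sketch, not the skill's actual recipe (that lives in `references/failure-reporting.md`): the function names, the repository slug `example-org/example-repo`, and the sample title and body are all hypothetical. It relies on GitHub's `title`, `body`, and `template` query parameters for `issues/new` and only builds strings; nothing is submitted.

```python
import shlex
from urllib.parse import quote, urlencode


def build_issue_url(repo, title, body, template=None):
    """Return a pre-filled GitHub issues/new URL (hypothetical helper).

    `repo` is an "owner/name" slug. Title and body are percent-encoded so
    spaces, newlines, and punctuation survive the round trip.
    """
    params = {"title": title, "body": body}
    if template:
        params["template"] = template
    # quote_via=quote encodes spaces as %20 instead of "+".
    query = urlencode(params, quote_via=quote)
    return f"https://github.com/{repo}/issues/new?{query}"


def build_gh_command(repo, title, body):
    """Return a shell-safe `gh issue create` command line as a string."""
    args = ["gh", "issue", "create", "--repo", repo,
            "--title", title, "--body", body]
    return shlex.join(args)


if __name__ == "__main__":
    # Illustrative values only; the error category name is an assumption.
    repo = "example-org/example-repo"
    title = "Unmigratable: hbase connector"
    body = "Error category: unsupported_connector\nReport: ./report.md"
    print(build_issue_url(repo, title, body, template="migration-feedback.md"))
    print(build_gh_command(repo, title, body))
```

Both outputs are meant to be shown to the user next to the local report path, which keeps the decision to file with the person rather than the agent.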
Changes
Skill (`skills/databricks-serverless-migration/`)

- `SKILL.md`: extended Failure Reporting Protocol (simpler decision tree: offer for ANY unmigratable case, required output template with pre-filled URL + `gh` command); new "Where to Run This Skill" section; reference links updated
- `references/failure-reporting.md` (new): redaction checklist, URL-encoding recipe, worked example, `gh` CLI alternative
- `references/install-in-databricks-genie-code.md` (new): install methods, paths, CLI caveat, follow-up Slack channels

Repo root

- `.github/ISSUE_TEMPLATE/migration-feedback.md` (new): matches the schema the skill emits
- `README.md`: list `databricks-serverless-migration` under Available Skills
- `manifest.json`: adds the two new reference files; top-level `updated_at` refreshed

Test plan
Validated by the local migration test bench: 16/16 cases pass (10 existing + 6 new purpose-built unmigratable workloads exercising hbase / custom-jar / py4j / sparkr / cassandra / unknown-pattern paths). Failure-report score 1.0 on every unmigratable case; zero PII leakage in the offered issue bodies.
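
The zero-leakage result depends on scanning only the offered issue body for the MUST-NOT-CONTAIN categories. A hard-fail scan of that kind might look like the sketch below. The regular expressions are assumptions made for illustration (for example, that internal tickets look like `PROD-` followed by digits and that Slack IDs start with `U`, `C`, or `W`), the codename list is supplied by the caller, and none of the names come from the test bench itself.

```python
import re

# Hypothetical patterns; the skill's real MUST-NOT-CONTAIN list may differ.
LEAK_PATTERNS = {
    "ticket": re.compile(r"\bPROD-\d+\b"),
    # "go/" short links, but not path segments such as ".../go/x".
    "go-link": re.compile(r"(?<![\w/.-])go/[\w-]+"),
    # Slack-style IDs: prefix letter plus 8-10 uppercase/digit characters,
    # at least one digit so plain uppercase words are not flagged.
    "slack-id": re.compile(r"\b[UCW](?=[A-Z0-9]*\d)[A-Z0-9]{8,10}\b"),
    "gdoc": re.compile(r"docs\.google\.com/\S+"),
}


def find_leaks(offered_body, codenames=()):
    """Return (category, matched_text) pairs found in the offered body."""
    leaks = []
    for label, pattern in LEAK_PATTERNS.items():
        for match in pattern.finditer(offered_body):
            leaks.append((label, match.group(0)))
    for name in codenames:
        if re.search(rf"\b{re.escape(name)}\b", offered_body, re.IGNORECASE):
            leaks.append(("codename", name))
    return leaks


def check_offered_body(offered_body, codenames=()):
    """Hard-fail: raise ValueError if any forbidden pattern is present."""
    leaks = find_leaks(offered_body, codenames)
    if leaks:
        raise ValueError(f"offered issue body leaks internal data: {leaks}")
```

Passing only the offered text to `check_offered_body`, rather than the whole transcript, mirrors the scoping described here: analysis chatter is not part of what the user would file.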
- `synthetic-hbase-connector`, `synthetic-custom-jar-datasource`, `synthetic-py4j-jvm-reflection`, `synthetic-r-sparkr`, `synthetic-cassandra-connector`, `synthetic-unknown-future-api`. Each expects `should_trigger: true` with a specific `expected_error_category`.
- `referenced_report_path`, `offered_prefilled_url`, `offered_gh_cli`, `expected_error_category_mentioned`, and a hard-fail check for PII leakage in the offered body (scoped to the offered text, not analysis chatter).

Reviewer checklist
- `python3 scripts/skills.py validate` passes cleanly
- `template=migration-feedback.md` parameter resolves once this PR merges)

Notes
The `compatibility: Requires databricks CLI` declaration is unchanged. The new Genie Code install doc explicitly flags this as a known limitation when running inside a serverless Databricks workspace and offers three workarounds.

This pull request and its description were written by Isaac.