[copilot-cli-research] Copilot CLI Deep Research - 2026-05-15 #32287
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Copilot CLI Deep Research Agent. A newer discussion is available at Discussion #32544. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Analysis Date: 2026-05-15
Repository: github/gh-aw
Scope: 99 Copilot workflows out of 494 total (20%)
📊 Executive Summary
Research Topic: Copilot CLI Optimization Opportunities
Key Findings: 8 missed opportunities identified across security, performance, and developer experience
Primary Recommendation: Enable
max-continuationsfor complex multi-step workflows — currently 0% adoption despite being a Copilot-exclusive featureThis analysis examined 99 workflows using
engine: copilotagainst all available Copilot CLI capabilities. The workflows demonstrate strong adoption of safe-outputs (86%), GitHub MCP tools (62%), and cache-memory (32%). However, several powerful features remain entirely unused:max-continuationshas 0% adoption,engine.baremode is never used, and only 2 workflows leverage custom agent files. Model selection is also largely unconfigured (84% use defaults), missing cost/performance optimization opportunities.The most pressing gap is network security: 52 of 99 Copilot workflows (53%) have no
network.allowedconfiguration, and only 12 (12%) use the AWF sandbox. This represents a significant security hardening opportunity across the repository.Critical Findings
🔴 High Priority Issues
1. Missing Network Restrictions (53% of workflows)
52 out of 99 Copilot workflows have no
network.allowedconfiguration. Without network restrictions, the agent can reach arbitrary external hosts during execution.Affected examples: most
daily-*report workflows,stale-*cleanup workflows,sub-issue-closer.mdFix:
2. AWF Sandbox Only 12% Adoption
Only 12 workflows use
sandbox: agent: awfdespite the firewall providing meaningful isolation against prompt injection and data exfiltration.🟡 Medium Priority Opportunities
1.
max-continuations— 0% usageThis is a Copilot-exclusive feature (no other engine supports it). It enables autopilot mode for long multi-step tasks with
--max-autopilot-continues. Not a single workflow uses it.2. Model not specified — 84% of workflows
83 of 99 workflows use the default model. Workflows like daily reports, summaries, and simple triage could use
model: smallfor significant cost reduction.3.
engine.bare— 0% usageNo workflow uses bare mode, which disables automatic context loading (memory files, copilot-instructions.md). Read-only analysis workflows loading unnecessary context waste tokens every run.
View Full Analysis
1️⃣ Current State Analysis
View Copilot CLI Capabilities Inventory
Copilot CLI Capabilities Inventory
engine.agent.github/agents/*.agent.mdengine.versionlatestengine.modelsmall,large, or specific modelengine.argsengine.envengine.api-targetengine.bare/bare: truemax-continuationssandbox.agent: awfnetwork.allowedtools.github.toolsetstools.cache-memorytools.repo-memoryengine.env.COPILOT_PROVIDER_*strict: true/falseView Usage Statistics
Usage Statistics (99 Copilot workflows)
safe-outputstimeout-minutessettools.githubMCPnetwork.allowedtools.cache-memoryengine.modelspecifiedtools.repo-memorysandbox.agent: awfengine.envengine.argsstrict: falseengine.agentengine.versionpinnedmax-continuationsengine.bare2️⃣ Feature Usage Matrix
3️⃣ Missed Opportunities
View High Priority Opportunities
🔴 High Priority
Opportunity 1: Network Restriction (53% of workflows unprotected)
network.allowedblockdaily-issues-report.md,stale-pr-cleanup.md,sub-issue-closer.md,draft-pr-cleanup.md, and 48 morenetwork: allowed: [defaults]as a minimum baselineOpportunity 2: AWF Sandbox Adoption (12% → target 50%+)
View Medium Priority Opportunities
🟡 Medium Priority
Opportunity 3:
max-continuations— 0% Usage (Copilot-Exclusive Feature)max-continuationsenables autopilot mode, letting Copilot CLI chain multiple runs automatically for long tasksdaily-compiler-quality.md,dead-code-remover.md,code-scanning-fixer.mdcould benefit from chained execution without manual re-triggeringOpportunity 4: Model Selection — 84% Using Defaults
model: smallcosts less for simple tasksdaily-issues-report.md,stale-pr-cleanup.md,weekly-issue-summary.md,auto-triage-issues.md, and 79 moremodel: smallfor read-only analysis and reporting; keepmodel: largefor code changesOpportunity 5:
engine.bareMode — 0% Usagebare: true) disables automatic loading ofAGENTS.md,copilot-instructions.md, and memory filesdaily-issues-report.md,api-consumption-report.md,daily-performance-summary.md, all pure-read report workflowsOpportunity 6:
engine.agentCustom Agents — 2% Usage.github/agents/provide workflow-specific system promptsdaily-agent-of-the-day-blog-writer.md,weekly-blog-post-writer.md) use theawfagent; specialized agents could improve output quality for domain-specific workflowspr-code-quality-reviewer.md,architecture-guardian.md,security-compliance.mdwould benefit from expert persona agents.github/agents/security-reviewer.agent.mdwith specialized persona, then reference:View Low Priority Opportunities
🟢 Low Priority
Opportunity 7: Version Pinning — 0% Usage
engine.version)engine: id: copilot+version: "0.0.422"Opportunity 8: Over-broad GitHub MCP Toolsets
toolsets: [default]which includes all GitHub API tools; many could use narrower setstoolsets: [issues]instead of[default]toolsets: [issues]instead oftoolsets: [default]for read-only workflows4️⃣ Specific Workflow Recommendations
View Workflow-Specific Recommendations
daily-issues-report.mdmodel: small, addnetwork: allowed: [defaults], considerbare: truedead-code-remover.mdmax-continuationsmax-continuations: 2to allow multi-pass analysispr-code-quality-reviewer.md.github/agents/code-reviewer.agent.mdwith review-focused persona; addengine.agent: code-reviewerauto-triage-issues.mdmodel: small(reads issues, simple classification), add network restriction5️⃣ Trends & Insights
View Historical Trends
This is the first comprehensive analysis. Key baseline metrics established for future comparison:
max-continuations: 0% → watch for adoption as teams learn about autopilot modeengine.bare: 0% → new feature, expect gradual adoptionFuture research should track whether recommendations from this analysis are implemented and measure trend lines for each metric.
6️⃣ Best Practice Guidelines
Based on this research, here are recommended best practices:
Always set
network.allowed: Evenallowed: [defaults]is significantly better than open access. Make it a required field in workflow reviews.Use
model: smallfor non-coding tasks: Daily reports, issue summaries, triage workflows, and read-only analysis rarely need the full model. This alone could cut token costs for ~50+ workflows.Adopt
max-continuationsfor long-running tasks: Complex workflows that currently time out or need re-triggering are ideal candidates. Start withmax-continuations: 2and monitor.Create domain-specific agent files: The
.github/agents/directory is underutilized. Creating specialized agents (security-reviewer, code-quality, documentation-writer) could significantly improve output quality.Use
engine.barefor pure analysis workflows: Any workflow that reads data and reports without needing repo-specific instructions should setbare: trueto skip context loading.7️⃣ Action Items
Immediate Actions (this week):
network: allowed: [defaults]to the 52 workflows without any network configmodel: smallShort-term (this month):
max-continuations: 2ondead-code-remover.md,code-scanning-fixer.md,repository-quality-improver.mdbare: trueto 10+ pure read/report workflowsLong-term (this quarter):
toolsets: [default]usages and narrow to minimum required toolsetsView Supporting Evidence & Methodology
📚 References
docs/src/content/docs/reference/engines.mdpkg/workflow/copilot_engine_execution.gopkg/workflow/copilot_engine_tools.gopkg/workflow/copilot_mcp.goResearch Methodology
.mdfiles in.github/workflows/engine: copilotGenerated by Copilot CLI Deep Research (Run: §25901095924)
Beta Was this translation helpful? Give feedback.
All reactions