Skip to content

Commit b046e56

Browse files
committed
chore: update version to 2.0.3 and modify descriptions in package files
- Bump version from 2.0.1 to 2.0.3 in package.json and package-lock.json - Update description in package.json for clarity on functionality - Remove outdated plan.md file and create a new publication plan document - Adjust Content Security Policy in cockpitWebviewDocument.ts for improved security - Revise localized strings in cockpitWebviewStrings.ts for better accuracy and clarity - Update i18n messages for consistency with new functionality and terminology
1 parent a1e691c commit b046e56

11 files changed

Lines changed: 152 additions & 10556 deletions

.vscodeignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,7 @@ AGENTS.md
3434
compile.log
3535
compile*.log
3636
patch.diff
37+
diff_output.txt
3738
plan.md
3839
test.js
3940
test.txt
File renamed without changes.

README.md

Lines changed: 37 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -18,19 +18,21 @@
1818

1919
<!-- markdownlint-enable MD033 MD041 -->
2020

21-
Copilot Cockpit helps you plan AI work, approve it, and then run it with visible checkpoints instead of handing your repo to a blind autonomous loop.
21+
Copilot Cockpit is one workflow system for AI work in VS Code: planning and triage, execution and scheduling, and optional tool/control-plane integration.
2222

2323
## Why It Exists
2424

25-
Most AI automation demos jump straight to execution. That looks impressive until the model burns tokens, edits files too early, or drifts away from the real goal.
25+
Most AI tooling still reads like feature accumulation: a todo board here, a scheduler there, a tool bridge somewhere else. Copilot Cockpit is meant to feel like one coherent stack.
2626

27-
Copilot Cockpit separates planning, approval, execution, and review so autonomy is earned step by step instead of assumed from the start.
27+
It connects three tightly linked layers:
2828

29-
In practice, the LLM is the native execution chat surface, while Copilot Cockpit is the orchestration and review layer around it. It gives a human or AI decision-maker a place to capture work, review it, research it, discuss it, and hand it off safely.
29+
1. Planning and triage in `Todo Cockpit`
30+
2. Execution and scheduling through `Tasks` and `Jobs`
31+
3. Tool and control-plane integration through `Research`, `MCP`, and optional agent surfaces
3032

31-
The point is not to reject automation. The point is to make automation accountable and keep humans in charge until the workflow has proven it deserves more autonomy.
33+
The workflow is explicit on purpose. A `Todo` is the planning artifact. A `Task` is one executable unit. A `Job` is an orchestrated or scheduled run built from steps. `Research` is an exploratory context-building artifact when the system still needs evidence before execution.
3234

33-
That matters most when the repo keeps producing more work than any one person can hold in memory: bugs, feature ideas, follow-up changes, security updates, web findings, pricing checks, customer tasks, or research that should turn into implementation later. Copilot Cockpit turns those discoveries into a visible queue so work can be found again and handled properly instead of getting lost between chat sessions.
35+
That structure keeps the LLM as the native execution chat surface while Copilot Cockpit provides the approval, scheduling, and control layer around it. The goal is not less automation. The goal is accountable automation that can move from intake to execution without losing review, context, or ownership.
3436

3537
## 🎬 Demo
3638

@@ -42,15 +44,15 @@ For the step-by-step walkthrough, open [docs/feature-tour.md](docs/feature-tour.
4244

4345
## 🧠 The Core Loop
4446

45-
Think of Copilot Cockpit as a local control system for structured AI work:
47+
The recommended default path is simple:
4648

47-
1. Capture and discuss work in `Todo Cockpit`.
48-
2. Research and refine it until the user is happy with the direction.
49-
3. Move approved work into `ready`.
50-
4. Turn that work into a `Task`, `Job`, or `Research` run.
51-
5. Review outcomes before granting more autonomy.
49+
1. Start with a `Todo` in `Todo Cockpit` for intake, planning, and triage.
50+
2. Use `Research` only when context is missing or the direction still needs evidence.
51+
3. Promote approved work into a `Task` for one executable unit or a `Job` for an orchestrated run.
52+
4. Review the result before granting more autonomy or scheduling the next cycle.
53+
5. Add `MCP`, repo-local skills, or agent/control-plane features only when the core loop is already working.
5254

53-
This keeps the relationship collaborative: you work with the LLM, not under a black-box agent that guesses what should happen next.
55+
This keeps the relationship collaborative: the workflow starts with planning, earns execution, and only then extends into higher-autonomy integrations.
5456

5557
## Built On VS Code And GitHub Copilot
5658

@@ -95,29 +97,37 @@ The optional layer stays practical because responsibilities are split deliberate
9597

9698
Bundled-agent sync is manual by design. Repo-local agent systems are user-owned, so Copilot Cockpit only offers the starter pack as an optional baseline and does not overwrite customized workspace copies during sync. For the deeper operating model, see [docs/agent-workflow.md](docs/agent-workflow.md).
9799

98-
## ✨ Feature Tour
100+
## ✨ Workflow Layers
101+
102+
### Stable workflow primitives
103+
104+
These are the default path and the main product surface.
99105

100106
### Todo Cockpit
101107

102-
`Todo Cockpit` is the planning and approval hub. Use it to capture work, add comments, apply labels and flags, and hand approved work into execution.
108+
`Todo Cockpit` is the planning and triage layer. A `Todo` stays a planning artifact: capture work, add comments, apply labels and workflow flags, and decide what should happen next.
103109

104110
### Tasks
105111

106-
`Tasks` are the simplest execution unit: one prompt, one scheduled action, one concrete piece of work. Use them for one-time runs or recurring execution.
112+
`Tasks` are the simplest execution layer artifact: one executable unit, one prompt, one scheduled action, one concrete piece of work. Use them for one-time runs or recurring execution.
107113

108114
That includes recurring tasks such as security research, market checks, feature scouting, maintenance prompts, prompt refinement, repo upkeep, or any other repeated work that should run on a schedule and return to review.
109115

110116
### Jobs
111117

112-
`Jobs` are ordered multi-step workflows built from multiple steps with reusable actions and pause checkpoints. Use them when the work should not run as one uninterrupted chain.
118+
`Jobs` are the orchestration layer inside execution: ordered multi-step workflows with reusable actions and pause checkpoints. Use them when work should not run as one uninterrupted chain.
113119

114120
Think of `Jobs` as deeper agentic workflows inside VS Code: research, decision support, implementation steps, maintenance steps, MCP calls, or external-tool sequences that should be inspected at explicit checkpoints instead of left to one opaque run.
115121

116122
### Research
117123

118-
`Research` profiles are bounded benchmark-driven iteration loops. Use them when you need repeated attempts at improvement against a metric instead of one direct execution.
124+
`Research` is the exploratory context-building layer. Use it when the system is missing context, needs outside evidence, or should iterate against a benchmark before you decide on execution.
125+
126+
Research is especially useful when work should pull in fresher outside knowledge first, through web search, Perplexity, scrapers, or other tooling, and then return that material for user review before implementation begins.
119127

120-
Research is especially useful when the work should pull in fresher outside knowledge first, through web search, Perplexity, scrapers, or other tooling, and then return that material for user review before implementation begins.
128+
### Experimental and advanced playground capabilities
129+
130+
These capabilities stay discoverable, but they are not required for the default path.
121131

122132
### Model And Agent Choice
123133

@@ -127,11 +137,11 @@ That also creates a control layer for cost: GitHub Copilot or OpenRouter can use
127137

128138
### Settings
129139

130-
`Settings` configure workspace defaults, integrations, storage mode, and execution preferences so the cockpit matches the repo you are operating in. They are also where you can optionally incorporate repo-local agents or the bundled starter pack when you want an orchestration layer that lets the CEO or orchestrator stay focused on routing, planning, and validation while specialists handle bounded work. See [docs/agent-workflow.md](docs/agent-workflow.md) for the operating model.
140+
`Settings` configure workspace defaults, integrations, storage mode, and execution preferences so the cockpit matches the repo you are operating in. They are also where you can optionally incorporate repo-local agents or the bundled starter pack when you want an extra control-plane layer that lets the orchestrator stay focused on routing, planning, and validation while specialists handle bounded work. See [docs/agent-workflow.md](docs/agent-workflow.md) for the operating model.
131141

132142
### How To Use
133143

134-
`How To Use` is the built-in onboarding tab. Start there if you want a guided explanation of the operating model before you schedule anything. You can also launch the same walkthrough from the top bar with `Intro Tutorial`, then use the top-bar `Plan Integration` button to inspect existing repo-local agent surfaces before you approve any manual bundled-agent sync. That optional agent layer is useful when you want the orchestrator to receive the task, do the initial repo framing, delegate bounded work to the right repo-local or starter-pack specialist, and then validate the returned result instead of carrying the full implementation loop in one long chat.
144+
`How To Use` is the built-in onboarding tab. Start there if you want the recommended path explained in order: `Todo` first, `Research` when context is missing, `Task` or `Job` for execution, then optional control-plane integration after the core loop is working.
135145

136146
## Common Workflows
137147

@@ -189,10 +199,10 @@ The point is not to overclaim autonomy. The point is to show recurring, inspecta
189199

190200
1. Open Copilot Cockpit from the activity bar or run `Copilot Cockpit: Create Scheduled Prompt (GUI)` from the command palette. Or use the todo-list icon in the top right.
191201
2. Start in `How To Use` if you are new to the extension, or click the top-bar `Intro Tutorial` button for the same guided walkthrough.
192-
3. Capture or refine work in `Todo Cockpit`.
193-
4. Move approved work into `ready` to prepare a task draft.
194-
5. Use `Tasks` for one execution unit, `Jobs` for multi-step flows, and `Research` for benchmark-driven iteration.
195-
6. Open `Settings` to configure repo-local defaults, MCP, Copilot skills, and Codex support files.
202+
3. Capture or refine work in `Todo Cockpit` until the planning artifact is clear.
203+
4. Use `Research` if the work still needs exploratory context or outside evidence.
204+
5. Move approved work into `ready`, then promote it into a `Task` for one executable unit or a `Job` for an orchestrated run.
205+
6. Open `Settings` to configure repo-local defaults. Add `MCP`, Copilot skills, starter agents, or other control-plane features only when you want those optional extensions.
196206

197207
The `Settings` tab also lets you manually sync bundled starter agents into `.github/agents` when you want that optional specialist layer. This is useful when the orchestrator should hand work to the right specialist instead of stuffing planning, execution, and review into one long chat. Treat any existing repo-local agent setup as user-owned first. Only approve a sync when you want it, back up `.github` first when it already exists, and keep in mind that customized workspace copies are skipped so your repo-specific agent edits are not overwritten.
198208

@@ -219,6 +229,8 @@ Detailed documentation lives under [docs/index.md](docs/index.md).
219229

220230
## Advanced Capabilities
221231

232+
These extend the core workflow. They are optional and should not be mandatory for onboarding.
233+
222234
- `MCP` gives AI agents a controlled tool surface to use the plugin inside the workspace.
223235
- Support for Copilot-first workflows, with experimental Codex integration for repo-local coordination.
224236
- Bundled starter agents can be synced into `.github/agents` as a small default orchestration layer: `CEO`, `Planner`, `Remediation Implementer`, `Documentation Specialist`, `Custom Agent Foundry`, and `Cockpit Todo Expert`. The pattern is optional, keeps the top-level orchestrator cleaner, and is described in [docs/agent-workflow.md](docs/agent-workflow.md).

docs/getting-started.md

Lines changed: 21 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -1,27 +1,33 @@
11
# Getting Started
22

3-
Copilot Cockpit works best when you think of it as a simple loop:
3+
Copilot Cockpit works best when you treat it as one workflow stack with three layers:
44

5-
1. Plan the work.
6-
2. Approve the handoff.
7-
3. Run the right execution unit.
8-
4. Review the result before granting more autonomy.
5+
1. Planning and triage in `Todo Cockpit`
6+
2. Execution and scheduling through `Tasks` and `Jobs`
7+
3. Optional tool/control-plane integration through `Research`, `MCP`, and repo-local agent surfaces
8+
9+
The recommended path is: start with a `Todo`, use `Research` when context is missing, then promote approved work into a `Task` or `Job`.
910

1011
## Quick Start
1112

1213
1. Open Copilot Cockpit from the activity bar or with `Copilot Cockpit: Create Scheduled Prompt (GUI)`.
1314
2. Start in `How To Use` if you are new to the extension, or click the top-bar `Intro Tutorial` button for the same walkthrough.
14-
3. Capture or refine work in `Todo Cockpit`.
15-
4. Move approved work into a task draft when the item is `ready`.
16-
5. Use `Tasks` for one execution unit, `Jobs` for ordered multi-step execution, and `Research` for benchmark-driven iteration.
17-
6. Open `Settings` to configure repo-local defaults and integrations, and use the top-bar `Plan Integration` button when you are ready to inspect or design the repo-local agent setup.
15+
3. Capture or refine work in `Todo Cockpit`. A `Todo` is the planning artifact and intake surface.
16+
4. Use `Research` if the work still needs exploratory context, outside evidence, or benchmarked iteration.
17+
5. Move approved work into `ready`, then promote it into a `Task` for one executable unit or a `Job` for an orchestrated or scheduled run.
18+
6. Open `Settings` to configure repo-local defaults and integrations. Use the top-bar `Plan Integration` button only when you want optional control-plane extensions such as MCP, skills, or starter agents.
19+
20+
## Stable Primitives
21+
22+
- Use `Todo Cockpit` when the work still needs planning, comments, approval, or triage.
23+
- Use `Tasks` when one prompt and one schedule are enough for one executable unit.
24+
- Use `Jobs` when the work needs ordered stages, orchestration, or pause checkpoints.
25+
- Use `Research` when the work needs exploratory context or measured improvement before execution.
1826

19-
## Choose The Right Surface
27+
## Optional Extensions
2028

21-
- Use `Todo Cockpit` when the work still needs planning, comments, or approval.
22-
- Use `Tasks` when one prompt and one schedule are enough.
23-
- Use `Jobs` when the work needs ordered stages or pause checkpoints.
24-
- Use `Research` when the goal is measured improvement against a benchmark.
29+
- Add `MCP`, repo-local skills, or starter agents after the default path is working.
30+
- Treat those capabilities as control-plane extensions, not as mandatory setup for first use.
2531

2632
If you want the tab-by-tab walkthrough, continue to [Feature Tour](./feature-tour.md).
2733

@@ -31,7 +37,7 @@ Skip toy prompts. Start with one recurring loop that would still be worth keepin
3137

3238
- For a small project, use an opportunity scout, a delivery-risk watch, and a knowledge packager, then stop at a review checkpoint.
3339
- For a company team, use the same pattern for product signals, security and release readiness, support queues, or operations follow-up.
34-
- If you also want to show the Research surface, add one benchmarked profile that scores onboarding or prompt quality against a simple command.
40+
- If you also want to show the Research surface, add one benchmarked profile that scores onboarding or prompt quality against a simple command before you promote anything into execution.
3541

3642
That keeps the demo honest: the proof is useful output plus explicit review, not a claim that the system should run unchecked.
3743

docs/workflows.md

Lines changed: 23 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,38 +1,53 @@
11
# Workflows
22

3-
Copilot Cockpit separates planning from execution so users can review work before it runs.
3+
Copilot Cockpit is one workflow stack with three layers: planning and triage, execution and scheduling, and optional tool/control-plane integration.
4+
5+
The default path is: start with a `Todo`, use `Research` when context is missing, then promote approved work into a `Task` or `Job`.
46

57
## Todo Cockpit
68

7-
- Repo-local planning, communication, approval, and handoff surface.
9+
- Repo-local planning, intake, communication, approval, and handoff surface.
10+
- A `Todo` is a planning artifact, not an execution artifact.
811
- Cards support comments, due dates, labels, flags, task links, archive review, drag and drop, and filters.
912
- Built-in workflow flags include `new`, `needs-bot-review`, `needs-user-review`, `ready`, `ON-SCHEDULE-LIST`, and `FINAL-USER-CHECK`.
1013
- Saving a todo into `ready` can create or reopen its linked task draft so execution prep stays adjacent to the approval step.
1114

1215
## Tasks
1316

14-
- One scheduled execution unit, either one-time or recurring.
17+
- One executable unit, either one-time or recurring.
18+
- A `Task` is the direct execution artifact for one prompt and one schedule.
1519
- Repo-scoped storage with optional agent and model selection.
1620
- Best for one direct execution step without workflow branching.
1721
- Overdue tasks are reviewed on startup.
1822

1923
## Jobs
2024

2125
- Ordered multi-step workflows with one schedule.
26+
- A `Job` is an orchestrated or scheduled run built from multiple steps.
2227
- Steps can be reused, reordered, paused, edited, and compiled into a bundled task.
2328
- Best when work must pass through explicit checkpoints or pauses.
2429

2530
## Research
2631

27-
- Bounded iteration against a benchmark.
32+
- Exploratory context-building artifact with bounded iteration against a benchmark.
2833
- Profiles define benchmark commands, score extraction, optimization direction, failure limits, and editable path allowlists.
29-
- Best when the goal is improvement over multiple measured runs instead of one direct execution.
34+
- Best when the goal is better context or improvement over multiple measured runs before one direct execution.
35+
36+
## Stable Workflow Primitives
37+
38+
- `Todo`, `Task`, and `Job` are the stable workflow primitives for the default path.
39+
- `Research` supports that path when context is missing before execution begins.
40+
41+
## Experimental And Advanced Playground Capabilities
42+
43+
- `MCP`, repo-local skills, starter agents, and related integrations extend the workflow as optional control-plane capabilities.
44+
- They should stay discoverable, but they are not mandatory for onboarding or first execution.
3045

3146
## Execution Guidance
3247

33-
- Use a todo when the work still needs planning, comments, or approval.
34-
- Use a task when the work is ready for one concrete execution step.
48+
- Use a todo when the work still needs planning, comments, approval, or triage.
49+
- Use research when the work needs exploratory context or evidence before execution.
50+
- Use a task when the work is ready for one concrete executable step.
3551
- Use a job when the work needs multiple ordered steps or pause checkpoints.
36-
- Use research when the work needs repeated benchmark-driven refinement.
3752

3853
[Back to README](../README.md)

0 commit comments

Comments
 (0)