Multi-Agent Orchestration¶

Agent Smith coordinates multiple specialized AI skills to analyze, plan, review, and synthesize results. Each skill has typed inputs, typed outputs, and a role that's assigned per ticket by the triage step.

The model has two layers: roles (what a skill does in a given run) and phases (when in the pipeline a role acts).

Roles¶

Role	What it does	Output	Veto?
`Lead`	Sets the plan downstream skills compare against. One per phase.	`plan` (typed observations)	No
`Analyst`	Contributes perspective. No veto power.	`list` of observations	No
`Reviewer`	Compares actual code/diff against the plan. Evidence-required.	`list` of observations	No
`Filter`	Reduces a finding list (drops duplicates/false positives) or synthesizes a final artifact.	`list` or `artifact`	No

A single skill may declare multiple supported roles (roles_supported: [lead, analyst, reviewer]); triage picks one role per phase based on activation criteria.

Phases¶

Structured pipelines (fix-bug, add-feature, security-scan, api-security-scan) declare three phases.

Phase	Round #	Typical roles	What happens
`Plan`	1	Lead, Analysts	Lead emits a plan; analysts contribute perspective.
`AgenticStep`	—	(no triage roles)	Developer agent writes code following the plan. Only in `fix-bug` / `add-feature`.
`Review`	2	Lead (sometimes), Reviewers	Reviewers compare diff against the plan via `{{plan}}` template token.
`Final`	3	Filter	Reduces or synthesizes the run's output.

graph LR
    Triage --> Plan
    Plan --> AgenticStep
    AgenticStep --> Review
    Review --> Final
    Final --> Output

    style Triage fill:#4a4a4a,color:#fff
    style Plan fill:#27ae60,color:#fff
    style Review fill:#2980b9,color:#fff
    style Final fill:#c0392b,color:#fff

For security-scan and api-security-scan the AgenticStep is omitted — they read-only-scan, so phases run back-to-back.

For legal-analysis, mad-discussion, init-project, skill-manager, and autonomous, triage falls back to the legacy LLM strategy that picks Lead + Participants. Phases don't apply; the run is one open round driven by ConvergenceCheck.

Plan artifact threading¶

After the Plan phase, the Lead's observations are stored in PipelineContext as a PlanArtifact. Review-phase skills with a {{plan}} placeholder in their ## as_reviewer body get it substituted at prompt-build time. Reviewers without a same-run lead see (no plan provided) and run as generic reviewers.

Confidence threshold¶

Every observation carries a Confidence (0–100) and Blocking flag. Observations with Blocking=true and Confidence<70 are auto-downgraded to Blocking=false with a structured log entry. The high-confidence threshold prevents speculation from breaking the pipeline; low-confidence concerns still surface in the final report but don't gate.

Filter mode¶

Filter skills execute as a separate FilterRoundCommand (not a SkillRoundCommand). The output mode is read from output_contract.output_type[Filter]:

List → the LLM returns a reduced JSON observation list; the framework replaces the in-context observation list with the reduced one (IDs reassigned).
Artifact → the LLM returns synthesized text; the framework stores it under SkillOutputs[skillName] for downstream consumption (final report, deliver step).

Unlike the legacy Gate role, Filter has no veto. Reductions and syntheses are observable and downstream pipeline steps continue regardless.

Skill contract¶

Skills declare their roles, activation criteria, and output contract in SKILL.md frontmatter. See the skills.md reference and the migration guide for the full schema and a before/after example.

The legacy agentsmith.md ## orchestration section, the OrchestrationRole enum (Lead/Contributor/Gate/Executor), and the deterministic SkillGraphBuilder are all retired in p0111c. The current pipeline order is decided per ticket by the LLM-driven triage step, not by topological sort over skill metadata.

Pipelines using this pattern¶

Pipeline	Triage strategy	Phases
`fix-bug`, `add-feature`, `fix-no-test`	Structured	Plan → AgenticStep → Review → Final
`security-scan`	Structured	Plan → Review → Final
`api-security-scan`	Structured	Plan → Review → Final
`legal-analysis`, `mad-discussion`	Legacy (Discussion)	Single open round
`init-project`, `skill-manager`, `autonomous`	Legacy (Discussion)	Single open round