Multi-Skill Architecture¶

For complex tasks, Agent Smith assembles a panel of AI specialists. Depending on the pipeline type, these specialists may debate an approach, execute in a deterministic graph, or follow a lead-then-contribute pattern.

Pipeline Types¶

Agent Smith supports three pipeline types. Each pipeline selects its execution strategy automatically based on the orchestration metadata declared in skills.

Discussion Pipelines¶

Used by: mad-discussion, legal-analysis

Triage -- LLM analyzes the ticket and codebase, selects relevant roles
Skill Rounds -- each role analyzes the problem and states its position
Convergence -- roles discuss via OBJECTION/AGREE rounds until they reach consensus
Execution -- the consolidated plan goes to the agentic loop

Discussion pipelines use free-text accumulation. Each role can AGREE, OBJECT (blocking concern with alternative), or SUGGEST (non-blocking improvement). The discussion ends when all active roles agree, subject to a configurable round limit (default 3).

This is the original multi-skill mode and remains unchanged.

Structured Pipelines¶

Used by: security-scan, api-security-scan

Graph Construction -- SkillGraphBuilder reads orchestration metadata from all skills and produces a deterministic execution graph via topological sort
Stage Execution -- skills execute stage by stage; skills within a stage may run in parallel
Gate Check -- gate skills can veto the pipeline (see Roles below)
Output -- typed JSON handoffs between stages, no free-text accumulation

Structured pipelines skip LLM triage entirely. Each skill receives a single LLM call with typed input/output. The execution order is fully determined by the runs_after, runs_before, and parallel_with declarations in each skill's orchestration metadata.

Hierarchical Pipelines¶

Used by: fix-bug, add-feature

Lead First -- the lead skill runs first and produces a plan/directive
Contributors -- contributor skills run next; the lead's plan is injected into each contributor's context
Gates -- gate skills evaluate the accumulated output and can block the pipeline
Execution -- executor skills act in the world based on the consolidated plan

Like structured pipelines, hierarchical pipelines use SkillGraphBuilder for deterministic ordering and skip LLM triage.

Skill Roles¶

Every skill has a role that determines its behavior in the pipeline. The role is declared in the skill's agentsmith.md orchestration section.

Role	Output Type	Blocking	Behavior
contributor	list	No	Analyzes and produces a JSON list. The default role for skills without an orchestration block.
lead	plan	No	Runs first in hierarchical pipelines. Produces a plan/directive that is injected into all subsequent skills.
gate	verdict or list	Yes	Can block the pipeline. With `output: verdict`, emits true/false. With `output: list`, writes typed `List<Finding>` to `ExtractedFindings`; an empty list stops the pipeline.
executor	artifact	No	Acts in the world (creates files, runs commands). Produces an artifact as output.

Gate Veto Mechanics¶

Gates have two veto mechanisms depending on their output type:

verdict gate -- returns a boolean. false stops the pipeline.
list gate -- writes findings to ExtractedFindings. An empty list means "nothing to act on" and stops the pipeline. When a gate produces findings directly, the ExtractFindingsHandler step is skipped since the gate already populated the data.

Execution Graph¶

For structured and hierarchical pipelines, the SkillGraphBuilder constructs an execution graph:

Reads each skill's runs_after, runs_before, and parallel_with declarations
Performs a topological sort to determine stage ordering
Groups skills into ExecutionStage instances, each labeled with its role (lead, gate, executor, or contributors)
Skills within a stage run in the same phase; the parallel_with declaration allows concurrent execution

The graph is fully deterministic -- the same set of skills always produces the same execution order.

Backward Compatibility¶

Skills without an ## orchestration block in their agentsmith.md default to role: contributor with output: artifact. In discussion pipelines, these skills participate exactly as before: LLM triage selects them, they speak in rounds, and convergence is checked via OBJECTION/AGREE mechanics.

Skill Categories¶

Agent Smith ships with role sets for different domains:

Directory	Roles	Used by	Pipeline Type
`config/skills/coding/`	Architect, Backend Dev, Tester, DBA, Security, DevOps, Frontend, Product Owner	fix-bug, add-feature	hierarchical
`config/skills/security/`	Vulnerability Analyst, Auth Reviewer, Injection Checker, Secrets Detector, False Positive Filter	security-scan	structured
`config/skills/api-security/`	API Design Auditor, Auth Tester, Vulnerability Analyst, False Positive Filter	api-scan	structured
`config/skills/legal/`	Contract Analyst, Compliance Checker, Risk Assessor, Liability Analyst, Clause Negotiator	legal-analysis	discussion
`config/skills/mad/`	Philosopher, Dreamer, Realist, Devil's Advocate, Silencer	mad-discussion	discussion

Discussion Flow¶

Discussion pipelines (mad, legal) run in rounds:

Round 1:
  Architect:  "OBJECT -- this violates our layering rules. Propose: add interface in Contracts."
  DBA:        "AGREE -- schema change is backward compatible."
  Tester:     "SUGGEST -- add integration test for the new endpoint."

Round 2:
  Architect:  "AGREE -- interface added to the plan."
  Tester:     "AGREE -- integration test included."

-> Consensus reached after 2 rounds.
-> Consolidated plan goes to execution.

Each role can:

AGREE -- satisfied with the current plan
OBJECT -- has a blocking concern (must include reasoning and alternative)
SUGGEST -- has a non-blocking improvement

The discussion ends when all active roles agree. A hard limit prevents endless debates (configurable, default 3 rounds).

Project-Specific Skills¶

Projects can override or extend the default roles. Put a skills.yaml in your project's .agentsmith/ directory to select which roles are active:

roles:
  - architect
  - backend-developer
  - tester

This activates only those three roles for the project, ignoring DBA, DevOps, etc.

Custom Roles¶

Create your own role by adding a YAML file to your skills directory:

name: performance-engineer
display_name: "Performance Engineer"
emoji: "⚡"
description: "Evaluates performance impact of proposed changes"

triggers:
  - database
  - api-endpoint
  - caching

rules: |
  You review changes for performance implications.
  Flag N+1 queries, missing indexes, unbounded collections,
  and unnecessary allocations.

convergence_criteria:
  - "No performance concerns above medium severity"