Layer 01

Collection

Source-specific collectors handle the messy reality of acquiring data. Each collector is responsible for one source — its authentication, rate limiting, parsing, pagination, and checkpointing.

Collectors return partial-success envelopes, not exceptions. A failed source does not bring down the workflow. The orchestration layer can replay, resume, or skip individual sources without affecting the rest of the system.

Layer 02

Normalisation

Raw content from heterogeneous sources is normalised to a canonical schema. Dates, identifiers, jurisdictions, and entity references are resolved to their normalised forms before any downstream processing.

Normalisation is deterministic. Two runs over the same input produce the same output. This is what makes deduplication, change tracking, and replay possible.

Layer 03

Orchestration

The orchestration layer is the spine of the system. It defines the workflow as a state machine: which stages run, in what order, with what inputs, producing what outputs. State is persisted between stages. Failed stages can be retried, skipped, or replayed.

Orchestration is the boundary between deterministic system behaviour and probabilistic agent behaviour. The orchestration layer never calls a language model. It coordinates the agents that do.

Layer 04

Agents

Agents are where judgement happens. Each agent has a single responsibility — classify, extract, draft, score, summarise — and operates against a schema-defined contract. Inputs are typed. Outputs are validated. Behaviour is testable.

The agent layer is the only place where probabilistic behaviour is permitted. Every other layer treats agents as components with defined contracts, not as the system itself.

Layer 05

Review & Approval

Human review and approval gates are first-class components of the architecture, not afterthoughts. Workflow stages can be configured to require human sign-off before downstream processing continues. Reviewers see the system context, not raw model output.

For sensitive workflows, multiple gates can be inserted. The audit trail captures every approval, decision, and override.

Layer 06

Delivery

The delivery layer handles outputs: written briefs, dashboard updates, API responses, notifications, file generation. Delivery is decoupled from processing — the same workflow output can be delivered through multiple channels without changing the upstream architecture.

This separation is what makes the system extensible. New delivery channels can be added without modifying the rest of the architecture.

Why this architecture matters.

Three operating principles distinguish Vellum systems from typical AI deployments.

Deterministic where it matters

Orchestration, state, routing, and validation are deterministic. Models are used for judgement, not coordination. This is what makes the system reliable.

Schema-driven contracts

Every interface between layers is a schema. Inputs and outputs are validated. Behaviour can be tested. Failures stay local.

Auditable by design

Every action — collection, processing, decision, approval, delivery — is logged. The audit trail is the record of what the system did and why.