Fit assessment
A focused review of the workload, the existing infrastructure, and the decision boundary between Swarm and a bounded workflow runner. Output: a written recommendation, not a deck.
Division Swarm
Division Swarm runs long-running AI systems at fleet scale. Composable flows, per-entity state, forkable and audited end to end. Safe to deploy, engineered to stay under your control.
Why Division Swarm
You shape the flow, not the LLM. Strict gates validate every transition. Hallucinations stop at the gate.
Cost ceilings per entity. Schema-validated events. Tenant isolation. Static checks before deploy.
Thousands of concurrent agent teams. Each with its own durable state, surviving crashes and restarts.
Flows nest into systems of any depth. Sub-flows materialize per entity on demand.
Every event, every decision, every dollar traced end to end.
Replay any past timeline exactly. Test contract changes against real production data.
The model
Work is an entity moving through a state machine you declared: named states, guarded transitions, gates that must clear before the next step. Every transition (data writes, emitted events, state change) commits in one atomic transaction.
Agents reason inside scoped sessions and emit events. Deterministic code routes them. Nothing happens because an LLM said so. The runtime decides what each agent result actually changes.
State survives crashes, restarts, and weeks of waiting on a timer or a human. Hundreds of entities advance concurrently in isolated workspaces. Any run can be replayed turn by turn or forked from the log.
A handler declaration
ticket-orchestrator:
event_handlers:
ticket.validated:
guard:
check: "entity.priority in ['high', 'urgent']"
on_fail: discard
data_accumulation:
writes: [order_summary, validation_context]
sets_gate: g1_validation
advances_to: processing
emit: work.assigned Runtime architecture
Composition
A flow is a complete, importable package: state machine, system nodes, agents, tools, policy. Its boundary is declared by typed input and output pins, so composition is a declaration, not a refactor.
A parent flow imports children and wires their pins. The static analyzer verifies every connection at boot. Hundreds of flows run as one coordinated system on a single deployment, with the same durable persistence, isolation, and audit trail as a single flow.
Build a flow once; reuse it across teams, projects, and systems. Composition is how a small contract becomes a platform without becoming a monolith.
Rewind anything
Because every event and every state change is persisted, any run can be rewound to any point in its history. Change one thing (a model, a prompt, a policy value, a tool's response) and re-execute the counterfactual. Compare it against the original, turn by turn.
The cost of iterating on a system drops to the cost of forking a run. Debugging stops being archaeology. And when a regulator or an auditor asks what happened, you don't show them logs; you show them the run.
// one variable changed at the fork point: every downstream turn re-executed
Where it fits
Most agent frameworks aim at bounded workflows: a code-review pipeline, a research-then-synthesize chain, a task that runs and finishes. If that's your problem, use one.
Division Swarm is for the tier above: systems that run continuously, hold state across crashes, coordinate dozens of agents over days, and have to be auditable and replayable end to end. When your multi-agent system needs to be a durable system of record, not a workflow invocation, that's where Division Swarm lives.
Integration services
For teams running Division Swarm in regulated or high-stakes contexts, the maintainer's team is available for direct engineering partnership. Engagements are scoped, time-boxed, and led by the people who designed the runtime:not a salesforce.
A focused review of the workload, the existing infrastructure, and the decision boundary between Swarm and a bounded workflow runner. Output: a written recommendation, not a deck.
Contract design and implementation for the business processes the team intends to run on Swarm. Includes the contract bundle, a passing conformance plan, and the operator runbook.
Postgres deployment topology, observability wiring, identity and secret-manager integration, and a staged rollout from a single flow to a multi-flow surface.
Onboarding for the engineers who will own the system day-to-day: state-machine authoring, analyzer-driven iteration, replay and fork in operations, and the audit story end to end.
Direct engagement
Direct introduction to the maintainer's team. Replies within two business days.
Engineering rigor
Division Swarm doesn't ask for trust. Every claim in this page is bound to a single machine-readable platform specification, enforced by a static analyzer that runs before the runtime can boot, and proved against a conformance suite the maintainer doesn't pay an LLM bill to execute.
Boot a flow with a half-finished contract and the analyzer refuses to start the runtime. Change a payload field on one event and 46 structural checks ripple across the bundle until every consumer is reconciled. Open the repository and the same checks that run in CI run on your laptop in under a second.
Output of the static analyzer against an example bundle. Reproducible locally; no runtime, no database, no LLM credential required.
# install
go install github.com/division-sh/swarm/cmd/swarm@latest
# verify and run your own bundle (see docs.division.sh/quickstart)
swarm verify --contracts ./my-flow
swarm run --contracts ./my-flow --event order.created --payload ./payload.json Releases, new flows, and occasional notes on production multi-agent patterns. No spam.