The Echelon Stack

The infrastructure behind the agents.

Anyone can wire a model to a phone number and call it an AI agent. The difference between a demo and a system your business runs on is everything underneath: who routes the work, what the agent is allowed to know, what it's allowed to touch, and whether you can see why it did what it did. This page is the architecture standard every Echelon deployment is built to.

Four layers, closed into a loop. Work flows down. Telemetry flows back up. The system that runs in month six is measurably better than the one that went live on day 90.

The closed loop

Four layers. One loop. No black box.

Every agent we deploy — inbound, outbound, follow-up, quoting, operations — runs on the same four layers. The loop is what makes it infrastructure: what observability catches feeds back into routing, prompts, and playbooks every single week.

01 — Orchestration

Who decides what happens next?

One supervisor sits in front of every conversation. It classifies what just arrived — a new lead, a quote request, a support issue, a callback — and routes it to the specialist agent built for that job. Not one giant prompt guessing at everything: explicit state, deterministic routing, and a human-approval checkpoint anywhere money or your reputation moves.

When an agent qualifies a lead or drafts a quote, "mostly right" isn't acceptable. The supervisor pattern means every hand-off is a decision you can inspect — and the steps that commit your business (sending the quote, issuing the refund, signing the contract) can require a human tap before they fire.

Supervisor routingIntent classificationSpecialist agentsApproval checkpointsExplicit state

02 — Knowledge

What is the agent allowed to know — and say?

Every client gets a private knowledge store: your pricing, your runbooks, your policies, your FAQs. Agents answer from your data — grounded and checked against the source — or they escalate. They don't improvise an answer to fill silence.

This is the difference between a quoting agent that applies your actual pricing logic and a chatbot that makes up a discount. The store is yours, it's isolated, and when your prices change, the agent's answers change the same hour.

Per-client storePricing & policiesRunbooks & FAQsGrounding checksEscalate on unknown

03 — Integration

What can the agent actually touch?

Agents act through the stack you already run — your CRM as the system of record, your phone lines, your calendar, your e-signature and payment tools — through scoped access. Each agent holds keys only to the tools its job requires. The follow-up agent can read your pipeline; it cannot touch your bank.

No rip-and-replace, no "new platform to log into." The work lands where your team already looks for it, and every credential is scoped, auditable, and revocable in one place.

CRM · HubSpot / GHL / SalesforceVoice & SMS · TwilioCalendarsQuotes · e-sign · paymentsLeast-privilege keys

04 — Observability

Why did the agent do that?

Every agent step is logged and traceable — what came in, what the agent decided, which tool it called, what came back. Dashboards track the numbers that matter to you: response time, qualification rate, quote-to-signature, where leads drop off. Alerting catches the failure modes — timeouts, malformed tool calls, retrieval misses — with automatic retries and graceful degradation.

This is where most AI builds die: they demo well and then nobody can see inside them. Ours are built to be watched. If a vendor can't show you the trace of why their agent did something, you're looking at a demo, not infrastructure.

Step-level tracesLive dashboardsFailure-mode alertingRetries & fallbacksWeekly reports

The loop closes

What layer 04 observes feeds back into layer 01's routing, the knowledge store, and your playbooks — every week, as part of the operating partnership.

What a run actually looks like

Every step, on the record.

This is the anatomy of one routine run — a missed call recovered into a booked appointment — the way it appears in the trace log. Eight steps, each one inspectable: what arrived, what the agent decided, which tool it called, what came back.

Ask any AI vendor one question: “Show me the trace of why your agent did that.” If they can't, you're buying a demo.

See the client portal this feeds

agent-trace · follow-up/aria · run 8F42

14:02:11INBOUNDMissed call from (310) ···-4821 → supervisor✓ routed

14:02:12CLASSIFYIntent: new lead · service inquiry · high fit✓ 0.94

14:02:12KNOWLEDGEPulled: service area, pricing rules, booking policy✓ grounded

14:02:13ACTIONSMS sent: greeting + two qualifying questions✓ sent

14:04:37REPLYLead answered both · meets qualification rubric✓ qualified

14:04:39TOOLcalendar.create_booking → Tue 10:30am confirmed✓ 201

14:04:40CRMContact created · transcript attached · owner notified✓ synced

14:04:40TRACE8 steps · 0 retries · full log available in portal✓ done

Voice, done properly

A voice agent is a real-time system, not a chatbot with a phone number.

Phone is where the highest-intent leads show up — and where latency, interruptions, and wrong answers are most expensive. Our voice agents run on the same four layers, engineered as a streaming pipeline so the conversation feels like talking to a sharp employee, not navigating a phone tree.

Caller speaks

Telephony streams audio in real time — no "please wait while we process."

Heard while speaking

Streaming transcription produces words as they're said, so the agent is already reasoning before the sentence ends.

Decide & act

The supervisor classifies the call — new lead, reschedule, question, emergency — pulls what it needs from the knowledge layer, and acts through the integration layer: qualifies, answers, books.

Respond naturally

Synthesized speech streams back with interruption handling — a caller can talk over the agent and it adjusts, like a person would.

Land the outcome

The appointment hits your calendar, the lead hits your CRM with a full transcript, and anything below the confidence threshold is handed to a human with context — not dropped.

Every call ends with an artifact: a transcript in your CRM, an outcome tag, and a trace. Calls the agent shouldn't handle — an angry customer, a complex negotiation, anything below the confidence threshold — get handed to a human with context, not dropped to voicemail.

Isolation by design

Your stack is yours. Full stop.

Every client runs isolated: separate knowledge stores, separate credentials, separate configuration. Your pricing logic, your call transcripts, and your customer list never share a space with anyone else's. We build inside your accounts wherever possible — so if we disappeared tomorrow, the assets are already in your hands.

Security & Data Responsible AI

Separate stores per client

Knowledge, transcripts, and configuration are partitioned per business — no shared pools, no cross-contamination.

Least-privilege credentials

Each agent holds scoped keys to exactly the tools its job needs. Revocable in one place, auditable always.

No training on your data

Your data runs your agents. It doesn't train anyone's models — ours or anyone else's.

Kill switch on every agent

Any agent can be paused instantly — by us or by you — without taking down the rest of the system.

The build standard

Every deployment ships with all ten. Not most. All.

This is the checklist a build must pass before it goes live in your business — and the checklist to hold any vendor to, including us.

Staging and production environments — nothing ships straight to your customers

Version-controlled prompts, routing rules, and configuration with rollback

Human-approval checkpoints on every step that moves money or signs your name

Confidence-routed escalation: above threshold it acts, below it hands to a human

Structured logging on every agent step, queryable when you ask "why?"

Alerting wired to us — most issues are fixed before you notice them

Graceful degradation: if a tool is down, the agent says so and queues the work

Per-client isolation: your data, credentials, and configuration touch no one else's

A documented runbook and a kill switch for every agent

Weekly plain-language reports plus a live portal — no black box

Put it to work

This stack, mapped to your business.

On a 30-minute call we'll walk your operation, show you which agents earn their keep first, and sketch the architecture on your actual stack — before you commit to anything.

Book a Strategy Call See the 90-Day Sprint