Engagement · 02 · Foundational · Recommended

Twelve weeks. One re-instrumented system. Your team running it.

Our flagship engagement. Telemetry redesigned, OTel rolled out, eBPF where it matters, two paired P1 incidents before we leave. You don't end with a deck — you end with a system.

Duration
12 weeks
Price
USD 96,000
Format
Remote + 1 wk on-site
Senior allocation
80%
01 Outcome targets

What changes by week 12.

These are the median results from 14 Foundational engagements (2023–2025). Your numbers depend on your starting point — we set the actual targets in week 2.

62 % trending_up
Avg. cost reduction
4.2x trending_up
MTTR improvement
−81 % trending_up
Alert volume
12 wk trending_up
Fixed timeline
02 Protocol

Twelve weeks, five phases.

01
Week 1–2

Diagnose

We run the full Diagnostic in flight. If you've already done it with us, we skip and credit the fee.

description Maturity profile
description Cost + cardinality audit
description Stakeholder map
02
Week 3–4

Define

Telemetry schema redesign. SLO catalogue. Error-budget policy. Trace coverage plan keyed to revenue-bearing flows.

description Telemetry schema v1
description SLO catalogue
description Trace coverage plan
description Cost target
03
Week 5–8

Instrument

OTel collector pipelines, eBPF probe set on hot paths, dashboard rewrite, alert library v2. Your engineers pair on every PR.

description OTel collector config
description eBPF probe set
description Alert library
description Dashboards (≤12)
04
Week 9–10

Operate

Two paired P1 simulations against the new system. Runbook rewrite. On-call training. We tune for production noise.

description Runbook set
description Incident retros (×2)
description On-call schedule
description Capacity review
05
Week 11–12

Handover

Knowledge transfer, document set, retainer scoping if you want one. We leave with a working system, not a slide deck.

description Architecture decision records
description Handover doc set
description 90-day post-engagement plan
03 Tech stack

What we deploy.

Defaults below — we adapt to what you already run. We don't replace tools that work; we replace ones that don't.

OpenTelemetry Otel Collector Grafana Tempo Loki Mimir Pyroscope Prometheus eBPF (Cilium/Pixie) Sentry Honeycomb
Inclusions
  • check Maturity profile + roadmap
  • check Telemetry schema v1
  • check OTel collector pipelines
  • check eBPF probe set (hot paths)
  • check Alert library v2
  • check ≤12 dashboards (we delete more than we add)
  • check SLO catalogue + error-budget policy
  • check On-call runbook rewrite
  • check Two paired P1 simulations
  • check Architecture decision records
  • check Handover doc set
  • check 30 days post-handover Slack
Engagement.start()

Capacity is limited.

We run at most 4 Foundational engagements concurrently. Currently 2 of 4 slots open for the next quarter.