Strategy

Managed AI-Augmented BPO from Lagos: The Operation, By Design

cmdev13 min read
Managed AI-Augmented BPO from Lagos: The Operation, By Design
Share
~19 min

Most articles about a service capability describe what the provider has already done. This one is deliberately different. The Lagos and Abuja offices are real. The AI engineering practice is in production for other workloads. The operating design — talent funnel, AI-augmented workflow, supervisor ratios, SLA tiers, eval gates, pricing model, audit chain — is written down in the form you are about to read. What is not yet true is that any of it is running for a paying managed-BPO client; the first cohort has not been hired, the first contract has not been signed.

The framing throughout is the honest one: if we are to build and run a managed AI-augmented BPO from Lagos, this is how we will do it — in theory and in practice. The capability is real and ready; the execution is the next phase, declared in the open before it starts.

Key takeaways

  • The Lagos and Abuja offices, the existing AI engineering practice on Amazon Bedrock and Connect, and the Nigerian regulatory understanding sit as the substrate — but no live BPO contracts are running yet. This article is the operational design, not a retrospective claim.
  • The talent funnel draws from Lagos and Abuja tertiary graduates plus the parallel technical pipeline (Andela, Decagon, Semicolon Africa, ALX) through a five-stage screen, into a four-to-six-week training programme before first live interaction.
  • The AI-augmented workflow runs on Amazon Connect plus Bedrock: the agent sees a model-generated draft grounded in the client's knowledge base, edits or accepts, and the audit chain captures the draft, the edits, and the final response. Agent edit rate is the monitored operational signal.
  • SLAs, as designed: 80% of tier-1 calls answered in under 30 seconds, CSAT at or above 95%, platform uptime at 99.5%, and a complete audit log delivered within 48 hours of interaction window close.
  • Pricing is per interaction, not per seat — tier-1 routine, tier-2 complex, tier-3 regulated, with rates we expect to land in the €1.30 to €1.80 per-interaction band at scale, but which we refuse to promise below until the first cohort is operational and unit economics are measured.

Why we are set up to do this

The case for cmdev as a managed AI-augmented BPO provider is not that we have run the operation before. It is that the four ingredients a buyer should look for are already in place. The work ahead is composing them into a single delivery shape, not building any from scratch.

First, the AI engineering practice — Amazon Bedrock pipelines, retrieval-augmented generation against client knowledge bases, Lex and Bedrock Agents integrated with Connect contact flows — in production, audited, and documented across the work we have written up elsewhere.

Second, the physical operations footprint. The Lagos and Abuja offices are leased, networked, and used by our engineering and advisory teams. Standing up a managed-BPO floor is a fit-out and hiring exercise on top of an operating base, not a greenfield property project.

Third, the Nigerian regulatory understanding. We have mapped NDPA Section 65 record-keeping, GAID expectations on cross-border processing, the CBN cybersecurity assessment regime, and GDPR Article 30 obligations into a coherent data-handling and audit-chain posture.

Fourth, the operating posture itself — the design described below. The discipline of writing it down before delivery is the discipline we expect to bring to delivery itself. What we are not pretending to have is a track record of delivered BPO contracts.

The operation, by design

The shape, as designed, is a single Lagos-led managed service with Abuja as the secondary site for resilience and capacity expansion. Both sites run on the same operating model, training, eval gates, and audit chain. Four structural choices distinguish the design from a generic BPO posture: the training programme is six weeks for AI-augmented seats rather than four (the additional weeks pay for tool fluency); supervisor-to-agent ratios are 1:8 and 1:5 rather than 1:15 or 1:20 (the QA gain shows up downstream in CSAT and audit-chain integrity); pricing is per interaction rather than per seat (aligning incentives toward output, not staffed hours); and every interaction produces a structured audit record that meets the regulated-buyer evidence standard out of the box.

The talent funnel

The talent funnel, as designed, is a five-stage screen run continuously rather than per cohort, so a contract trigger draws from a qualified pool. Stages one through three are open application, an English fluency assessment (written and spoken against a published rubric), and a cognitive aptitude screen. Primary recruitment is from Lagos and Abuja tertiary institutions — Lagos, Babcock, Covenant, Pan-Atlantic, Abuja, Ibadan, and the technical institutions in both cities — with a secondary pool from the parallel technical ecosystem (Andela, Decagon, Semicolon Africa, ALX). Roughly 8 to 12 per cent of applications pass stages one through three.

Stage four is a structured interview with a behavioural component and a role-play — a billing query with a confused customer, an escalation that needs de-escalation, a regulated-information request handled correctly. Stage five is the offer; we expect a high acceptance rate (over 80 per cent) because structured training and documented progression are genuinely attractive against the alternative graduate market. The wage band is calibrated to the junior technology market, not the depressed traditional BPO wage.

Once accepted, the agent enters training — four weeks for traditional, six weeks for AI-augmented — before first live interaction. The two-week additional investment is mandatory; the failure modes of an under-trained AI-augmented agent (over-trusting or rewriting every draft) appear within the first month and damage cohort quality scores.

The AI-augmented workflow

The workflow runs on the substrate we already operate for other clients: Amazon Connect for channel routing, Contact Lens for transcription and sentiment, a Bedrock-hosted generative model for draft response generation, a Bedrock Knowledge Base for retrieval-augmented grounding, and a Bedrock Agent for cases where the workflow needs to act against a system of record.

When an inbound interaction arrives — voice, chat, email, or social — Connect routes it to an available agent on intent classification. The agent's workstation surfaces customer context, retrieved knowledge-base passages, and a model-generated draft. The agent decides whether to accept, edit, discard and rewrite, or escalate. Draft, edits, final output, and decision metadata go to the audit chain.

The agent edit rate is the monitored operational signal. The target band is 15 to 25 per cent on routine intents. Above 50 per cent suggests poor model tuning and triggers a prompt-engineering review; below 5 per cent suggests over-trust with QA catching errors the agents miss, and triggers a coaching cycle. What changes for a managed-BPO operation, against the existing engagements that run on the same substrate, is the human layer above it.

The supervisor structure

The supervisor-to-agent ratio is the largest single determinant of quality at scale. Tier-1 — routine support, well-bounded intents, low regulatory exposure — runs at 1:8. Complex or regulated work — financial services, healthcare, legal, high-value account handling — runs at 1:5. The cost differential against 1:15 is absorbed deliberately; the quality and audit-chain integrity it buys is the structural reason a buyer should pay the per-interaction rate we will quote.

The supervisor is closer to a player-coach than a traditional call-centre supervisor, monitoring a real-time Contact Lens dashboard that flags escalating customer frustration, anomalous edit rates, call duration over the intent-type band, or a model low-confidence signal the agent has not escalated. Defined triggers — customer escalation request, regulated-information request, suspected fraud, duration over two sigma above intent median — fire automatically with a defined supervisor response per trigger.

QA sits outside the supervisor reporting line, deliberately, with sampling at 3 to 5 per cent for tier-1 routine, 8 to 12 per cent for complex or regulated work, and 100 per cent for any Contact Lens-flagged interaction.

SLA tiers, as designed

Tier-1 voice: 80 per cent of calls answered in under 30 seconds, abandon rate below 5 per cent, first-contact resolution above 70 per cent, CSAT at or above 95 per cent. Tier-1 chat: median response under 45 seconds, three concurrent sessions per agent, CSAT at or above 95 per cent. Tier-1 email: 95 per cent responded within four business hours.

Tier-2 complex: 90 per cent answered in under 60 seconds, first-contact resolution above 60 per cent, CSAT at or above 90 per cent. Tier-3 regulated: routing latency in line with tier-2, plus a regulatory-evidence SLA — a complete audit record delivered within 48 hours of interaction window close. Platform uptime against the underlying Amazon Connect availability is contracted at 99.5 per cent monthly; audit log delivery is 48 hours from window close, dropping to per-record latency in tier-3 where the client requires near-real-time evidence. These targets are designed against the substrate we operate — not aspirational, and not yet contracted because no contract is signed.

Eval gate before the first cohort goes live

The discipline that separates a managed AI-augmented operation from a generic BPO floor is the eval gate. Before the first cohort takes a live customer interaction, the workflow has to pass an evaluation against a representative golden corpus derived from the client's actual support history.

The client supplies — or we construct from their historical corpus with their review — 200 to 500 representative interactions across the workload's intent distribution, partitioned into a training portion used to tune the prompt and retrieval, and a held-out evaluation portion. Before go-live, the workflow's responses on the held-out set are scored by a human panel against the client's rubric — accuracy, tone, policy adherence, regulatory correctness — and the score has to clear a contract-defined threshold for the cohort to take live interactions.

The gate runs again on every material prompt update, knowledge-base refresh, and cohort addition; a prompt change that improves the median but degrades the tail does not pass. Red-team probes are part of the same gate — prompt injection, extraction attempts on internal policy, manipulation toward bypassing escalation triggers, regulated-information requests from unverified callers. Red-team coverage is a contracted artefact, refreshed quarterly, delivered to the client's security team.

Pricing model — designed

The pricing model is per interaction, not per seat. Per-seat aligns the incentives toward staffed hours; per-interaction aligns toward output and forces us to manage staffing to volume.

The ladder is tier-1 routine, tier-2 complex, tier-3 regulated. The rates we expect to quote at scale fall in the €1.30 to €1.80 per-interaction band for tier-1, with tier-2 and tier-3 priced upward against the tighter supervisor ratio and the additional QA and audit-chain overhead. We will not promise specific numbers below this band until the first cohort is operational and unit economics are measured rather than modelled. Against €4 to €7 in the equivalent EU operation and €6 to €10 in the US equivalent, the cmdev band is structurally lower without being arbitrage-dependent — it survives wage growth. Specific pricing is per-contract.

The audit chain

The audit chain is what makes the operation defensible to a regulated buyer. Principal-of-record discipline runs from the caller through Connect, through the Bedrock model and knowledge-base retrieval, through the human agent's edits, to the final response — each step's actor, input, output, and timestamp captured.

The record schema captures the inbound customer input (with PII tokenised), intent classification and routing, retrieved knowledge-base passages, the model draft and the prompt that produced it, the agent identifier and edits, the final output, supervisor interventions, escalation triggers, and downstream-evidence metadata. It is structured to meet NDPA Section 65 record-keeping for Nigerian data subjects, GDPR Article 30 records for EU data subjects, and GAID record-keeping expectations as they come into force. Retention is configured per workload and jurisdiction; direct identifiers are tokenised at ingestion. The record is a contracted deliverable, exposed through a defined API and scheduled bulk delivery. For an EU client whose data subjects' information is processed in Lagos, that audit chain plus the supporting Data Processing Agreement and standard contractual clauses is the regulatory wiring that makes the engagement defensible to the client's data protection officer.

What we'd commit to in month one

Month one, as scoped: a first cohort of typically 10 to 20 seats, depending on workload volume and intent complexity. Onboarding covers training, eval-gate construction and execution against the client's golden corpus, audit-chain integration, the supervisor cadre, and SLA reporting wiring. The exit criterion is the eval gate passing and the supervisor structure being in place and tested. The buyer should hear from us weekly through that month.

Month two scales to operational volume; month three is the first full SLA and QA cycle with feedback into the next cohort's training; from month four the operation is in steady state with monthly business review and quarterly red-team refresh.

What we will not promise yet

The list of commitments we refuse to make in writing until the first cohort is operational is short and specific. Call volumes at scale until throughput is measured against the substrate. Pricing certainty below the €1.30 to €1.80 tier-1 band until unit economics are measured rather than modelled. SLA performance on languages outside the trained set — English for US, UK, and EU markets is the trained set; German, French, Spanish, Portuguese, and the broader EU-language set are out of scope until we have built the equivalent recruitment and training capability. Infeasible ramp expectations — a six-week ramp from contract signature to 50 seats is feasible against the substrate; a two-week ramp to 200 seats is not, and we will say so.

The point of the article is to be exact about both the capability and the gap.

FAQs

You have not run a managed BPO contract yet — why should a buyer take the design seriously?

The substrate is in production for existing engagements: Amazon Connect, Bedrock generative and retrieval pipelines, the Lagos and Abuja sites, NDPA and GDPR record-keeping discipline. A defensible procurement should review the design, contract eval-gate corpus construction as the first milestone, scope a small first cohort with a defined go-live gate, and only ramp once measured performance clears the SLA targets. The risk profile of a provider declaring the design openly and committing to gated execution is materially different from one claiming an unverified track record.

How quickly can the first cohort be operational from contract signature?

Six to eight weeks, assuming the eval-gate corpus is available from the client within the first two weeks. A faster ramp is possible if the client supplies a pre-existing cohort to retrain — that compresses recruitment but not training, because the AI-tool fluency weeks are mandatory.

What happens to pricing if the workload is complex or regulated?

The €1.30 to €1.80 band is the expected tier-1 range. Tier-2 and tier-3 price upward against the tighter supervisor ratio, heavier QA sampling, additional red-team coverage, and the audit-chain SLA at near-real-time latency. We quote against the specific workload at scoping.

How does the design handle a 24/7 customer base from a Lagos site?

The two-shift design covers EU, UK, and US East business hours plus the US West morning from one Lagos centre — the structural GMT+1 advantage against the Atlantic markets. True 24/7 adds a third overnight shift, engaged only when the contract requires it, materially more expensive per interaction. For Pacific or East Asian coverage, Lagos is the wrong geography.

What does the eval-gate corpus look like in practice?

200 to 500 representative interactions across the workload's intent distribution, drawn from the client's historical support records with their data-governance review. Scoring is by a panel of three to five reviewers — accuracy, tone, policy adherence, regulatory correctness — including at least one client-side reviewer. We expect tier-1 to clear at 90 per cent and tier-3 regulated at 95 per cent, with explicit failure conditions for any single dimension below a defined floor.

Companion content

How to engage

We are set up to deliver a managed AI-augmented BPO from Lagos and Abuja for US, UK, and EU customer-experience workloads, against the operating design declared above. The capability is ready; the first contract has not been signed. Talk to us at creativeminds.dev/contact. We will scope the engagement against the eval-gate exit criterion and the cohort ramp described above, and we will be exact about what is built, what is designed, and what is yet to be measured.

bpomanaged-bponigeriaamazon-connectai-augmentationmanaged-servicesoperationsstrategy

Ready to strengthen your security posture?

We help organizations across Africa build resilient infrastructure, deploy AI at scale, and navigate complex regulatory environments.

Start a conversation