Data Products for Agentic Banking: The Layer a Transaction Model Can't Skip

June 16, 2026 · Banking, Architecture


There is a good argument going around, and it is worth taking seriously. NVIDIA has set out why financial institutions are converging on transaction foundation models: instead of building a separate model for fraud, another for credit, another for churn, a bank trains one model on its own transaction and event history and reuses that learned representation across many tasks. Thoughtworks frames the same shift as building a unified intelligence layer for payments — moving from dozens of isolated feature pipelines to one shared backbone that learns “the language of money movement.”

The strategic claim underneath both is the part that matters: a bank’s proprietary event history is the asset, not any single model trained on it. That is correct, and it is the right place to be looking.

But it skips a layer. A foundation model trained directly on raw transaction logs is an impressive science project. It is not yet something a regulated bank can put in front of a fraud team, a credit committee, or the FCA. Between the raw event history and the reusable intelligence sits a layer that the model-centric framing tends to assume away — and it is the layer that decides whether any of this is operable.

That layer is the data product. And the argument of this note is simple: the missing unit in agentic banking is not the model. It is the governed data product — and a transaction-based platform is structurally advantaged at producing it.

What the model is actually doing

Look closely at the worked example the transaction-foundation-model case rests on. NVIDIA’s is a good one: a payment at midnight means little on its own, but means fraud when it is “the fourth in 10 minutes, on an unfamiliar device, in a city the customer’s never transacted from before.” As they put it, “a foundation model interprets behavior in context where timing, device, location and prior activity shape meaning.”

Notice what the model is doing in that sentence. It is inferring intent and legitimacy — why the payment happened, whether it was authorised, what the customer was trying to do — from behavioural correlates, because the transaction itself recorded that money moved and discarded the rest. The answer existed at the moment of action and was thrown away; the model reconstructs it, statistically, after the fact.

That is the whole game, and it is why the gaps the same authors name — fragmented data, task-specific models, expensive feature engineering, missing context — are gaps in the data, not the model. Feature engineering is slow precisely because the signal (intent, authority, outcome) was never written down. A model can be brilliant at prediction and still be unusable in a regulated decision, because the substrate it learned from cannot answer “who was allowed to do this, what does it mean, and can you prove it.” The intelligence is only ever as governable as the data underneath it.

Two data products from one interaction

Here is the move the model-centric framing misses. A governed agentic banking process does not discard the answer — it records it as the action runs. So a single customer interaction yields two data products, not one:

The interaction microdata — the screen sequence, the timing, the device, the hesitation before a customer confirms. The high-volume behavioural signal of how intent actually formed. This is the raw material a transaction model feeds on, and the half the industry is already chasing.
The governed decision trace — the customer’s stated intent, the authority they held, what the agent proposed, the policy gate that fired, the human approval, and the evidence emitted. The labelled record of who was allowed to do what, and why.

The second one is the prize, and it is the one nobody talks about. It carries — as first-class, recorded fields rather than statistical guesses — exactly the things the model otherwise has to infer:

Meaning. What is this event, tied to a domain concept and an accountable owner — not two systems that both say “payment” and mean different things.
Authority. The mandate, consent or approval that permitted it, travelling with the event.
Lineage and evidence. A trace from the customer action to the control outcome, by construction.
Sensitivity. Which attributes are personal or mandate-bound, and who may see or act on them.

That governed trace is a strictly higher-supervision substrate than a raw transaction stream, and a more defensible proprietary asset: a competitor can hold the same payment rails and never see the intent, authority and approval chain that sits beside them. It does not replace transaction history — it is the labelled fuel that makes a transaction model converge faster and stay auditable. The hard part of agentic banking was never the model. It is the ontology and governance of the data the model and the agents both depend on.

The unit that fills the gap: the governed data product

A data product is a governed, reusable view of a banking concept — packaged with the things that make it safe to use, not just the values. It carries:

a governed meaning, tied to business events and an accountable owner (a Transaction Ontology, not a folder of tables);
authority and consent travelling with the data — mandates, permissions and approval states as first-class fields, not a separate lookup bolted on later (consent-bearing data);
lineage and evidence — traceable from customer action to control outcome (evidence-grade data products);
sensitivity boundaries — personal and mandate-bound attributes bounded by role, purpose and exposure policy;
reusable context — one product serving many engines and propositions, so each team stops rebuilding its own private version of the bank.
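
One way to picture a sensitivity boundary is as a projection: the product holds every attribute, and each consumer sees only what its role and purpose permit. The sketch below is illustrative only. The sensitivity labels, the `EXPOSURE_POLICY` table and the `project` function are hypothetical names chosen for this example, not the schema of any real product.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Attribute:
    name: str
    value: object
    sensitivity: str  # hypothetical labels: "public", "personal", "mandate_bound"


# Hypothetical exposure policy: (role, purpose) -> sensitivity classes visible.
EXPOSURE_POLICY = {
    ("fraud_analyst", "fraud_detection"): {"public", "personal"},
    ("reporting_engine", "regulatory_reporting"): {"public"},
}


def project(attributes, role, purpose):
    """Return only the attributes this role may see for this purpose."""
    # An unknown (role, purpose) pair gets an empty set, so it sees nothing.
    allowed = EXPOSURE_POLICY.get((role, purpose), set())
    return {a.name: a.value for a in attributes if a.sensitivity in allowed}


# Illustrative values only.
payment = [
    Attribute("amount_gbp", 250, "public"),
    Attribute("payer_name", "A. Customer", "personal"),
    Attribute("mandate_id", "M-001", "mandate_bound"),
]

reporting_view = project(payment, "reporting_engine", "regulatory_reporting")
fraud_view = project(payment, "fraud_analyst", "fraud_detection")
unknown_view = project(payment, "marketing_bot", "campaign_targeting")
```

The design choice worth noting is the default: a consumer that is not named in the policy receives nothing, which is what keeps the reachable set finite and enumerable.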

This is the same discipline we apply to capabilities in the Domain Building Block: take the thing everyone treats as a cross-cutting afterthought — controls and evidence — and promote it to a first-class property of the unit. A DBB does it for a capability. A data product does it for the data that capability runs on. They are the same move, one layer apart, and they are designed to fit: data products are structured around the same DBB, ABB and BIAN boundaries your architects already trust, so a product is portable and intelligible rather than a one-off extract.

What that looks like, concretely. In our live Family Wealth journey a customer adds £250 to her mother’s care fund. That single action becomes one governed object: her stated intent, the authority she holds over the fund (a contributor mandate, carried on the event rather than looked up in a permissions table afterwards), what the agent proposed, the policy gate that fired, the human approval where one was needed, and the evidence emitted at each step. Beside it sits the interaction microdata — the screen sequence and timing of how she arrived at the decision. One is the behavioural signal a model feeds on; the other is the governed, labelled trace. Both fall out of acting under governance; neither is reconstructed later. (That object is the worked example on our public product view — we show its shape, not its schema.)
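
As a rough illustration of that governed object, the sketch below records each step of the journey and emits evidence in the same call, so the trace is never assembled afterwards. The real schema is not public, so every name here (`DecisionTrace`, `add_step`, the step labels, the policy gate wording) is an assumption made for this example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class DecisionTrace:
    intent: str      # what the customer said she wanted to do
    authority: str   # the mandate carried on the event itself
    steps: dict = field(default_factory=dict)
    evidence: list = field(default_factory=list)

    def __post_init__(self):
        # Intent and authority are evidenced at creation, not looked up later.
        self._emit("intent", self.intent)
        self._emit("authority", self.authority)

    def add_step(self, name, detail):
        """Record a step and emit its evidence in the same call."""
        self.steps[name] = detail
        self._emit(name, detail)

    def _emit(self, step, detail):
        self.evidence.append({
            "seq": len(self.evidence) + 1,
            "step": step,
            "detail": detail,
            "at": datetime.now(timezone.utc).isoformat(),
        })


# Hypothetical values loosely following the care-fund example.
trace = DecisionTrace(
    intent="add GBP 250 to mother's care fund",
    authority="contributor mandate",
)
trace.add_step("agent_proposal", "transfer GBP 250 to care fund")
trace.add_step("policy_gate", "third-party contribution check")
trace.add_step("human_approval", "approved by authorised reviewer")

recorded_steps = [e["step"] for e in trace.evidence]
```

Because `add_step` is the only way to advance the journey in this sketch, a step cannot exist without its evidence entry.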

Why a transaction-based platform is structurally advantaged

Here is the part that is our own argument, not the industry’s.

You can bolt a data-product layer onto any estate, and many banks will spend years doing exactly that — reverse-engineering meaning, authority and lineage out of systems that were never built to emit them. But if your platform is transaction-based — if business events, states, permissions, relationships and evidence are native to how it runs — then the governed data product is close to a by-product rather than a salvage operation.

We learned this the hard way, building it. Years ago we architected a UK payments platform whose authoritative source of truth was the event log, not the ledger. The accounting service was a downstream projection — a consumer of the event stream, not the centre of the system. That choice was made for operational reasons (scheme conformance, dual-site availability, multi-rail support), but the shape that fell out of it is exactly the shape this decade needs. When the sequence of events is the truth, every reusable data product is a governed projection of that same spine, rather than a fragile reconciliation across systems that each think they are the master.
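
A minimal sketch of that relationship, assuming a simplified in-memory event log: the balances are never stored as the master record, only folded from the ordered events. The event kinds, account names and the `project_balances` function are hypothetical and stand in for whatever a real platform would use.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class PaymentEvent:
    seq: int           # position in the event log
    kind: str          # hypothetical kinds: "settled" or "reversed"
    debtor: str
    creditor: str
    amount_pence: int


# A reversal undoes the money movement of a settlement.
SIGN = {"settled": 1, "reversed": -1}


def project_balances(events):
    """Fold the ordered event log into account balances (a downstream view)."""
    balances = defaultdict(int)
    for e in sorted(events, key=lambda e: e.seq):
        sign = SIGN[e.kind]  # an unknown kind raises KeyError rather than guessing
        balances[e.debtor] -= sign * e.amount_pence
        balances[e.creditor] += sign * e.amount_pence
    return dict(balances)


# Illustrative log only.
EVENT_LOG = [
    PaymentEvent(1, "settled", "alice", "care_fund", 25_000),
    PaymentEvent(2, "settled", "bob", "care_fund", 10_000),
    PaymentEvent(3, "reversed", "bob", "care_fund", 10_000),
]

balances = project_balances(EVENT_LOG)
```

If the projection is ever in doubt it can be discarded and rebuilt from the log, which is the property that makes the log, not the ledger, the source of truth.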

We call that spine the Governed Event Spine, and it is the structural reason a transaction-based platform produces evidence-grade data products cheaply: the meaning, the authority and the lineage are recorded as the events happen, not reconstructed from logs afterwards. The same separation made another property fall out for free — decisioning sat as a peer to settlement, not inside it. That is what makes an agent’s decision governable: when the decision is a separable, evidenced step rather than a side effect buried in a payment flow, “show me how this was authorised and recorded” has an answer by construction.

Two decades ago we used the same idea for a different proposition: a multi-currency vault we built for a yacht, where the owner could see the funds held against the asset and a defined set of authorised actors — captain and crew — could spend against it under policy, in two currencies. In today’s language that is proposition-native intelligence with policy-bounded actor authority — exactly the grammar an agent needs, designed long before agents could use it. The architectural slot was right; the substance has caught up.

From data product to engine — the reuse that actually compounds

Once the data products exist, the model-centric vision finally becomes deliverable — because now the reusable layer is governed, and the engines are built on it rather than around it.

flowchart TB
  EV["Business events<br/>(card, A2A, servicing, disputes, mandates)"]
  ON["Transaction Ontology<br/>(governed meaning + accountable owner)"]
  SP[("Governed Event Spine<br/>event log as source of truth")]
  subgraph DP["Data products — governed, reusable"]
    P1["Party & Relationship"]
    P2["Authority & Consent"]
    P3["Transaction Context"]
    P4["Case & Evidence"]
  end
  subgraph EN["Engines — built on the products"]
    E1["Fraud"]
    E2["AML / KYC"]
    E3["Risk"]
    E4["Reporting"]
    E5["Servicing"]
  end
  PR["Propositions & agents<br/>(customer action · agent action · human oversight)"]
  EV --> ON --> SP --> DP --> EN --> PR

Fraud, AML, KYC, risk, reporting and servicing stop being six private interpretations of the bank and become six consumers of the same governed products. A transaction foundation model, if the bank builds one, becomes one more consumer of that layer — a powerful one — rather than the thing the whole strategy depends on.
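
To make "consumers of the same governed products" concrete, here is a small sketch in which a fraud rule and a reporting total both read one shared record type instead of each building its own extract. The `TransactionContext` fields, the thresholds and both functions are assumptions for illustration; the velocity rule loosely mirrors the "fourth in 10 minutes, on an unfamiliar device" example quoted earlier.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class TransactionContext:
    """Hypothetical shape of one record in a shared data product."""
    customer_id: str
    at: datetime
    amount_pence: int
    device_known: bool


def fraud_velocity_flag(records, customer_id, now,
                        window=timedelta(minutes=10), threshold=4):
    """Fraud consumer: flag a burst of payments that includes an unfamiliar device."""
    recent = [r for r in records
              if r.customer_id == customer_id and now - window <= r.at <= now]
    return len(recent) >= threshold and any(not r.device_known for r in recent)


def reporting_total(records, customer_id):
    """Reporting consumer: total value moved, read from the same records."""
    return sum(r.amount_pence for r in records if r.customer_id == customer_id)


# Illustrative data: four payments in nine minutes, the last on a new device.
NOW = datetime(2026, 6, 16, 0, 0)
RECORDS = [
    TransactionContext("c1", NOW - timedelta(minutes=9), 2_000, True),
    TransactionContext("c1", NOW - timedelta(minutes=6), 2_000, True),
    TransactionContext("c1", NOW - timedelta(minutes=3), 2_000, True),
    TransactionContext("c1", NOW, 2_000, False),
]

flagged = fraud_velocity_flag(RECORDS, "c1", NOW)
total = reporting_total(RECORDS, "c1")
```

Neither function owns the data or redefines what a payment is; each is a thin consumer over the same definition.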

For the CDO, this is the move from data you can query to data you can act on. A governed estate of warehouse tables serves BI; agentic decisions need products that are fresh, callable, and carry lineage that holds up. BCBS 239 expects risk data aggregated “on a largely automated basis so as to minimise the probability of errors” — a product that carries its own provenance answers that by construction, instead of reconstructing lineage to satisfy an examiner. It also changes how capability is evidenced: the EDM Council’s DCAM scores data-management maturity not just on process but on the auditable artefacts a capability produces, and a governed product emits those — definition, consent, lineage, sensitivity — as it runs. The CDO shows capability across the estate by construction, rather than reconstructing it for each assessment. And the expensive part of a data team’s work, recovering what an event meant, is already done at the point of action.

That accountability is not the CDO’s alone. A governed data product is the unit each role in the data office can finally hold: the Data Owner can prove quality, definitions and access for their domain at the object level; the Data Steward reads lineage and metadata the object already carries instead of reconstructing them; the Data Custodian applies storage and security controls driven by the object’s own sensitivity classification, not a separate mapping that drifts.

For the CISO, governed products are a security control, not a data-quality nicety. An agent is a new privileged identity, and OWASP’s 2025 Top 10 for LLM applications lists excessive agency as a risk: an agent on a broad identity can reach everything that identity can touch, joined however the model decides, which is an exposure you cannot attest to. Sensitivity boundaries make the reachable set finite, named and auditable. They also keep personal data out of the substrate that leaks: researchers have extracted verbatim personal data from trained models, and UK GDPR’s purpose-limitation and data-minimisation principles already require processing only what is relevant to a declared purpose. A purpose-bound product operationalises that. This is the security argument the model-centric pitch usually leaves out.

We have already built the engine

This is where we part company with the think-pieces: the data-product-to-engine pattern is not a diagram we are hoping to validate. We have built the engine, and one proposition runs on it end to end.

Our Family Wealth proposition runs as an interactive demonstration on our reference engine: a real flow in which governed data products become customer action, autonomous agent action, and human oversight, with the evidence produced as a by-product rather than reconstructed after the fact. It is the proof that the model is operational, not just coherent — an agent prepares and proposes, and a material action cannot execute until an authorised human records approval. That is the same consent and context discipline that runs through C3, our Capability · Context · Consent reference architecture, and through what we call Owned Autonomy — a bank owning its agentic capability rather than renting it.
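
The approval rule described above can be sketched as a gate that sits between proposal and execution. This is a simplified illustration, not the reference engine: the materiality threshold, the approver list and all function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


class ApprovalRequired(Exception):
    """Raised when a material action is executed without recorded approval."""


# Hypothetical policy values for the sketch.
MATERIALITY_THRESHOLD_PENCE = 10_000
AUTHORISED_APPROVERS = {"reviewer_1"}


@dataclass
class ProposedAction:
    description: str
    amount_pence: int
    approved_by: Optional[str] = None


def approve(action, approver):
    """Record approval, but only from an authorised human."""
    if approver not in AUTHORISED_APPROVERS:
        raise PermissionError(f"{approver} may not approve actions")
    action.approved_by = approver


def execute(action):
    """Run the action; material actions are blocked until approval is recorded."""
    is_material = action.amount_pence >= MATERIALITY_THRESHOLD_PENCE
    if is_material and action.approved_by is None:
        raise ApprovalRequired(action.description)
    return f"executed: {action.description}"


# The agent prepares and proposes; it cannot execute a material action alone.
proposal = ProposedAction("transfer GBP 250 to care fund", 25_000)
try:
    execute(proposal)
    blocked = False
except ApprovalRequired:
    blocked = True

approve(proposal, "reviewer_1")
result = execute(proposal)
```

Keeping `approve` and `execute` as separate steps is the point of the sketch: the decision is its own evidenced step rather than a side effect of the payment.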

A short honesty note, because it is the point of how we work. Several propositions run as interactive demonstrations; the rest are explicitly blueprints. The data-product catalogue in our public product view is an illustrative operating model — it shows the shape of each product, deliberately, not a claim that all of them are already built. The deeper material — the ontology internals, the schemas and canonical objects, the control maps and the platform design — sits behind an MOU, in the architecture discussion, not in a blog post. We would rather show you the product shape and prove one engine than imply a finished platform. Our Clean-Room Demonstration is built to production-grade controls on synthetic data: a demonstration, not production, and labelled as such.

Where to start

The instinct, reading the transaction-foundation-model argument, is to start with the model. Start with the layer beneath it instead. Build the governed intelligence layer first — the data products — and the engines, and any foundation model you choose to train, become things you compose rather than megaprojects you gamble on.

That is what our six-week Proposition Blueprint is for: take one proposition, define the data products it needs and the DBBs and ABBs beneath them, and produce a board-ready recommendation — go, reshape, or stop — with an optional 90-day proof of a single bounded journey. You learn the real organisational cost of governed data on one proposition before you commit the estate.

The model-centric framing is right about the prize: your own event history, turned into one reusable intelligence asset. It simply aims one layer too high in how to get there. The reusable asset is not the model. It is the governed data product, and the bank that builds that layer first is the bank whose agents can actually be trusted to act.

A public product view of the data-product model is maintained at digital-transformation-advisory.com/composable-bank/data-products. This note stays at the architecture-decision level; the ontology and schema detail sit in the post-MOU architecture discussion by design.

Sailesh Panchal

Founder and CEO of Digital Transformation Advisory, strategic advisory for the future of banking. 30+ years in banking and FinTech. Writing a book on digital transformation in UK banking.
