The Agent Governance Gap


The Agent Governance Gap

Introduction

This newsletter helps CFOs think better about AI decisions. Each edition tests frameworks from the CFO AI Playbook against real evidence and delivers one concrete action.

This edition: Building the Finance AI Stack covers AI architecture for finance, applying McKinsey's mesh concept to finance systems. Who Owns Your AI Agents? discusses agentic AI: when agents carry out processes semi-autonomously, who takes ownership and how do you measure impact?

The Playbook publishes Q2 2026. Your feedback shapes the final work.

Building the Finance AI Stack

While much of the discussion on AI focuses on prompt engineering, what if the future isn't choosing between approaches, but building a complementary AI stack that serves different strategic purposes?

The Three-Layer AI Architecture for Finance

Most discussions treat AI approaches as evolutionary stages, but complex organizations need all three working together.

This builds on McKinsey's "agentic AI mesh" concept of layered decoupling, applied specifically to finance operations.

Foundation Layer: Systematic ERP Integration AI embedded directly in core systems with automated workflows. SAP S/4HANA cuts financial close cycles by 50%, Tipalti reduces accounts payable processing by 80%. This handles routine processes without human intervention.

Intelligence Layer: Specialized AI Agents Purpose-built agents that draw from multiple systems for complex analysis. Tipalti's Ask Pi queries across payment data, DataRails' FP&A Genius consolidates budgeting from various sources. These provide strategic intelligence beyond single-system capabilities.

Flexibility Layer: Prompt Engineering Ad hoc analysis and exploratory queries for presentations, quick scenarios, and one-off strategic questions. Perfect for executive dashboards and board presentations.

Across the Stack McKinsey illustrates this complementary approach in supply chain: AI integrated into warehouse management systems (Foundation), autonomous orchestration across multiple operations (Intelligence), with strategic decisions escalated for human input (Flexibility). The three-layer architecture is emerging as a universal pattern for enterprise AI implementation.

The Next Evolution: Avatar-Enhanced Interfaces A conversation with an Apyhub co-founder (API platform for AI-powered developer tools) revealed avatar-based agents transforming software engineering. Soon in finance: "Virtual CFO Advisors" making sophisticated cross-system analysis as approachable as consulting a colleague.

Competitive Landscape

Enterprise Solutions (SAP, Oracle, IBM): Deep ERP integration with specialized agents like SAP's Joule, built-in compliance frameworks, Fortune 500 scale Specialized F&C Tools (Tipalti, DataRails, Netgain): Purpose-built agents for rapid deployment, conversational interfaces understanding F&C terminology, mid-market focused Strategic Implementation For CFOs: Don't choose between approaches - architect the complete stack. Foundation AI for efficiency, intelligent agents for strategic insights, prompt engineering for flexibility.

For Finance Professionals: Master all three layers to become hybrid professionals who seamlessly blend systematic automation with strategic analysis and executive communication.

The Bottom Line: The most sophisticated finance organizations aren't choosing between AI approaches - they're building complementary stacks where systematic integration, specialized agents, and prompt engineering work together to transform decision-making at every level.

What's your approach to building the complete AI stack? Are you implementing across all three layers?

Reference: McKinsey & Company. "Seizing the agentic AI advantage." June 2025. https://www.mckinsey.com/capabilities/quantumblack/our-insights/seizing-theagentic-ai-advantage

Who Owns Your AI Agents

AI agents are entering finance workflows as systems that execute work semi-autonomously: processing invoices, monitoring contract compliance, flagging anomalies, routing exceptions.

The architecture question is settling. ERP remains the foundation layer—your system of record for transactions, balances, and audit trails. AI agents operate in the intelligence layer above it, drawing from multiple systems for pattern recognition and cross-system analysis. A flexibility layer handles ad hoc queries and executive dashboards.

Agents depend on this foundation. When an agent flags a compliance issue or recommends a hedge, the underlying data still lives in ERP. That's where you reconcile. That's what auditors trace to.

The technical architecture is becoming clear. What's missing is the governance architecture to match it.

The Complication

Srinivasan and Wei, writing in Harvard Business Review, document the emergence of dedicated roles to govern agent performance.

Salesforce's Agentforce platform now autonomously resolves 74% of inbound customer support cases. Their sales development team went from 150 meetings in 30 days to over 350 meetings in a single week after deploying AI agents, generating $60 million in annualized pipeline within four months.

Order-of-magnitude changes that only happen when AI agents operate semi-autonomously at scale.

Salesforce created dedicated roles to govern these systems. Zach Stauber, an agent manager in their organisation, describes his daily routine: "Data, Data, Data. I start and end my day in dashboards, scorecards, and agent observability monitoring."

Governance embedded in operations.

The Ownership Question

Srinivasan and Wei make a claim with direct implications for CFOs: "In the pre-agentic world, AI deployment lived within IT or the data science organisation. In the agentic era, business units need to take control."

If AI agents are performing real work for a business unit, that unit must own their performance.

The deterministic-probabilistic boundary matters here. ERP processes—general ledger posting, tax calculations, regulatory reporting—require rule-based certainty where "99% accurate" means compliance failure. AI excels at probabilistic tasks: pattern recognition, anomaly detection, forecasting.

The governance challenge is maintaining clear separation between these domains. AI provides recommendations and flags exceptions. Humans approve decisions that impact financial records. Deterministic code executes approved transactions.

This separation requires someone accountable for agent performance. Someone who assesses whether outputs remain reliable and knows when to escalate.

For finance, this accountability extends beyond agents deployed directly within your processes. If an AI agent processes supplier data before it reaches your AP workflow, someone must own that agent's performance—and finance needs visibility into output quality and clear escalation paths when issues arise. The owner may sit in procurement or operations, but the governance gap affects finance regardless.

The capabilities Srinivasan and Wei identify for effective agent managers:

AI operational literacy: Operational judgment about probabilistic systems. Understanding how agents operate and how to diagnose performance issues.

Functional depth: Deep knowledge of the business process the agent supports. Someone who knows what "good" looks like in invoice processing or variance analysis.

Systems thinking: Visualising how agents interact across workflows and departments, recognising that finance processes connect to upstream and downstream systems.

Work design across humans and machines: Creating hybrid workflows, assessing machine capabilities, designing escalation routines.

Change resilience: Adapting to shifting models through continuous monitoring. Salesforce's agent managers work in weekly "test-deploy-learn" cycles.

The Metric Shift

The HBR research documents a shift in how performance should be measured when agents enter the workforce. In sales and support contexts, Salesforce moved from activity metrics to orchestration outcomes—measuring how well people govern the human-agent system.

For finance, the question is whether productivity gains actually convert to value.

Shah and Course's Six Levels of Benefits framework offers a structure. Productivity gains—time saved through automation—sit in the middle of a progression. They deliver no financial value unless explicitly converted: either downward to cost reduction through actual headcount or spend decreases, or upward to strategic capability through redeployed capacity that influences decisions.

Measuring time saved without tracking conversion perpetuates the measurement gap that has plagued finance transformation for a decade.

Anders Liu-Lindberg operationalises this through the Value Log: finance employees document specific actions where they influenced business performance. Ørsted implemented this across 600+ finance employees. Their Impact Log now contains close to 1,000 documented cases. A global pharmaceutical company sets annual targets of $150 million in documented value contributions.

Liu-Lindberg emphasises that the Value Log drives behaviour. Finance employees become conscious of how they contribute to value creation across the drivers that matter: revenue growth, margin improvement, working capital optimisation, risk mitigation.

When agents handle volume, the human role shifts to business partnering. The outcome is whether finance analysis influenced a better capital allocation decision, pricing strategy, or risk position. That's what gets logged.

The Resolution

The three-layer architecture clarifies what's required. ERP remains your foundation—the system of record that agents depend on. The intelligence layer adds capability. That capability demands answers to four questions:

Who owns agent performance in the intelligence layer, including for upstream processes that feed finance data?

What monitoring enables finance to assess output reliability?

How do you convert productivity gains to documented value creation?

Who governs the continuous iteration cycles that the intelligence layer requires?

The organisations getting value from AI agents are building the operational infrastructure to govern them. That infrastructure includes clear ownership extending across process boundaries, monitoring that finance can interpret, and measurement systems that track value conversion.

The foundation stays stable. What you build on top requires ongoing governance.

This week's articles

To Thrive in the AI Era, Companies Need Agent Managers

As autonomous AI agents become common ground, the question for finance leader will be who takes ownership. Drawing on examples from Salesforce and other large organisations, this article introduces the role of the agent manager—leaders responsible for orchestrating how AI agents learn, collaborate, perform, and work safely alongside humans. Agent managers are becoming essential to translating strategic intent into reliable outcomes in an AI-powered, hybrid workforce

Building the business case for quality improvement: a framework for evaluating return on investment

AI often delivers intangible returns—better decisions, faster insights, improved quality—that traditional ROI calculations fail to capture. Shah and Course's Six Levels of Benefits framework gives finance leaders a structured way to translate these quality improvements into measurable value: either downward to cost reduction or upward to strategic capability.

Monday Moves: From Insight to Action

Before your next AI agent deployment, answer two questions.

First: who in your organisation would spend their morning in "dashboards, scorecards, and agent observability monitoring" for that system? Who owns outcomes when outputs drift or exceptions escalate? If the answer is "nobody" or "IT will handle it," you're not ready.

Second: how will you know whether time saved converts to value created? If you can't trace freed capacity to either cost reduction or documented business impact, you're measuring the wrong thing.

Go Deeper: The CFO AI Playbook

The governance frameworks in this edition build on Chapters 2, 3, and 8 of the methodology I'm developing for finance leaders implementing AI.

Chapter 2 covers the deterministic-probabilistic distinction and process selection. Chapter 3 addresses the Six Levels of Benefits framework and why productivity gains deliver no value unless converted. Chapter 8 covers human-on-the-loop architectures and what monitoring infrastructure prevents cascading failures.

I'm finalising the Playbook now and anticipate to publish in May.

Subscribe to the waiting list at www.quipucfo.com