The payment industry has spent decades perfecting the art of verifying that a human is behind a transaction. From the physical presence of a card to the digital friction of 3D Secure (3DS) and biometric authentication, every security layer assumes a conscious "buy" click from a person. When we introduce autonomous AI agents into this ecosystem, we encounter the "Human-Not-Present" (HNP) crisis. This is not just a variation of "Card-Not-Present" fraud. It is a fundamental architectural mismatch. Traditional payment rails are built on the assumption of human intent, but agents operate on the basis of inferred goals and machine reasoning.
The core of the problem lies in the breakdown of traditional authorization logic. In a standard e-commerce flow, the merchant and the issuer rely on the user’s direct interaction with a secure interface. This interaction provides a clear signal of consent. AI agents, however, act as intermediaries that make decisions based on high-level instructions. When an agent initiates a payment, the merchant has no native way to verify if the specific transaction was actually authorized by the user or if it is the result of a model hallucination or an adversarial injection.
Traditional security protocols like 3DS become a bottleneck rather than a safeguard in an agentic workflow. If an agent requires a human to solve a CAPTCHA or provide a fingerprint for every micro-transaction, the very value of autonomy is lost. Conversely, bypassing these checks creates a massive security vacuum. Without a human in the loop at the moment of purchase, the industry lacks a standardized method for non-repudiation. If a user later claims they did not authorize a specific purchase made by their agent, the current financial infrastructure has no cryptographic proof to settle the dispute.
The technical challenges of HNP transactions can be categorized into three primary areas:
- Identity Ambiguity: Current payment systems struggle to distinguish between a legitimate agent acting on behalf of a user and a malicious bot using stolen credentials. There is no "Agent ID" that is recognized across the global financial network.
- Authorization Decay: A user might grant an agent broad permission to "manage travel bookings," but this permission does not translate well into the granular, one-time authorizations required by banks. Over time, the link between the original user intent and the specific machine action becomes increasingly fragile.
- Lack of Evidence: When a transaction occurs, the metadata typically includes the IP address, device ID, and location. For an agent running in a cloud environment, these signals are often generic or easily spoofed, offering little value for fraud detection or risk scoring.
As we move toward an economy where agents process millions of transactions per second, the "trust gap" will only widen. We cannot simply "patch" existing rails to accommodate machine-to-machine commerce. The industry requires a new layer of security that can provide deterministic proof of intent without relying on human presence at the point of sale. This shift requires moving away from interactive authentication and toward verifiable, cryptographically signed mandates that can be validated at machine speed.
Verifiable Digital Credentials (VDCs) and the AP2 Protocol Architecture
To bridge the trust gap in autonomous commerce, we need a way to anchor machine actions to human intent. The Agent Payments Protocol (AP2) introduces a robust framework for this by using Verifiable Digital Credentials (VDCs). These are tamper-evident, cryptographically signed objects that serve as the fundamental building blocks of a secure transaction. Unlike a traditional credit card token, a VDC carries specific metadata about the authorization, the participants, and the constraints of the purchase. This allows the payment to be self-describing and verifiable by any party in the chain without requiring a centralized authority to vouch for every step.
The AP2 architecture operates through a system of mandates that separate the "what" of a purchase from the "how" of the payment. This separation is critical for maintaining security while allowing for agentic flexibility. There are two primary types of mandates within the protocol:
- Checkout Mandate: This captures the specific details of the items or services being purchased. It is shared with the merchant and serves as a digital contract that the agent is authorized to execute. It ensures that the agent cannot arbitrarily change the contents of a "cart" after the user has given their initial consent.
- Payment Mandate: This authorizes the actual movement of funds from a specific payment instrument. It is shared with the credential provider and the payment processor. By decoupling the payment mandate from the checkout mandate, the protocol ensures that sensitive financial data is never exposed to the merchant or the agent orchestration layer unnecessarily.
These mandates exist in two distinct stages to accommodate the lifecycle of an autonomous transaction. The "Open" stage captures the user’s broad constraints and goals, such as a budget limit or a preferred vendor list. The "Closed" stage is an immutable authorization for a specific amount, bound to a completed checkout. This two-stage process allows an agent to negotiate the best deal within pre-approved boundaries and then "lock" the transaction once the final terms are met.
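The open-to-closed lifecycle can be sketched as a pair of data structures: an open mandate that carries the user's constraints, and a frozen, immutable object produced once the cart is final. The field names and the `close()` helper below are illustrative, not part of the AP2 specification.

```python
from dataclasses import dataclass
import hashlib
import json

@dataclass
class OpenMandate:
    """User's broad constraints: the agent may negotiate within these."""
    user_id: str
    budget_limit: float      # maximum the agent may commit
    allowed_merchants: list  # preferred vendor list

    def close(self, merchant_id: str, amount: float, cart: dict) -> "ClosedMandate":
        """Finalize: bind a specific amount and cart to this mandate."""
        if merchant_id not in self.allowed_merchants:
            raise ValueError("merchant outside mandate scope")
        if amount > self.budget_limit:
            raise ValueError("amount exceeds budget limit")
        # Hash the finalized cart so any later tampering is detectable.
        cart_digest = hashlib.sha256(
            json.dumps(cart, sort_keys=True).encode()
        ).hexdigest()
        return ClosedMandate(self.user_id, merchant_id, amount, cart_digest)

@dataclass(frozen=True)  # frozen: the closed mandate cannot be mutated
class ClosedMandate:
    user_id: str
    merchant_id: str
    amount: float
    cart_digest: str

open_m = OpenMandate("user-42", budget_limit=500.0,
                     allowed_merchants=["acme-travel"])
closed = open_m.close("acme-travel", 320.0, {"flight": "SFO-JFK"})
```

In a real AP2 flow the closed mandate would also carry the user's signature; here the frozen dataclass and cart digest stand in for that immutability guarantee.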
The use of VDCs provides several technical advantages for enterprise security:
- Cryptographic Non-Repudiation: Every mandate is signed using public-key infrastructure. This creates a permanent, verifiable audit trail that proves the user authorized the agent to act within specific parameters.
- Interoperability: Because AP2 is an open protocol, it allows different agent frameworks and payment processors to communicate using a common language. This prevents the fragmentation of the agent economy into proprietary, insecure silos.
- Role-Based Security: The architecture defines clear roles for the User, the Agent, the Merchant, and the Credential Provider. Each role has access only to the information necessary to perform its specific function, following the principle of least privilege.
By engineering trust directly into the transaction objects, AP2 moves us away from the "all-or-nothing" authorization model of the past. It provides the granularity needed to manage complex machine-to-machine interactions while ensuring that the ultimate control remains with the human user. This architectural shift is the first step toward a scalable and secure agentic economy.
Transaction-Level Auth vs. Session Hijacking: The Role of KYAPay and JWTs
One of the most significant security risks in the agentic era is the over-reliance on session-level authorization. In traditional web applications, once a user logs in, a session is established that grants broad access for a period of time. For an AI agent, this "blank check" model is a disaster waiting to happen. If an agent is compromised or suffers from a reasoning failure, a long-lived session could allow it to execute hundreds of unauthorized transactions before the breach is detected. To secure autonomous payments, we must move from the concept of a "trusted session" to the concept of "trusted transactions."
This is where the KYAPay protocol, developed by Skyfire, provides a critical layer of defense. KYAPay is designed as an identity-linked payment standard that focuses on transaction-level authentication. Instead of relying on a persistent session, every single payment request must carry its own proof of identity and authorization. This approach significantly reduces the "blast radius" of any potential exploit, as an attacker who gains control of an agent cannot simply drain a user’s account without providing a valid, per-transaction proof.
The technical backbone of this granular security model is the use of signed JSON Web Tokens (JWTs). These tokens are not just simple identifiers. They are rich data containers that carry verified information about the transaction, including:
- Owner Identity: A cryptographic link to the user who owns the agent.
- Authorization Scope: A precise definition of what the agent is allowed to do, such as "purchase electronics under $500."
- Transaction Parameters: The specific details of the current request, such as the merchant ID, the timestamp, and the exact amount.
By requiring a signed JWT for every transaction, KYAPay ensures that the payment rails can verify the "Who, What, and Why" of a machine-initiated request before any money moves. This model is particularly effective against session hijacking. Even if an adversary intercepts a message or gains temporary access to an agent's execution environment, they cannot generate a valid JWT for a new, unauthorized transaction without the user’s private key.
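A per-transaction token in this spirit can be sketched with nothing but the standard library. The claim names below are illustrative, and HMAC-SHA256 (HS256) is used only for brevity; a KYAPay-style deployment would use an asymmetric signature tied to the user's key so that verification does not require sharing the signing secret.

```python
import base64
import hashlib
import hmac
import json
import time

def _b64url(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def mint_transaction_jwt(secret: bytes, owner: str, scope: str,
                         merchant_id: str, amount: float) -> str:
    """One token per payment request: identity + scope + exact parameters."""
    header = {"alg": "HS256", "typ": "JWT"}
    claims = {
        "sub": owner,                  # owner identity
        "scope": scope,                # authorization scope
        "merchant": merchant_id,       # transaction parameters
        "amount": amount,
        "iat": int(time.time()),
        "exp": int(time.time()) + 60,  # short-lived by design
    }
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode())
        for part in (header, claims)
    )
    sig = hmac.new(secret, signing_input.encode(), hashlib.sha256).digest()
    return f"{signing_input}.{_b64url(sig)}"

def verify_transaction_jwt(secret: bytes, token: str) -> dict:
    """Reject any token whose signature or expiry does not check out."""
    signing_input, _, sig = token.rpartition(".")
    expected = hmac.new(secret, signing_input.encode(), hashlib.sha256).digest()
    if not hmac.compare_digest(_b64url(expected), sig):
        raise ValueError("bad signature")
    payload = signing_input.split(".")[1]
    payload += "=" * (-len(payload) % 4)  # restore base64 padding
    claims = json.loads(base64.urlsafe_b64decode(payload))
    if claims["exp"] < time.time():
        raise ValueError("token expired")
    return claims

secret = b"demo-shared-secret"
token = mint_transaction_jwt(secret, "user-42", "electronics:<500",
                             "merchant-9", 129.99)
claims = verify_transaction_jwt(secret, token)
```

Because the merchant ID and exact amount are inside the signed payload, replaying this token against a different merchant or amount fails verification, which is the property that defeats session hijacking.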
The advantages of transaction-level authentication extend beyond just security:
- Granular Policy Enforcement: Enterprises can define complex spending policies that are checked in real time for every transaction. For example, an agent could be allowed to buy cloud credits but restricted from purchasing physical hardware.
- Improved Auditability: Because each JWT contains the full context of the transaction, the resulting audit trail is much more detailed than a simple list of charges. This makes it easier to track agent behavior and identify anomalies.
- Seamless Integration: Since JWTs are a widely used standard in modern web development, KYAPay can be integrated into existing OAuth2 and OIDC stacks without requiring a complete overhaul of the enterprise identity infrastructure.
Moving to transaction-level authentication is a necessary evolution for any organization deploying autonomous agents at scale. It replaces the fragile "trust but verify" model with a "verify every action" architecture that is better suited for the speed and complexity of the agentic economy.
Mitigating "Machine-to-Machine Mayhem": Defending Against Hallucinated Spending
The autonomy of AI agents introduces a new category of operational risk that traditional fraud detection systems are not equipped to handle. In the world of "machine-to-machine mayhem," the threat is not always a malicious hacker. Often, the risk comes from the agent itself. Model hallucinations, where an agent confidently executes an incorrect action, or recursive loops, where an agent repeatedly attempts the same transaction, can lead to significant financial loss in a matter of seconds. These are not just technical glitches. They are fundamental security vulnerabilities that require a new layer of "pre-flight" defense.
One of the most dangerous vectors in this space is indirect prompt injection. An agent tasked with researching a product might encounter malicious instructions on a third-party website. If the agent is not properly sandboxed, these instructions could hijack its reasoning engine, leading it to initiate an unauthorized purchase or leak sensitive payment metadata. Because the agent is acting "on behalf" of the user, traditional firewalls and filters often fail to recognize these actions as malicious.
To defend against these risks, enterprises must implement a "pre-flight" verification layer that sits between the agent's reasoning engine and the payment gateway. This layer acts as a deterministic gatekeeper, ensuring that every request meets pre-defined safety and financial criteria before it is ever sent to the payment rails. Key components of this defense strategy include:
- Deterministic Policy Guardrails: Instead of relying on the LLM to "follow the rules," financial policies should be hard-coded into a separate validation engine. For example, if a policy states that an agent cannot spend more than $100 per day, the validation engine will automatically reject any request that exceeds this limit, regardless of the agent’s reasoning.
- Contextual Anomaly Detection: Behavioral monitoring systems should track agent activity in real time. If an agent that normally buys office supplies suddenly attempts to purchase high-value crypto assets, the system should trigger an immediate "human-in-the-loop" review.
- Recursive Loop Protection: To prevent runaway spending, developers must implement circuit breakers that limit the number of transactions an agent can perform within a specific timeframe. This is critical for preventing an agent from accidentally draining a budget due to a logic error in its planning phase.
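The guardrail and circuit-breaker components above can be sketched as a single deterministic gate sitting between the reasoning engine and the payment gateway. The class name, thresholds, and method signature are illustrative; the point is that these checks run outside the LLM and cannot be talked out of their limits.

```python
import time
from collections import deque

class PreflightGate:
    """Deterministic gatekeeper: every payment request is checked against
    hard-coded policy before it ever reaches the payment rails."""

    def __init__(self, daily_limit: float, max_tx_per_minute: int):
        self.daily_limit = daily_limit
        self.max_tx_per_minute = max_tx_per_minute
        self.spent_today = 0.0
        self.recent = deque()  # timestamps of recent transactions

    def authorize(self, amount: float, now=None) -> None:
        now = time.time() if now is None else now
        # Circuit breaker: cap transaction velocity to stop runaway loops.
        while self.recent and now - self.recent[0] > 60:
            self.recent.popleft()
        if len(self.recent) >= self.max_tx_per_minute:
            raise PermissionError("circuit breaker tripped: too many transactions")
        # Hard spend limit, enforced outside the agent's reasoning loop.
        if self.spent_today + amount > self.daily_limit:
            raise PermissionError("daily spend limit exceeded")
        self.spent_today += amount
        self.recent.append(now)

gate = PreflightGate(daily_limit=100.0, max_tx_per_minute=3)
gate.authorize(40.0)  # passes
gate.authorize(40.0)  # passes; a third $40 request would now be rejected
```

Whatever the agent "believes" about its budget, the gate raises on the request that exceeds policy, which is exactly the failure isolation the pre-flight layer is meant to provide.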
The goal of these defenses is to create a "sandbox for actions" that is as robust as the sandboxes we use for code. By decoupling the reasoning engine from the execution engine, we ensure that a failure in the LLM does not translate into a failure in the financial system.
Furthermore, the industry is moving toward "multi-agent supervision" models. In this architecture, a primary agent performs the task, while a secondary "supervisor" agent, with a more restricted set of tools and a different model architecture, reviews the proposed action for safety and compliance. This "two-key" approach provides an additional layer of verification that is particularly effective against complex hallucinations and prompt injection attacks.
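The "two-key" pattern can be sketched as requiring an independent approval step before execution. Both functions below are hypothetical stand-ins: the supervisor sees only the proposed action, not the primary agent's tools or context, and applies its own stricter, hard-coded rules.

```python
def primary_proposes(task: str) -> dict:
    """Stand-in for the primary agent's planner output."""
    return {"action": "purchase", "merchant": "acme",
            "amount": 75.0, "justification": task}

def supervisor_approves(proposal: dict) -> bool:
    """Independent reviewer with a stricter rule set than the primary.
    It inspects only the proposed action, never the primary's tools."""
    if proposal["action"] != "purchase":
        return False
    if proposal["amount"] > 100.0:  # tighter ceiling than the primary's
        return False
    return proposal["merchant"] in {"acme", "globex"}

def execute_with_two_keys(task: str) -> str:
    proposal = primary_proposes(task)
    if not supervisor_approves(proposal):
        return "blocked by supervisor"
    return f"executed: {proposal['merchant']} ${proposal['amount']}"
```

In production the supervisor would be a separate model with a different architecture, as the section describes; the structural point is that no single component can both propose and approve a payment.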
Securing agentic payments is not just about protecting credentials. It is about protecting the integrity of the machine's decision-making process. By implementing deterministic guardrails and real-time behavioral monitoring, organizations can unlock the benefits of autonomous commerce without exposing themselves to the chaos of unconstrained machine-to-machine interactions.
Scoped Payment Tokens: Redefining Least Privilege for Digital Wallets
The principle of least privilege is a cornerstone of cybersecurity, yet it is often ignored in the context of digital payments. Historically, giving an application access to a payment method meant giving it the ability to charge any amount at any time. For autonomous agents, this "all-or-nothing" access is a catastrophic security flaw. To mitigate this risk, the industry is moving toward "Scoped Payment Tokens." These are specialized credentials that are restricted by design, ensuring that an agent has only the exact amount of financial authority it needs to complete its task.
Leading payment infrastructure providers are already deploying these technologies. Stripe’s Agentic Commerce Suite, for example, introduces Shared Payment Tokens. These tokens allow an agent to initiate a purchase using a buyer’s credentials without ever seeing the underlying card data. More importantly, these tokens can be scoped with precise limitations that are enforced at the network level. This means that even if an agent is completely compromised, the attacker’s ability to cause financial damage is limited by the pre-defined scope of the token.
The technical implementation of scoped tokens typically involves several layers of constraints:
- Merchant and Category Restrictions: A token can be locked to a specific merchant ID or a broader Merchant Category Code (MCC). For instance, an agent tasked with booking travel can be issued a token that only works with airlines and hotels, rendering it useless for any other type of purchase.
- Temporal Limits (TTL): Tokens can be assigned a short "Time-to-Live." Once the specific task is completed or the time window expires, the token is automatically revoked. This eliminates the risk of long-lived "zombie" credentials that could be exploited later.
- Maximum Spend and Velocity Controls: Enterprises can set hard limits on the total amount a token can authorize, as well as the frequency of transactions. This acts as a financial circuit breaker that prevents an agent from scaling a mistake or a fraud attempt.
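The three constraint layers above can be modeled as a single token object that enforces merchant scope, TTL, and spend/velocity caps at authorization time. The fields and `authorize()` method are a sketch, not Stripe's or any network's actual API.

```python
import time
from dataclasses import dataclass

@dataclass
class ScopedPaymentToken:
    allowed_mccs: set       # merchant category codes the token works for
    expires_at: float       # absolute TTL deadline
    max_total_spend: float  # hard cap across all uses
    max_uses: int           # velocity control
    spent: float = 0.0
    uses: int = 0

    def authorize(self, mcc: str, amount: float, now=None) -> None:
        now = time.time() if now is None else now
        if now > self.expires_at:
            raise PermissionError("token expired (TTL)")
        if mcc not in self.allowed_mccs:
            raise PermissionError("merchant category outside token scope")
        if self.spent + amount > self.max_total_spend:
            raise PermissionError("max spend exceeded")
        if self.uses >= self.max_uses:
            raise PermissionError("velocity limit reached")
        self.spent += amount
        self.uses += 1

# Travel-only token: useless for any other merchant category.
token = ScopedPaymentToken(allowed_mccs={"4511", "7011"},  # airlines, hotels
                           expires_at=time.time() + 3600,
                           max_total_spend=1000.0, max_uses=5)
token.authorize("4511", 420.0)  # airline booking within scope
```

An attacker who steals this token can, at worst, book travel under the remaining cap before the TTL fires, which is the bounded blast radius the section describes.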
Visa and Mastercard are also developing their own "Trusted Agent" frameworks that leverage network-level tokenization. These frameworks allow issuers to recognize when a transaction is being initiated by an agent and apply different risk-scoring models accordingly. By moving the enforcement of these limits to the payment network itself, we create a defense-in-depth architecture that does not rely on the agent’s own internal security.
The shift toward scoped tokens also simplifies the user experience. Instead of a user having to manually approve every small purchase, they can grant a "scoped mandate" to their agent. The user feels secure knowing that the agent cannot exceed its boundaries, while the agent gains the fluidity it needs to operate autonomously.
For security specialists, the takeaway is clear: the agent’s "wallet" should never contain a raw credit card. It should contain a collection of highly specific, short-lived, and restricted tokens. This architecture ensures that the "blast radius" of any single agent failure is minimized, and that the enterprise maintains granular control over its autonomous financial flows.
Accountability and Dispute Resolution in Autonomous Commerce
The final frontier in securing agentic payments is solving the "Liability Gap." When a human makes a mistake or a fraudulent transaction occurs, the legal and financial frameworks for dispute resolution are well-established. However, when an autonomous agent initiates a transaction that the user later disputes, the burden of proof becomes significantly more complex. Was the transaction the result of a clear user instruction, a model hallucination, a developer’s logic error, or a malicious third-party injection? Without a robust technical framework for accountability, the uncertainty of who is liable will stifle the adoption of agentic commerce.
To address this, we must move beyond simple transaction logs and toward "Non-Repudiable Audit Trails." A standard log entry that says "Agent X bought Item Y" is insufficient for a legal or financial dispute. Instead, every autonomous transaction must be backed by a cryptographic chain of evidence that links the final payment back to the original user intent. This chain should include:
- Signed User Mandates: The original, cryptographically signed instruction from the user that defines the agent's boundaries.
- Traceable Reasoning Logs: A secure, immutable record of the agent's planning phase, showing exactly how it interpreted the user's request and why it chose a specific merchant and price.
- Verified Execution Metadata: Proof that the transaction was executed within the parameters of the mandate, including the scoped token used and the network-level validation result.
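A tamper-evident version of this evidence chain can be sketched with hash linking: each entry (mandate, reasoning step, execution record) embeds the digest of its predecessor, so an after-the-fact edit anywhere breaks verification. A real deployment would additionally sign each entry rather than only hash it; the record shapes here are illustrative.

```python
import hashlib
import json

def append_entry(chain: list, kind: str, payload: dict) -> None:
    """Link each record to the previous one via its SHA-256 digest."""
    prev_hash = chain[-1]["hash"] if chain else "0" * 64
    body = {"kind": kind, "payload": payload, "prev": prev_hash}
    digest = hashlib.sha256(
        json.dumps(body, sort_keys=True).encode()
    ).hexdigest()
    chain.append({**body, "hash": digest})

def verify_chain(chain: list) -> bool:
    """Recompute every link; any tampered entry fails verification."""
    prev = "0" * 64
    for entry in chain:
        if entry["prev"] != prev:
            return False
        body = {k: entry[k] for k in ("kind", "payload", "prev")}
        digest = hashlib.sha256(
            json.dumps(body, sort_keys=True).encode()
        ).hexdigest()
        if digest != entry["hash"]:
            return False
        prev = digest
    return True

trail = []
append_entry(trail, "mandate", {"user": "user-42", "budget": 500})
append_entry(trail, "reasoning", {"chose": "acme-travel", "why": "lowest fare"})
append_entry(trail, "execution", {"token": "tok_scoped", "amount": 320.0})
```

Editing any payload after the fact, say rewriting the reasoning entry, invalidates that entry's digest and every link after it, which is what makes the trail usable as dispute evidence.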
This level of detail is essential for distinguishing between "authorized but incorrect" transactions and "unauthorized" ones. If an agent makes a purchase that falls within its signed mandate but is based on a hallucination, the liability may lie with the developer or the user, depending on the service agreement. However, if an agent executes a transaction that exceeds its mandate, the cryptographic proof would clearly indicate a failure of the security architecture, potentially shifting liability to the orchestration layer or the credential provider.
Financial institutions and merchants are also calling for standardized "Agent-Initiated Transaction" (AIT) flags. These flags would allow participants in the payment chain to recognize the unique nature of the transaction and apply appropriate dispute resolution rules. For example, a merchant might accept a higher risk for an AIT if it is accompanied by a Verifiable Digital Credential (VDC) from a trusted provider.
The role of the "Credential Provider" becomes central in this new ecosystem. Much like a certificate authority in web security, these providers will be responsible for vouching for the identity and authorization of an agent. In the event of a dispute, they will provide the necessary cryptographic evidence to determine where the failure occurred. This shift from "trusting the agent" to "trusting the credential" is what will allow the global financial system to scale to billions of autonomous transactions.
As we conclude this exploration of the agentic payment layer, the message for security specialists is clear: we are building the infrastructure for a new economy. The security of this economy will not be found in better firewalls or more complex passwords. It will be found in the integrity of our protocols, the granularity of our authorizations, and the immutability of our audit trails. By solving the challenges of identity, intent, and accountability today, we are laying the foundation for a future where autonomous machines can transact with the same level of trust as the humans they represent.