Prisma SD-WAN Troubleshooting AI Agent
Focus
Focus
Prisma SD-WAN

Prisma SD-WAN Troubleshooting AI Agent

Table of Contents

Prisma SD-WAN Troubleshooting AI Agent

Understand how the Prisma SD-WAN Troubleshooting AI Agent autonomously diagnoses network issues and accelerates root cause analysis for network operations teams.
Where Can I Use This?What Do I Need?
  • Prisma SD-WAN (Managed by Strata Cloud Manager)
  • Prisma SD-WAN
The Prisma® SD-WAN Troubleshooting Agent is a specialized, autonomous AI Agent that dynamically troubleshoots and determines root causes for network operational issues, including configuration errors, log analysis, link quality problems, route reachability failures, and interface errors. This agent supports a proactive investigation approach, moving your team away from manual, time-consuming war room troubleshooting sessions.

Troubleshooting AI Agent Reasoning Framework

You invoke the Prisma SD-WAN Troubleshooting AI Agent through the Strata Co-pilot interface. The agent performs the following steps to identify the root cause of the issue you specify:
  • Context Gathering: The AI agent dynamically gathers contextual data when invoked. The agent collects context by querying incidents, alerts, and other metadata.
  • RAG Integration: The Troubleshooting Agent is designed to troubleshoot dynamically and root cause a wide variety of network operational use cases like configuration errors, log analysis, link quality issues, route reachability issues, and interface errors. The AI Agent uses Retrieval Augmented Generation (RAG) to retrieve the product documentation, knowledge base, and product architecture information, thus enhancing the accuracy of the Agent.
  • Dynamic Playbook Generation: The Troubleshooting agent dynamically builds a custom troubleshooting plan based on real-time context, for example, for an issue related to application slowness, the AI Agent will dynamically build a plan to check the underlay link circuit, application metrics, performance, and QoS policy configuration to then dynamically correlate multiple data points.
  • Backend Tool Access: The Troubleshooting Agent then takes the action of calling the backend tools specific to the generated action plan. Multiple backend tools like site analyzer, QoS config analyzer, circuit and log analyzer, and application analyzer are invoked, and output from those tools is correlated at run time.
  • Correlation across multiple data points: The agent correlates multiple data points in near real-time, including interface status, system logs, errors, and recent configuration changes.
  • Accelerated Diagnostics: By concurrently running diagnostic steps across the entire SD-WAN fabric, the agent correlates static network policies with dynamic data plane traffic significantly faster than manual processes, thus lowering the Mean Time To Resolution (MTTR).
  • Transparent Reasoning: Every conclusion reached by the Troubleshooting AI Agent is backed by verifiable backend data, giving admins full visibility to monitor the diagnostic process and validate the Root Cause Analysis (RCA).
  • Fallback Guidance: If a root cause exists outside the SD-WAN environment (for example, a local switch port failure), the agent provides recommended next steps for adjacent infrastructure.

Key Performance Indicators

The following metrics track the performance of the AI agent:
  • System Health Metrics:
    • Uptime: Availability of the Agent and tools invoked by the Agent.
    • Latency: Latency per invocation and task completion (The AI Agent has read-only permissions within the system, so task completion will measure the time taken to arrive at the root cause of the issue).
    • Scalability: Monitor backend resource usage during peak query periods.
  • Quality/Efficacy Metrics:
    • Accuracy: Root cause identification success rate, validated by network admin feedback.
    • Issue Complexity: Measured in terms of the number of backend tools invoked and the number of data points correlated by the Agent, with weighing mechanisms for each metric.
    • Efficacy per Product Area: Metrics per product sub-area, for example, routing/VPN/ configuration/QoS/Security policy, etc.

Quantifiable Improvements

The Prisma SD-WAN Troubleshooting AI Agent accelerates your daily network operations by delivering data-backed root cause analysis with detailed reasoning steps. You retain complete control and safety through mandatory human-in-the-loop (HITL) oversight, strict task boundaries, and clear, system-generated remediation plans. By automatically identifying deep-rooted issues and proposing remediation plans, you can significantly reduce network downtime.

Security and Governance

  • Human Oversight: While the agent plans and identifies remediation, the final execution of changes remains subject to administrator approval.
  • Explainable AI: Every diagnostic step is logged with a 'reasoning path', allowing NetOps teams to verify the agent's logic before proceeding.

Summary

The transition to an autonomous, agentic NetOps environment provides an essential framework for modern network administration. By using the Prisma SD-WAN Troubleshooting AI agent, you streamline daily network operations, eliminate reactive troubleshooting, and future-proof your infrastructure for agentic network operations.