Strata Cloud Manager Alert "Process Memory Depletion - Management Server"
13566
Created On 02/15/22 21:21 PM - Last Modified 09/17/24 21:48 PM
Symptom
- Alert from Strata Cloud Manager regarding process memory depletion for "mgmtsrvr"
Environment
- PAN-OS
- SCM Alert
Cause
A memory depletion in the "mgmtsrvr" process was found.
Resolution
If you receive this Alert, it is recommended to collect the following Troubleshooting Data below and open a Support Case. After data is collected, considering following the Mitigation Steps to bring down the memory usage of the mgmtsrvr process till Support can analyze the data.
Troubleshooting Data
-
Collect Tech Support File (GUI: Device > Support Click Generate Tech Support File)
-
Generate a trace file using the following CLI command
debug software trace management-server
-
Generate a core file using following command
debug software core management-server
- Collect the Device State (GUI: Device>Setup>Operations- Export: Export device state)
- Export the core file (HOW TO EXPORT CORE FILES FROM A PALO ALTO NETWORKS DEVICE)
- Gather data below from Strata Cloud Manager:
- Check the dates indicated by Strata Cloud Manager as to when the memory depletion started.
- Check if there were any config modifications, PANOS upgrades/downgrades, or any other changes performed around the time that might have triggered this behavior.
- (Optional) If performing Mitigation Steps below, collect another Tech Support File after completing steps
- Open a case with the above data.
Mitigation Steps
Till the issue is resolved, you can restart the mgmtsrvr process that to bring down the memory usage of management-server.NOTE: Recommended to be performed in a maintenance window.
Potential Impact of restart the process:
- After PAN-OS 10.1, mgmtsrvr on firewall serves as a message dispatcher(Similar to panorama), Backend daemons such as configd, distributord, iotd, logrcvr, pl-dlp_agent, reportd, and useridd will be restarted.
- Prior to PAN-OS 10.1, following functionality on firewall will not be available during the process restart,
- UI and CLI access
- HA Sync
- Panorama push
- Dynamic Updates
- The mgmtsrvr on Panorama serves as a message dispatcher, All backend daemons are dependent on mgmtsrvr, when mgmtsrvr is restarted, all backend daemons will be restarted. Single access point to all the firewalls via proxy session on GUI.
Option 1 (Standalone Device)
- Save and export the candidate config.
- Save and export the current configuration.
- Perform a full commit
- Restart the management-server process using below command
debug software restart process management-server
- (For PAN-OS 10.0.X or 10.1.X) Restart the device-server to ensure that the commits go through without a problem.
debug software restart process device-server
Option 2 (Device in Active/Passive HA)
- Disable "Preemptive" mode (GUI: Device > High Availability > General > Election Settings: Uncheck Preemptive )
- Failover to the passive device (From Active Device: Device > High Availability > Operations > Click Suspend local device)
- Restart the management-server from the CLI from the now Suspended device (debug software restart process management-server)
- (For PAN-OS 10.0.X or 10.1.X) Restart the device-server to ensure that the commits go through without a problem. (debug software restart process device-server)
- From CLI run show management-clients to ensure that all processes have started successfully.
> show management-clients
Client PRI State Progress
-------------------------------------------------------------------------
ha_agent 25 init 0
sslmgr 10 init 0
authd 10 init 0
cryptod 10 init 0
dagger 10 init 0 (op cmds only)
cord 10 init 0
logd 10 init 0 (op cmds only)
reportd 10 init 0 (op cmds only)
useridd 10 init 0
distributord 10 init 0
iotd 10 init 0
Overall status: init. Progress: 0
Warnings:
Errors:
NOTE: Restarting the process will temporarily mitigate the process growth, but the issue may re-surface.
Additional Information
PAN-175211 - Fixed a memory leak issue in the (mgmtsrvr) process. (Fixed on PAN-OS 9.0.16, 9.1.13, 10.0.9, 10.1.4)
PAN-183774 - Fixed an memory leak issue in the mgmtsrvr process, which resulted in an out-of-memory (OOM) condition and high availability (HA) failover. (Fixed on PANOS 9.1.13, 10.0.10, 10.1.5)