Use the following workflow as an example of the troubleshooting sequence for analyzing performance by metrics. In this scenario, performance degradation has caused a critical alert to be sent about a platinum grade consumer in Ops Center Analyzer.
Analyze response time
-
Check performance of the affected storage resources:
- See performance metrics for parity groups, MPB, I/O paths, response time, and cache write pending.
- If the storage system is modular, see performance metrics for UVM/VM response time, HDD operation rates, CPU core busy, cache write pending.
-
Consider the following components:
- HBA queue depth
- HBA settings
- Virtual queue depth settings on the ESX host
-
Check the following switch components:
- Switch configuration
- Buffer-to-buffer credit hits
- ISL links
- Hardware failures (for example, an SFP module might not be properly connected)
Analyze IOPS
- Check for workload issues.
- Is the storage tier type incorrect?
- Is the profile for this application correct?
- Is there enough disk space to handle the workload?
- Check the performance for Parity Groups, MPBs, I/O paths.
Check utilization
If throughput is high:
- Check all associated data paths for high utilization.
- Check the CHA>ESX host paths.
If throughput is low:
- Check the host paths for high utilization.
- Check for port saturation on the host.
- Consider host-based striping.