Your HCI installation includes a custom Grafana dashboard, the HCI Dashboard, designed to provide top-level insight into your system. It gives an overview of key metrics so that you can assess the health and status of your system at a glance. The dashboard combines multiple data sources, so critical system information is visible in one place.
This default dashboard also acts as an example of the wide variety of information available to you when creating dashboards of your own.
To learn more about creating your own custom Grafana dashboards, refer to the official Grafana documentation.
Overview
- Global CPU Usage and CPU Usage: Displays the real-time CPU consumption, resource requests, and current usage limits across the entire cluster.
- Global RAM Usage and Ram Usage: Displays the real-time RAM consumption, resource requests, and current usage limits across the entire cluster.
- Pod Status: Displays the current health status of all of your available pods.
- Node Status: Displays the current health status of all of your available nodes.
- Cluster Disk Usage: Displays the disk usage across your cluster.
Cluster Health Issues
- Pod Health Issues: Displays any current issues with your system's pods.
- Node Health Issues: Displays any current issues with your system's nodes.
Resources
- Cluster CPU Utilization (Total): Displays a graph of CPU usage for the cluster as a whole.
- Cluster Memory Utilization (Total): Displays a graph of RAM usage for the cluster as a whole.
- CPU Utilization by Node: Displays a graph of CPU usage, with a separate series for each node.
- Memory Utilization by Node: Displays a graph of RAM usage, with a separate series for each node.
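The difference between the "Total" and "by Node" panels can be illustrated with a small Python sketch. This is only an illustration: the node names and sample values are hypothetical, and the dashboard's actual queries are not described here, so treat the aggregation shown (total used divided by total capacity) as an assumption rather than the dashboard's method.

```python
# Hypothetical per-node samples as (used, capacity) in CPU cores.
# These values are invented for illustration, not taken from a real cluster.
samples = {
    "node-a": (2.0, 8.0),
    "node-b": (6.0, 8.0),
    "node-c": (1.0, 4.0),
}


def utilization_by_node(samples):
    """Return the fraction of capacity in use on each node."""
    return {node: used / capacity for node, (used, capacity) in samples.items()}


def cluster_utilization(samples):
    """Return cluster-wide utilization: total used over total capacity."""
    total_used = sum(used for used, _ in samples.values())
    total_capacity = sum(capacity for _, capacity in samples.values())
    return total_used / total_capacity


per_node = utilization_by_node(samples)
total = cluster_utilization(samples)
```

With these sample values, one node is at 75% while the cluster as a whole is at 45%, which is why a per-node view can reveal a hot spot that the cluster-wide graph hides.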
Network
- Global Network Utilization by Network Interface: Displays the current incoming and outgoing bandwidth being used across all network interfaces in the Kubernetes cluster.
- Total Network Receive (with all virtual devices) by instance: Displays the current incoming bandwidth on each instance in the Kubernetes cluster, including traffic on all virtual devices.
- Network Received (without loopback) by instance: Displays the current incoming bandwidth on each instance, excluding traffic from the loopback interface.
- Network Received (loopback only) by instance: Displays the current incoming bandwidth on each instance for the loopback interface only.
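If you want to build similar panels of your own, the three receive views above differ only in how they filter network devices. The following Python sketch builds PromQL query strings for each view. It assumes the data source is Prometheus scraping node_exporter, which exposes `node_network_receive_bytes_total` with a `device` label, and that the loopback device is named `lo`. The helper name `receive_rate_query` and the 5-minute window are illustrative choices; the queries used by the HCI Dashboard itself may differ.

```python
def receive_rate_query(device_matcher: str = "", window: str = "5m") -> str:
    """Build a PromQL query for per-instance receive bandwidth.

    device_matcher is an optional label matcher such as 'device!="lo"'.
    Assumes node_exporter metrics; this helper is hypothetical.
    """
    selector = f"{{{device_matcher}}}" if device_matcher else ""
    return (
        "sum by (instance) "
        f"(rate(node_network_receive_bytes_total{selector}[{window}]))"
    )


# All devices, including virtual ones.
ALL_DEVICES = receive_rate_query()
# Everything except the loopback interface.
WITHOUT_LOOPBACK = receive_rate_query('device!="lo"')
# Loopback interface only.
LOOPBACK_ONLY = receive_rate_query('device="lo"')
```

Each resulting string can be pasted into the query field of a Grafana panel that uses a Prometheus data source.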