Resource Leaderboards
The resource leaderboards provide detailed analysis of the highest resource-consuming pods in clusters, helping identify optimization opportunities and performance bottlenecks.
Top CPU Consumers
Leaderboard Interface
The resource leaderboard displays the top 10 pods by CPU usage in a comprehensive data grid:
![Figure needed]
Screenshot of resource leaderboard showing top CPU consuming pods
Grid Columns
Pod Information
- Pod Name: Kubernetes pod identifier (clickable for details)
- Namespace: Pod's namespace location
- Container Count: Number of containers within the pod
- Labels: Pod labels with intelligent truncation for display
Resource Metrics
- CPU Usage: Current CPU consumption with proper unit formatting
- Memory Usage: Current memory consumption with automatic unit conversion
- Age: Duration the pod has been running (e.g., "2d 14h", "3h 42m")
Data Features
- Sortable Columns: Click any column header to sort by that metric
- Real-time Data: Updates with dashboard refresh cycle (30 seconds)
- Unit Formatting: Automatic conversion to appropriate units (cores, GB, MB)
- Time Formatting: Human-readable time displays
Data Grid Features
Interactive Elements
![Figure needed]
Screenshot showing sortable columns and interactive elements
Column Sorting
- Click to Sort: Click any column header to sort data
- Sort Direction: Toggle between ascending and descending
- Multi-column: Sort by multiple criteria for detailed analysis
- Default Sort: Initially sorted by CPU usage (highest first)
Label Management
- Smart Truncation: Long label strings intelligently shortened
- Hover Details: Full label information on mouse hover
- Key Labels: Most important labels prioritized in display
- Readable Format: Labels formatted for easy reading
Resource Formatting
CPU Usage Display
Format Examples:
- "1.25 cores" (high usage)
- "0.5 cores" (moderate usage)
- "0.1 cores" (low usage)
- "2.8 cores" (very high usage)
Memory Usage Display
Format Examples:
- "2.5 GB" (gigabyte range)
- "512 MB" (megabyte range)
- "4.2 GB" (high memory usage)
- "128 MB" (low memory usage)
Age Formatting
Format Examples:
- "5m" (5 minutes)
- "2h 30m" (2 hours 30 minutes)
- "1d 6h" (1 day 6 hours)
- "3d 14h" (3 days 14 hours)
Analysis Capabilities
Performance Pattern Identification
High Resource Consumers
- Consistent High Usage: Pods that consistently consume significant resources
- Resource Spikes: Pods with periodic high resource usage
- Baseline Consumers: Pods with steady, predictable usage patterns
- Optimization Candidates: Pods that may benefit from resource tuning
Usage Categories
High CPU Consumers (over 1 core):
- Database pods
- Compute-intensive applications
- Machine learning workloads
- Data processing jobs
Medium CPU Consumers (0.2-1 core):
- Web applications
- API services
- Monitoring tools
- Standard business applications
Low CPU Consumers (under 0.2 core):
- Logging agents
- Sidecar containers
- Configuration pods
- Lightweight services
Optimization Opportunity Discovery
Over-provisioned Workloads
- High Limits, Low Usage: Pods with resource limits much higher than actual usage
- Waste Identification: Resources allocated but not utilized
- Right-sizing Opportunities: Potential for resource limit adjustments
- Cost Reduction: Opportunities to reduce resource costs
Under-provisioned Workloads
- Resource Pressure: Pods approaching or hitting resource limits
- Performance Impact: Workloads affected by resource constraints
- Scaling Needs: Pods that may need resource increases
- Bottleneck Identification: Resource constraints affecting performance
Use Cases
Daily Operations
Resource Monitoring
- Quick Assessment: Identify highest resource consumers at a glance
- Anomaly Detection: Spot unusual resource consumption patterns
- Capacity Planning: Understand which workloads drive resource needs
- Performance Tracking: Monitor resource usage trends over time
Issue Investigation
- Performance Problems: Find resource-intensive pods affecting cluster performance
- Resource Conflicts: Identify pods competing for resources
- Capacity Issues: Understand what's consuming cluster capacity
- Troubleshooting: Correlate performance issues with resource usage
Optimization Planning
Resource Right-sizing
- Usage Analysis: Compare actual usage to resource requests/limits
- Efficiency Improvement: Identify opportunities for better resource utilization
- Cost Optimization: Find ways to reduce resource costs
- Performance Tuning: Optimize resource allocation for better performance
Capacity Management
- Growth Planning: Understand resource growth patterns
- Scaling Decisions: Make informed decisions about cluster scaling
- Workload Distribution: Plan optimal workload distribution
- Resource Allocation: Optimize resource allocation strategies
Performance Optimization
Workload Analysis
- Resource Patterns: Understand resource consumption patterns
- Peak Usage: Identify peak resource usage periods
- Baseline Requirements: Establish baseline resource requirements
- Scaling Triggers: Determine when workloads need scaling
Cluster Efficiency
- Utilization Rates: Monitor overall cluster utilization efficiency
- Resource Balance: Ensure balanced resource usage across nodes
- Waste Reduction: Minimize resource waste and over-provisioning
- Performance Correlation: Correlate resource usage with application performance
Best Practices
Regular Monitoring
- Daily Reviews: Check top consumers daily for anomalies
- Weekly Analysis: Perform deeper analysis weekly
- Trend Tracking: Monitor changes in top consumers over time
- Pattern Recognition: Identify recurring resource usage patterns
Optimization Actions
- Resource Requests: Adjust resource requests based on actual usage
- Resource Limits: Set appropriate resource limits to prevent resource hogging
- Pod Distribution: Ensure even distribution of resource-intensive pods
- Scaling Configuration: Configure appropriate horizontal and vertical scaling
Proactive Management
- Alert Thresholds: Set up alerts for unusual resource consumption
- Capacity Planning: Use data for proactive capacity planning
- Cost Management: Monitor resource costs and optimization opportunities
- Performance Optimization: Continuously optimize based on usage patterns
Data Accuracy and Limitations
Data Sources
- Metrics Server: Real-time resource usage from Kubernetes metrics server
- Update Frequency: Data refreshed every 30 seconds with dashboard
- Accuracy: High accuracy for operational decision-making
- Scope: Covers all pods across all namespaces in the cluster
Considerations
- Point-in-time: Shows current usage, not historical averages
- Sampling: Based on most recent metrics collection
- Variability: Resource usage can vary significantly over time
- Context: Consider workload characteristics when interpreting data
Integration with Other Monitoring
Cross-reference with Node Health
- Node Correlation: Correlate top consumers with node resource pressure
- Distribution Analysis: Understand how top consumers are distributed across nodes
- Capacity Impact: See how top consumers affect overall node capacity
- Load Balancing: Ensure top consumers are properly distributed
Analytics Integration
- Namespace Analysis: Cross-reference with namespace resource analytics
- Pod Distribution: Correlate with pod distribution patterns
- Trend Analysis: Combine with historical trend data
- Optimization Planning: Integrate with capacity planning analytics
For additional cluster monitoring capabilities, see the Cluster Management section.