--- description: Monitor system health and performance metrics argument-hint: [monitor-type] [time-range] --- # System Monitoring Command Monitor system health, performance metrics, and operational status with comprehensive analysis. ## Context - Monitor type: $1 (health|performance|errors|all - default: all) - Time range: $2 (1h|6h|24h|7d - default: 24h) - System status: !`ps aux | head -10` - Disk usage: !`df -h` - Memory usage: !`free -h` ## Monitoring Analysis ### 1. **System Health Check** - Service status and availability - Resource utilization (CPU, memory, disk) - Network connectivity and latency - Database connection and performance ### 2. **Performance Metrics** - Response times and throughput - Error rates and success rates - Queue depths and processing times - Cache hit rates and efficiency ### 3. **Error Analysis** - Error frequency and patterns - Critical error identification - Performance degradation detection - Anomaly detection and alerting ### 4. **Operational Status** - Deployment status and version - Configuration validation - Security posture assessment - Compliance status check ## Monitoring Thresholds - **CPU Usage**: < 80% - **Memory Usage**: < 85% - **Disk Usage**: < 90% - **Response Time**: < 500ms (p95) - **Error Rate**: < 1% ## Expected Outcome - Comprehensive system health report - Performance metrics and trends - Error analysis and recommendations - Operational status summary ## Alert Conditions Immediate attention required for: - Critical service failures - Performance degradation > 50% - Security incidents or breaches - Resource exhaustion warnings ## Recommendations Based on monitoring data: - Performance optimization opportunities - Capacity planning suggestions - Infrastructure scaling recommendations - Preventive maintenance actions