Monitoring
15 skills
All Monitoring Skills
Distributed Tracing
Implement distributed tracing with Jaeger and Tempo to track requests across microservices and identify performance bottlenecks.
Startup Metrics Framework
Key metrics for startup success and performance optimization
Incident Runbook Templates
Create structured incident response runbooks with step-by-step procedures, escalation paths, and recovery actions.
Prometheus Configuration
Set up Prometheus for comprehensive metric collection, storage, and monitoring of infrastructure and applications.
Grafana Dashboards
Create and manage production Grafana dashboards for real-time visualization of system and application metrics.
Marketplace Manager
Automatically manages marketplace catalog updates and plugin distribution.
Propositional Logic
Problem-solving strategies for propositional logic in mathematical logic
Predicate Logic
Problem-solving strategies for predicate logic in mathematical logic
Proof Theory
Problem-solving strategies for proof theory in mathematical logic
Service Mesh Observability
Implement comprehensive observability for service meshes including distributed tracing, metrics, and visualization.
Slo Implementation
Define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs) with error budgets and alerting.
Appinsights Instrumentation
Instrument a webapp to send useful telemetry data to Azure App Insights
On Call Handoff Patterns
Master on-call shift handoffs with context transfer, escalation procedures, and documentation.
Postmortem Writing
Write effective blameless postmortems with root cause analysis, timelines, and action items.
Examples Auto Run
Run python examples in auto mode with logging, rerun helpers, and background control.