Portal Community

Checklist Sign-Off Template

# BizFirst Observe Production Readiness Sign-Off
# Date: YYYY-MM-DD
# Engineer: [name]
# Environment: production / staging

Checklist Results:
  [x] 1. All stack components healthy
  [x] 2. All Prometheus targets up
  [x] 3. Logs in Loki for all services
  [x] 4. Traces in Tempo
  [x] 5. Cross-signal correlation works
  [x] 6. All 10 dashboards load without errors
  [x] 7. Dashboard variables populate
  [x] 8. Alert rules in Normal state
  [x] 9. Alert notification delivered to Slack
  [x] 10. Data retention configured

Status: PASS / FAIL
Notes: [any observations]
After Passing: Set Up a Weekly Health Check

Schedule a weekly 5-minute check: verify all Prometheus targets are still "up", check that data is still flowing in each signal, and review any alerts that fired in the past week. The observability stack is infrastructure — it needs maintenance just like any other production component.