Alerts & Incidents

12 open · 5 acknowledged

Incident queue · 17

| Sev | Message | Device | Customer | Metric | Raised | SLA | Ack |
|---|---|---|---|---|---|---|---|
| | Inlet temperature 52°C above threshold on nrth-sensor-03 | nrth-sensor-03 | Northwind Logistics | env.temp | 2h 14m ago | 9m left | |
| | UPS nrth-ups-17 on battery — runtime 12m | nrth-ups-17 | Northwind Logistics | ups.battery | 2h 1m ago | 85m left | |
| | Interface error rate spike on axiom-server-23/Gi0/24 | axiom-server-23 | Axiom Health Network | if.errors | 1h 38m ago | 73m left | |
| | Interface error rate spike on axiom-firewall-24/Gi0/24 | axiom-firewall-24 | Axiom Health Network | if.errors | 18m ago | 8m left | |
| | SNMP poll timeout on axiom-ups-32 | axiom-ups-32 | Axiom Health Network | snmp.timeout | 37m ago | 38m left | |
| | Memory utilisation 91% on pier-switch-37 | pier-switch-37 | Pier 9 Manufacturing | mem.used | 12m ago | breached -6m | s.okafor |
| | UPS pier-switch-38 on battery — runtime 12m | pier-switch-38 | Pier 9 Manufacturing | ups.battery | 3h 49m ago | 30m left | |
| | Inlet temperature 59°C above threshold on pier-ups-41 | pier-ups-41 | Pier 9 Manufacturing | env.temp | 2h 41m ago | 0m left | |
| | Running config changed outside change window on pier-ups-42 | pier-ups-42 | Pier 9 Manufacturing | config.drift | 1h 46m ago | 45m left | |
| | Device nrth-firewall-01 not responding to ICMP | nrth-firewall-01 | Northwind Logistics | icmp.reach | 36m ago | 22m left | a.patel |
| | CPU above 90% for 5m on nrth-firewall-02 | nrth-firewall-02 | Northwind Logistics | cpu.5m | 3h 42m ago | 86m left | |
| | Inlet temperature 52°C above threshold on nrth-sensor-03 | nrth-sensor-03 | Northwind Logistics | env.temp | 56m ago | 71m left | a.patel |
| | Interface error rate spike on nrth-router-04/Gi0/24 | nrth-router-04 | Northwind Logistics | if.errors | 32m ago | 34m left | |
| | CPU above 90% for 5m on nrth-switch-05 | nrth-switch-05 | Northwind Logistics | cpu.5m | 11m ago | 61m left | |
| | Interface error rate spike on nrth-pdu-06/Gi0/24 | nrth-pdu-06 | Northwind Logistics | if.errors | 3h 49m ago | 32m left | |
| | Inlet temperature 37°C above threshold on nrth-sensor-07 | nrth-sensor-07 | Northwind Logistics | env.temp | 1h 45m ago | 32m left | a.patel |
| | Device nrth-switch-08 not responding to ICMP | nrth-switch-08 | Northwind Logistics | icmp.reach | 56m ago | 12m left | m.chen |

Escalation timeline · a-001

  1. +00:00 · Threshold crossed: cpu.5m > 90% · engine
  2. +00:01 · Notification sent to NOC channel · email + webhook
  3. +00:04 · Acknowledged by m.chen · operator
  4. +00:11 · Customer notified (auto) · email
  5. +00:18 · Escalated to L2 — runbook RB-04 attached · automation
  6. +00:32 · Mitigated: reload of process queue · operator