Incident Workspace
Service Metrics
Environment
Production
Error Rate %
1%
avg · 1h
P95 Latency ms
48ms
avg · 1h
Request Volume k/min
13k
avg · 1h
Success Rate %
99%
avg · 1h
order-service
All metrics nominal; no indication of a problem from the application layer
Orders processed (1h)
2,841
Orders confirmed missing
47
Errors in last 1h
0
Incident Scope
Affected time window
11:34:01–11:34:03 UTC
Duration of data loss risk
2 seconds
Orders in window
47 confirmed missing
Orders before window
2,794 (all durable)
Orders after failover
Ongoing: durable (same risk)
Client notification
Pending (required)
Endpoint Breakdown: Last Hour
POST /confirm-order
2,841 requests
P95
48ms
Errors
0
GET /order/:id
14,220 requests
P95
12ms
Errors
0
GET /orders
3,104 requests
P95
34ms
Errors
0
DELETE /order/:id
218 requests
P95
22ms
Errors
0
The logs say it succeeded. The database says it never happened. Both are correct.
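
A gap like this does not show up in error counts, so one way to surface it is to reconcile the order IDs the service acknowledged against the IDs the database actually holds. The following is a minimal sketch of that check, not the team's actual tooling: the function name, the order IDs, and the sample timestamps are all hypothetical, and only the 11:34:01 to 11:34:03 UTC window comes from the incident scope above.

```python
from datetime import time

def find_missing_orders(acked, durable_ids, window_start, window_end):
    """Return IDs acknowledged inside the window but absent from the database.

    acked        -- dict mapping order ID to the UTC time of day it was acknowledged
    durable_ids  -- set of order IDs currently present in the database
    """
    return sorted(
        order_id
        for order_id, acked_at in acked.items()
        if window_start <= acked_at <= window_end and order_id not in durable_ids
    )

# Hypothetical sample data: four acknowledged orders, two of them never persisted.
acked = {
    "o-1": time(11, 33, 59),  # before the window
    "o-2": time(11, 34, 2),   # inside the window
    "o-3": time(11, 34, 3),   # inside the window (end is inclusive)
    "o-4": time(11, 34, 5),   # after the window
}
durable_ids = {"o-1", "o-4"}

missing = find_missing_orders(acked, durable_ids, time(11, 34, 1), time(11, 34, 3))
print(missing)
```

Run against the real acknowledgement log and a database snapshot, a check of this shape should return the 47 orders confirmed missing, and the resulting list can drive the pending client notification.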