🖥️

Real incidents need a real screen.

Open senioreng.dev on your laptop for the full experience.

Live·00:00elapsed

Incident Workspace

Service Metrics

Environment

Production

P95 Latency ms

3.9s

avg · 1h

11.2s8.4s5.6s2.8s0
190ms
188ms
191ms
190ms
192ms
189ms
190ms
192ms
11.2s
11.2s
11.2s
11.2s
09:0009:0609:1209:18

Error Rate %

2%

avg · 1h

5%4%3%2%0%
1%
1%
1%
1%
1%
1%
1%
1%
3%
3%
3%
3%
09:0009:0609:1209:18

Request Volume k/min

13k

avg · 1h

15k11k7k4k0
12k
12k
13k
12k
13k
12k
12k
13k
13k
12k
13k
13k
09:0009:0609:1209:18

DB CPU %

38%

avg · 1h

100%75%50%25%0%
9%
9%
10%
9%
10%
9%
9%
10%
94%
94%
94%
94%
09:0009:0609:1209:18

Production Incident

Catalog Under Pressure

On-Call Alert

catalog-service P95 latency spiked from 190ms to 11 seconds. DB CPU at 94%. Traffic is normal.

Meridian Commerce serves 2 million shoppers. The product catalog has been slow for the past 18 minutes. DB CPU jumped from 9% to 94% with no change in traffic. A new feature shipped this morning.

You're on-call. Investigate and restore service.