Apr 19, 16:44 UTCResolved. Webhook queue fully drained. All endpoints receiving events in normal time. Post-mortem scheduled for Apr 22.
Apr 19, 15:22 UTCMonitoring. Queue backlog clearing at ~1,200 events/min. Estimated full recovery in 45 minutes.
Apr 19, 14:58 UTCInvestigating. Identified queue consumer crash in eu-west-1b. Restarted with increased worker pool. Payment processing unaffected.