Incident Report
Date: January 24, 2025
Time: 11:30 AM UTC
Reported By: Incident Response Team
Incident ID: IR-20250124-001
---
Degraded Performance for OBS Pay and OBS Pay APIs; Partial Outage for OBS SecuRA Runtime Environment and Encryption APIs
---
Incident Timeline
Detection: January 24, 2025, 9:00 AM UTC
First User Report: January 24, 2025, 9:15 AM UTC
Mitigation Initiated: January 24, 2025, 9:30 AM UTC
Partial Restoration: January 24, 2025, 10:30 AM UTC
Full Resolution (Estimated): January 24, 2025, 1:00 PM UTC
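Elapsed time between milestones can be derived directly from the timestamps above. The following is a minimal Python sketch; the dictionary keys and the function name are illustrative only, and the final entry uses the estimated resolution time from this report, not a confirmed value.

```python
from datetime import datetime, timezone

# Timestamps copied from the Incident Timeline (all UTC).
TIMELINE = {
    "detection": datetime(2025, 1, 24, 9, 0, tzinfo=timezone.utc),
    "first_user_report": datetime(2025, 1, 24, 9, 15, tzinfo=timezone.utc),
    "mitigation_initiated": datetime(2025, 1, 24, 9, 30, tzinfo=timezone.utc),
    "partial_restoration": datetime(2025, 1, 24, 10, 30, tzinfo=timezone.utc),
    # Estimated in the report, not yet confirmed.
    "full_resolution_estimated": datetime(2025, 1, 24, 13, 0, tzinfo=timezone.utc),
}


def minutes_since_detection(timeline):
    """Return minutes elapsed from detection to every other milestone."""
    start = timeline["detection"]
    return {
        name: int((ts - start).total_seconds() // 60)
        for name, ts in timeline.items()
        if name != "detection"
    }


if __name__ == "__main__":
    for name, minutes in minutes_since_detection(TIMELINE).items():
        print(f"{name}: +{minutes} min")
```

With these inputs, mitigation began 30 minutes after detection, partial restoration followed at 90 minutes, and the estimated full resolution falls 240 minutes after detection.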
---
Affected Services
1. Degraded Performance:
OBS Pay
OBS Pay APIs
2. Partial Outage:
OBS SecuRA Runtime Environment
OBS SecuRA Encryption APIs
---
Incident Description
OBS Pay and OBS Pay APIs:
Between 9:00 AM and 11:00 AM UTC, users experienced significant delays in payment processing and intermittent API failures. Payment-processing latency increased by more than 67.3%, and some API requests timed out. Initial diagnostics pointed to high CPU utilization on the database servers, caused by an unexpected spike in transaction volume.
OBS SecuRA Runtime Environment and Encryption APIs:
A partial outage affected the SecuRA Runtime Environment and its associated encryption APIs. Approximately 40% of API requests failed because a service node experienced connectivity issues with the central orchestration layer. This disrupted the encryption and decryption processes that secure transactions depend on.
---
Impact Assessment
1. OBS Pay and APIs:
Transactions Affected: An estimated 25,000 transactions were delayed or failed globally.
Severity: Medium
Financial Impact: Pending calculation based on transaction delays.
2. OBS SecuRA Services:
Scope: Approximately 40% of encryption requests from users relying on SecuRA APIs were affected.
Severity: High
Security Impact: No evidence of data compromise; issue limited to service availability.
---
Root Cause Analysis
OBS Pay and APIs:
Primary Cause: Increased transaction volume caused database contention, which led to slow query responses and API timeouts.
Contributing Factors: Auto-scaling thresholds were inadequate for peak load.
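To illustrate how a threshold setting can delay scale-out, here is a minimal Python sketch of a proportional scaling rule (replicas scaled by observed utilization over target utilization). This report does not state which auto-scaling mechanism or thresholds are in use, so the function name, the tolerance band, and every number below are hypothetical.

```python
import math


def desired_replicas(current, cpu_percent, target_percent,
                     max_replicas=10, tolerance=0.1):
    """Proportional scaling rule: scale replicas by observed/target utilization.

    All parameters are hypothetical; the report does not give real values.
    """
    ratio = cpu_percent / target_percent
    # Within the tolerance band, leave the replica count unchanged.
    if abs(ratio - 1.0) <= tolerance:
        return current
    wanted = math.ceil(current * cpu_percent / target_percent)
    # Clamp to the allowed range so a spike cannot request unlimited capacity.
    return max(1, min(max_replicas, wanted))


if __name__ == "__main__":
    # Same load, two different targets (illustrative values only).
    print(desired_replicas(4, 90, 85))  # high target: no scale-out
    print(desired_replicas(4, 90, 60))  # lower target: scales out
```

Under these assumed values, a target of 85% leaves the fleet at 4 replicas even at 90% CPU, while a target of 60% requests 6. A threshold set too close to saturation reacts only after contention has already begun.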
OBS SecuRA Services:
Primary Cause: A failed service node caused connectivity disruptions with the orchestration layer.
Contributing Factors: No failover was ready for the affected node, and health-check responses were delayed.
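The interaction between health checks and failover can be sketched as follows. This is a minimal Python illustration, not the SecuRA implementation: the node names, the probe callable, and the deadline value are all assumptions. The idea is that a probe that answers late is treated the same as one that fails, so traffic moves to the next node instead of waiting on a degraded one.

```python
import time


def is_healthy(probe, node, deadline_s):
    """Treat a node as unhealthy if its probe fails, raises, or answers late.

    This only measures elapsed time; it does not interrupt a probe that
    hangs, so the probe itself should enforce its own timeout.
    """
    started = time.monotonic()
    try:
        ok = probe(node)
    except Exception:
        return False
    elapsed = time.monotonic() - started
    return bool(ok) and elapsed <= deadline_s


def pick_node(nodes, probe, deadline_s=0.5):
    """Return the first node passing the health check, or None if all fail."""
    for node in nodes:
        if is_healthy(probe, node, deadline_s):
            return node
    return None
```

A caller would pass its own probe, for example a function that issues a lightweight request to the node and returns True on success. Returning None when every node fails lets the caller decide whether to reject requests or queue them.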