Datacake - Data processing delays – Incident details

Data processing delays

Resolved
Partial outage
Started 9 months ago
Lasted about 10 hours

Affected

Web Application

Operational from 11:38 AM to 11:44 AM, Partial outage from 11:44 AM to 12:03 PM, Degraded performance from 12:03 PM to 10:02 PM

Updates
  • Update

    The full post mortem can be found on our Engineering Blog: https://engineering.datacake.co/post-mortem-service-degradation-on-march-14-2024/

  • Resolved

    All queues have now been cleared with no data loss. We apologize for these delays and appreciate your understanding. In the interest of transparency and process improvement, we will shortly publish a comprehensive follow-up explaining the causes of the incident and the measures taken to prevent recurrence.

  • Monitoring

    The queries in question have now stabilized, and the queue is shrinking steadily. We'll continue to post updates on the situation here.

  • Update

    We've identified the initial root cause of the delays as a few long-running migration queries. We are actively monitoring the situation and adjusting resources as necessary to expedite the process.

  • Update

    We are still working through a long measurement queue. Please note that data transmission is still delayed.

  • Identified

    We've identified the root cause of the recent issue as unexpectedly high load on one of our databases. We've increased the resources allocated to this database and are monitoring its performance closely.

    Please note that, because of a pending backlog, temporary data gaps may appear in chart visualizations. We're working to restore complete data as soon as possible.

  • Update
    We are currently investigating this incident.
  • Investigating

    Our system is experiencing processing delays with incoming data. We are currently investigating the cause of this delay.