Resolved -
Cross-checks of DNS records have been succeeding for some time, and customers report that negative caches have since expired. Likely cause: a race condition, exposed under higher parallelism, that caused DNS records to be misapplied. We will analyze and patch it in the coming days.
Feb 6, 22:13 UTC
Monitoring -
We've identified the problem and believe negative caching of DNS records is all that remains. We will continue checking for at least thirty minutes, the longest negative-caching duration reported so far.
Feb 6, 21:41 UTC
Identified -
Some records are proving more difficult to address than we expected.
Feb 6, 21:35 UTC
Monitoring -
We've hotfixed the remaining records. Negative caching of DNS can slow recovery, depending on the upstream DNS server. We will remain in contact with all affected customers until all symptoms are resolved.
Feb 6, 21:16 UTC
Investigating -
There is a defect in Postgres that causes unbound DNS records. Negative caching can prolong symptoms even after our fix is applied. Databases affected: many at the outset, some now. We are diagnosing the remaining records.
Feb 6, 20:00 UTC