For the replication to be duff means the system was not fit for purpose. Plus as per my opening post, before even this level of information was released:
Surely something that could affect BA worldwide should have at least two if not more mirrored systems/hubs geographically far apart? With dedicated links with minimal latency. It was possible in my day, so what and why and how can this have happened?
I agree with
The article does state there is a secondary data centre that took up "some of the slack". Given BA's reliance on IT I would have thought the secondary data centre should be scaled to take up all of the operations in the event of a failure of the primary - that does not appear to be the case if the article is correct.
Also, if it isn't capable of taking all of the load then prioritising systems that would keep core airline operations up and running should have been key - it appears that everything died when they should have been able to keep up a percentage of key services.
in some ways, but not in others. In particular, from the article:
Under normal circumstances, power would have been returned to the servers in Boadicea House slowly, allowing the airline�s other Heathrow data centre, at Comet House, to take up some of the slack.
But, on Saturday morning, just minutes after the UPS went down, power was resumed in what one source described as �uncontrolled fashion.� �It should have been gradual,� the source went on.
This caused �catastrophic physical damage� to BA�s servers, which contain everything from customer and crew information to operational details and flight paths. No data is however understood to have been lost or compromised as a result of the incident.
BA�s technology team spent the weekend rebuilding the servers, allowing the airline to return to normal operations as of today.
Sources close to the airline indicated that had the power been restored more gradually, BA would have been able to cope with the outage, and return services far more quickly than was the case.
There is no suggestion there that the backup system was in a position to take over any functions instantaneously. It could well have been purely running remote disc mirroring.
As for the Heathrow system coming straight back up so wrecking everything. What?
The point is that there should have been no downtime at all to the international online systems. We aren't taking about the Sainsbury's national network, which could legitimately work with a central failure, with tills and stock control running happily on the instore systems.
My broadband basic info/help site -
www.robertos.me.uk. Domains, site and mail hosting -
Tsohost.
Connection - AAISP Home::1 80/20. Sync 63679/13080Kbps @ 600m. BQMs -
IPv4 &
IPv6