How one CTO avoided a Web site disaster after data center fire
Flickr photo via Jamison_Judd
Most Seattle geeks probably didn't think they'd be spending a portion of their 4th of July holiday dealing with broken Web sites, back-up generators and damaged servers. But the small fire at the Fisher Plaza data center in downtown Seattle late last night knocked a number of sites offline for most of Friday, raising questions on TechFlash about how companies handle disaster planning and server co-location.
We first learned of the problem around 1 a.m., when Seattle-based Redfin posted a message on Twitter noting that its real estate site was offline because of problems at the data center. But by 4 a.m. Redfin's site was back online and purring along while other sites struggled.
We asked Redfin CTO Michael Young how the company avoided the catastrophic failure that other sites are experiencing today. It turns out the company learned some important lessons after a similar electrical fire hit the same data center last June.
Here's what Young told TechFlash today.
We were pretty embarrassed last June when Adhost had a similar electrical fire and took our site down for 8 hours (well into our core business hours) with brown-outs a day or two after that had us scrambling. 'Fool me once, shame on you; fool me twice, shame on me' resonated in our brains.
So by October 2008, we basically instituted a disaster avoidance plan where we had redundant-everything for our mission-critical databases, servers and networks in separate buildings.
When the problem happened last night, our beepers went off, we saw what looked like a major outage in one building, and were able to switch to the redundant systems.
Everything was up and running by 4am PST / 7am EST, well before our core business hours. We’re a startup, but we try to maintain high standards in our datacenter operations without spending too much money. The failover didn’t happen at the push-of-a-button, but the disaster planning paid off for us.
Young's explanation is interesting given that many sites -- including high-profile consumer-oriented sites such as AllRecipes, Bing Travel and Big Fish Games -- have been offline most of the day.
I have a feeling there will be some high-level meetings with CTOs, IT administrators and co-location operators on Monday discussing some of the ways to make sure this doesn't happen again.
I asked Young -- who was up at 5 a.m. dealing with the situation -- why other larger companies didn't appear to have a similar plan in place.
"It's hard to get every single point of failure," said Young. "And most people need to be burned once, like us."