On last.fm: Coldplay - Listen now!
BNET Business Network:
BNET
TechRepublic
ZDNet

June 22nd, 2007

Google's three rules

Posted by Robin Harris @ 12:29 pm

Categories: Clusters, Infrastructure

Tags: Google Inc., Data Center, Google, Robin Harris

They roll out new applications for millions of users with surprising speed, especially compared to corporate IT. They build data centers with hundreds of thousands of servers - and millions of disk drives - and run it all on free software.

Costly corporate kit, like RAID arrays and 15k FC drives, aren’t used. Yet they do more work in an hour than most companies do in a year.

Google’s IT capabilities are a modern wonder of the world. Underneath the complexity though are just three simple rules. Rules that no enterprise data center (EDC) would ever think of following.

I’m attending the Google scalability conference (see a short version of the agenda here) in Bellevue, WA tomorrow, which got me thinking about the Google rules of IT.

This is not your father’s data center
How does a Google data center differ from an EDC? Other than using electricity, in just about every way that matters.

Cheap
The key to Google’s competitive strategy is that they have the cheapest compute, network and storage (CNS) in the industry. Free or home-made software. Mass produced - by Intel, these days - servers-on-a-board with network, storage and energy-efficient dual-core processors. SATA drives. Unmanaged 48 port switches.

EDCs don’t care about cost. They focus on uptime. The low-volume hardware they buy is reliable and very expensive. As a result, EDC services growth is much less than Moore’s Law. EDCs are nursing homes for aging apps, not hotbeds of innovation.

Embrace failure
Cheap also means things break. And when you’ve got several million servers, lots of things break every day. Get over it. Google expects failure and builds recovery into the software layer that connects the cheap kit.

The EDC buys low-volume kit that tries to engineer-out failure. Google gets uptime by building failover on top of the hardware, not into it. Today’s data center guys break out in hives just thinking about it. Twenty years from everyone will do it that way, but not today.

Architect for scale
This is the flip side of cheap. Google hired some of the best minds in the business to architect for scale. They have multiple 8,000 node clusters that they’ve talked about and I wouldn’t be surprised if they’ve got some up 16,000 nodes.

Architecting for scale leverages cheap CNS to give Google the lowest-cost growth as well. Competitors such as Yahoo, who rely more on standard EDC products, can do the same things as Google, but it costs them about 10x in capital expense and several times the operations expense.

Fast growing applications play to Google’s strength.

The Storage Bits take
If Google has it all figured out, why the scalability conference? Good question. I think they’d like more scalability: 40,000 node clusters; 4 million processor data centers; exabyte storage. This isn’t just about gluing the bits together to get work done, either. They want lower power consumption, cheaper hardware, faster protocols and better software.

This is more than first-mover advantage. The faster they can grow, the greater their cost advantage over smaller, less nimble competitors. Their ROI brings them cheap capital, which increases their ability to invest in new businesses and more capacity. The higher their volumes, the cheaper growth becomes. A perfect storm.

Google is not invincible, by any means. Their marketing is pathetic. The concentration of power in the hands of three largely untried individuals means a major cock-up is only a question of when, not if. The stagnant share price puts pressure on management to increase returns by cutting back on costly perks. Google’s purpose-built infrastructure is also relatively inflexible: they can’t just paste on ACID transaction processing.

But that is all in the future. Tomorrow I’m looking forward to hearing about the latest in scalable systems from the industry’s leading innovators.

Comments welcome.

Robin HarrisRobin Harris has been messing with computers for over 30 years and selling and marketing data storage for over 20 in companies large and small. See his full profile and disclosure of his industry affiliations.


Email Robin Harris

Subscribe to Storage Bits via Email alerts or RSS.

  • Talkback
  • Most Recent of 31 Talkback(s)
Tapes are not everything
Google's distributed architecture should do away with the need for outmoded and slow tapes.

It's about time we all got away from tapes. They are slow, unreliable and inflexible.

My pref... (Read the rest)
Posted by: Ian Lewis Posted on: 07/10/07 You are currently: a Guest | | Terms of Use
why not?  CWButler | 06/25/07
Search vs. Enterprise Apps  johndoe445566 | 06/25/07
Enterprise applicaiton especially ERP  maldain | 06/25/07
Yep...Application-Specific  tpbishop@... | 06/26/07
finally someone sees sense  ismoore | 06/25/07
Google will suprise everyone  jim.aimone@... | 06/25/07
Google and 700 Mhz  jim.aimone@... | 06/25/07
Workload Matters  davea_hm | 06/25/07
You are correct  R HarrisZDNet Moderator | 06/27/07
It works and not just for google  sysop-dr | 06/25/07
Good ole Google  cjs@... | 06/25/07
Google Lucky  wademan | 06/25/07
Not really  maldain | 06/25/07
Lucky You  Dominick-Murphy | 06/27/07
why lucky?  ismoore | 06/25/07
The danger behind "Nobody got fired for buying from IBM"  venkats2000 | 06/25/07
It is the only way to go  mames1701 | 06/25/07
Re: It is the only way to go  tsmith@... | 06/25/07
A good reason to NOT buy boxed software with no source code.  DonnieBoy | 06/26/07
Hard to replicate  smithkl42 | 06/25/07
On a smaller scale, you do not need everything that Google has. You can  DonnieBoy | 06/26/07
This is why Amazon is the bigger threat  R HarrisZDNet Moderator | 06/27/07
Google 'Keep products in Beta'  rtb | 06/25/07
Not invincible, maybe, but obviously flexible, innovative, and effective,  mhenriday | 06/26/07
Typo  spyro17@... | 06/26/07
No tape backup?  jgaskell | 06/27/07
Only if it is all in one server room  R HarrisZDNet Moderator | 06/27/07
Tapes are not everything  Ian Lewis | 07/10/07
Cheap Storage...  andy@... | 06/27/07
And rule #4:  HypnoToad72 | 06/29/07
Google first to the post?  d.s.williams | 07/02/07

What do you think?

SponsoredWhite Papers, Webcasts, and Downloads

Click Here
advertisement

Recent Entries

advertisement

Archives

Favorite Links

ZDNet Blogs

White Papers, Webcasts, and Downloads

Meet Doc

  • Here to help you with your Document Management Needs
  • Doc is an enigma. Born to a Russian ballerina and a German electrical engineer, he grew up in various locations in the United States. He’s seen the insides of more brands, versions, and generations of printer and printer-related hardware than almost anyone.
  • To learn more about this mysterious figure check out his blog on ZDNet and his Workspace on TechRepublic. You’ll be glad you did.
  • Produced by
    ZDNet and