On The Insider: Britney's Bikini-Clad Top 10
BNET Business Network:
BNET
TechRepublic
ZDNet

June 10th, 2008

PS3 chip powers world's fastest computer

Posted by Robin Harris @ 7:05 am

Categories: Clusters, Infrastructure

Tags: Sony PlayStation 3, AMD Opteron, RAM, Computer, Cell, Chip, InfiniBand, Processors, Networking, Semiconductors

Some scoffed at the 8 PS3 supercomputer. But not the scientists at Los Alamos National Labs. They used the idea to build a 1 petaflop computer named Roadrunner - the world’s fastest. Here’s how.

1,000 trillion floating point operations per second
Fine-grained simulation of aging nuclear weapons is the new computer’s ultimate gig. They couldn’t just string 14,000 PS3’s together - who’d believe the results?

Besides, it’s American to want something better - and way faster.

First they built a new Cell Broadband Engine
The new version of the PS3 chip - called a PowerXCell 8i Processor - features 8x faster double-precision floating point and over 25 GB/sec of memory bandwidth. That is the building block of a new and really honking compute node.

IBM PowerXCell 8i processor [PowerXCell 8i photo courtesy of IBM Systems and Technology Group]

Each compute node consists of 2 dual-core AMD Opterons and 4 PowerXCell 8i’s. Each Opteron has a fast connection to 2 PowerXCells enabling a theoretical 25x boost in floating point performance over a stock Opteron.

No one mentioned how much RAM they gave each PowerXCell, but the chip can address 64 GB of RAM, so each compute node could easily support 264 GB of RAM (4×64GB + 2×4GB or more on the Opterons). With over 100 GB/sec of memory bandwidth.

We’ll take 3,250 of them
That’s about how many nodes are in the completed Roadrunner. They’re interconnected by a standard - for HPC clusters - Infiniband DDR network.

Infiniband is a switched fabric interconnect featuring microsecond latencies and data rates of 2 GB/sec for 4x-DDR. That’s about as much as you can get out of a PCI-Express x8 bus anyway.

Update: I learned more about the storage infrastructure behind Roadrunner - 2,000 terabytes of file server - and wrote it up in my other blog StorageMojo.

The money quote:

Roadrunner currently has about 80TB of RAM, roughly 24 GB per compute node. That works out to about 4 GB RAM per processor.

The jobs these machines run are huge. A simulation can run 6 months or more. Depending on criticality a job gets checkpointed every hour or maybe once a day.

The Panasas installation at LANL, begun in 2003, is currently 2 PB. Assuming an average of 500 GB drives, that means 4,000 disk drives.

Big computers require big storage. End update.

Software is problem
The hardware specs are drool-worthy, but without the right codes it is just an expensive furnace. As the best single article on Roadrunner I found explains:

For the Cell, the programmer must know exactly what’s needed to do one computation and then specify that the necessary instructions and data for that one computation are fetched from the Cell’s off-chip memory in a single step. . . . IBM’s Peter Hoftstee, the Cell’s chief architect, describes this process as “a shopping list approach,” likening off-chip memory to Home Depot. You save time if you get all the supplies in one trip, rather than making multiple trips for each piece just when you need it.

The programmers optimized codes for a variety of applications, including radiation and neutron transport, molecular dynamics, fluid turbulence and plasma behavior. With the optimized codes they got a real-world 6-10x performance boost over the standard Opterons.

The Storage Bits take
Back when I was hawking vector processors a Gigaflop was considered respectable. A couple of decades later and we have a machine 1 million times faster. Cool!

We won’t be able to shrink feature sizes forever though, so architecture and bandwidth will be key to further speed-ups. Hopefully that time is still a few decades away.

Comments welcome, of course.

Robin HarrisRobin Harris has been messing with computers for over 30 years and selling and marketing data storage for over 20 in companies large and small. See his full profile and disclosure of his industry affiliations.


Email Robin Harris

Subscribe to Storage Bits via Email alerts or RSS.

  • Talkback
  • Most Recent of 49 Talkback(s)
...Then next year...
Next year they'll be another "fastest computer in the
world", and it will most likely use a chip that is hot
on the market at that time, no the Cell processor.
Last year the fastest compu... (Read the rest)
Posted by: Narg Posted on: 09/18/08 You are currently: a Guest | | Terms of Use
.... i believe the link is wrong  waylander | 06/10/08
Oops! Fixed.  R HarrisZDNet Moderator | 06/10/08
And it runs..  Tim Patterson | 06/10/08
RE: PS3 chip powers world's fastest computer  Linux User 147560 | 06/10/08
IBM never  s_souche | 06/10/08
Other OSs  Hemlock Stones | 06/11/08
It's not the chip used in the PS3...  olePigeon | 06/10/08
Misleading  itpro_z | 06/10/08
I doubt anyone would be here if he didn't mention PS3  Kid Icarus-21097050858087920245213802267493 | 06/10/08
I don't know  itpro_z | 06/10/08
What's a PS3?  deanders | 06/11/08
On a more interesting note...  olePigeon | 06/10/08
Co-processors: an old idea  R HarrisZDNet Moderator | 06/10/08
It's on your UK sister site...  olePigeon | 06/11/08
Opteron the controller, work down by the Cell  Richard Flude | 06/10/08
Opterons do the essential work  Hemlock Stones | 06/11/08
Which is a HPC  Richard Flude | 06/11/08
The xFLOPS part is mostly CELL  DevGuy_z | 06/18/08
Magnetic memory chip  BALTHOR | 06/10/08
Hmmm.  seanferd | 06/10/08
Analog ?  Hemlock Stones | 06/11/08
Need for Speed Must Fly!  TripleII | 06/10/08
Ray tracing ?  Hemlock Stones | 06/11/08
NOTHING can do Crysis  flatliner | 06/11/08
RE: PS3 chip powers world's fastest computer  Gamesgirl24 | 06/11/08
RE: PS3 chip powers world's fastest computer  kgriggs | 06/11/08
RE: PS3 chip powers world's fastest computer  cwallen19803@... | 06/11/08
Yes, it runs Linux  R HarrisZDNet Moderator | 06/11/08
Pointless article  Skullet | 06/11/08
In Your Opinion  deanders | 06/11/08
I agree  Skullet | 06/11/08
Still only a PowerPC.....  Narg | 06/11/08
What?  psychosmurf | 06/11/08
Only a PPC you say  rflulling@... | 06/11/08
...Then next year...  Narg | 09/18/08
RE: PS3 chip powers world's fastest computer  J_B_A@... | 06/11/08
RE: When will we get to "add on" power for our desktops?!?!?!  dorkiedorkfromdorktown | 06/11/08
Easier said than done  rflulling@... | 06/11/08
Good Article; Bad Purpose  psychosmurf | 06/11/08
Sorry. . .  psychosmurf | 06/11/08
Not only PS3 processor was used!!  Gradius2 | 06/11/08
Re: Not only PS3 processor was used!!  Gradius2 | 06/11/08
Provided the Opterons run on a 64-bit platform...  gmureddu@... | 06/11/08
This may be old news to China  martyh@... | 06/11/08
Sadams PS2s  rflulling@... | 06/11/08
Plural of code is code  grail@... | 06/11/08
So the plural of program is program?  R HarrisZDNet Moderator | 06/12/08
RE: PS3 chip powers world's fastest computer  Chomps | 06/12/08
RE: PS3 chip powers world's fastest computer  walkerjian@... | 06/19/08

What do you think?

SponsoredWhite Papers, Webcasts, and Downloads

Click Here
advertisement
Click Here

Recent Entries

advertisement

Archives

Favorite Links

ZDNet Blogs

White Papers, Webcasts, and Downloads

SmartPlanet

Click Here