On CNET: Keep your software up-to-date
BNET Business Network:
BNET
TechRepublic
ZDNet

December 11th, 2008

The many-core performance wall

Posted by Robin Harris @ 7:01 am

Categories: RAM

Tags: Performance, Memory Bandwidth, Microprocessor, Chip, Wall, Sandia National Lab, Memory Wall, Performance Management, Semiconductors, Processors

Many-core chips are the great hope for more performance but Sandia National Lab simulations show they are about to hit a memory wall. How bad is it?

Some background
Memory bandwidth is the limiting performance factor in CPUs. If you can’t feed the beast it stops working, simple as that.

John von Neumann - your PC/Mac is a von Neumann architecture machine - made the point in his very technical First Draft of a Report on the EDVAC (pdf)

. . . the main bottleneck, of an automatic very high speed computing device lies: At the memory.

Here’s the ugly results of Sandia’s simulations:

Simulated performance of many-core chips with 2 different memory implementations[graph courtesy Sandia National Lab]

Performance roughly doubles from 2 cores to 4 (yay!), near flat to 8 (boo!) and then falls (hiss!).

Did Pink Floyd forecast this?
Chip packages support just so many pins and so much bandwidth. Transistors per chip double every couple of years - but the number of pins don’t.

Professors William Wulf and Sally McKee named it “the memory wall” in their 1994 paper Hitting the Memory Wall: Implications of the Obvious (pdf) saying:

We all know that the rate of improvement in microprocessor speed exceeds the rate of improvement in DRAM memory speed - each is improving exponentially, but the exponent for microprocessors is substantially larger than that for DRAMs. The difference between diverging exponentials also grows exponentially; so, although the disparity between processor and memory speed is already an issue, downstream someplace it will be a much bigger one.

According to an article in IEEE spectrum that time is almost upon us. With cores per processor doubling every 2-3 years - and graphics chips moving faster - we don’t have long to wait.

The memory wall’s impact is greatest on so-called informatics applications, where massive amounts of data must be processed. Like sifting through petabytes of remote sensing data to find bad guys with nukes.

Can this be fixed?
Sandia is investigating stacked memory architectures, popular in cell phones for space reasons, to get more memory bandwidth. But as the simulation shows, that doesn’t improve performance.

RAMbus is working on a Terabyte Bandwidth Initiative that may help. Their goal: 64 16GB/sec DRAMs with differential data channels to feed a system-on-a-chip memory controller.

Intel needs to pick up the pace. Nehalem processors are the first with an on-chip memory controller and the new Quick Path Interconnect. But server-class Nehalems are now limited to 2 QPI links for a total theoretical bandwidth of only 50 GB/sec. Faster, pussycat, faster!

The Storage Bits take
Many-core is the future for computer performance. Memory bandwidth is one big problem. Software support for efficient many-core use is another. Either could bring the performance expected from Moore’s Law to dead stop.

The industry is making big investments in both problems. If it is a problem for Sandia today it will be a problem for consumers in 10 years.

What if one doesn’t get solved? Then the Moore’s Law rocket we’re been riding will sputter and die. Life on the glidepath won’t be nearly so much fun.

Comments welcome, of course. If anyone wants to make the case that von Neumann was wrong, I’m all ears.

Robin HarrisRobin Harris has been messing with computers for over 30 years and selling and marketing data storage for over 20 in companies large and small. See his full profile and disclosure of his industry affiliations.


Email Robin Harris

Subscribe to Storage Bits via Email alerts or RSS.

  • Talkback
  • Most Recent of 37 Talkback(s)
RE: The many-core performance wall
Who cares about performance and green computing when my biggest waste of time using my computer is reading ZDNet talkbacks?

I have to click each individual Talkback and it only displays on abou... (Read the rest)
Posted by: afcas86@... Posted on: 01/07/09 You are currently: a Guest | | Terms of Use
Ultimately, software will need to change  timboldt | 12/11/08
Parallel computing  ArrowQuick | 12/13/08
RE: The many-core performance wall  timiteh | 12/11/08
only innovation will save us  jsm88 | 12/11/08
We've got a Solution. AMD had nailed it a long time ago 64bit  Breetai | 12/16/08
RE: The many-core performance wall  V@... | 12/11/08
time to separate "performance" from "scalability"  diane wilson | 12/12/08
Re: Scaleability  V@... | 12/12/08
Great point, but the results of this test are even less worrisome  georgeou | 12/13/08
Still Waiting for the CPU & GPU to Merge  V@... | 12/13/08
RE: The many-core performance wall  bbodnar1@... | 12/12/08
RE: The many-core performance wall  jscott418 | 12/12/08
Annual - "The end is near"  No_Ax_to_Grind | 12/12/08
Nearer than you think.  V@... | 12/12/08
Well, your laws of physics...  Spiritusindomit@... | 12/12/08
Yup, someone makes that claim every year.  No_Ax_to_Grind | 12/12/08
Maybe not so near  ron.cleaver@... | 12/12/08
Ugh?  V@... | 12/13/08
Quantum computing, photonic CPU's, etc...  T1Oracle | 12/14/08
Yeah, Yeah - Still Waiting for Serious Commitment  V@... | 12/14/08
No... It has been broken.  Breetai | 12/16/08
Well if this isn't filed under [Obvious]...  Spiritusindomit@... | 12/12/08
One solution, go optical  Silex | 12/12/08
Bring on quantum computing!  gnesterenko | 12/12/08
RE: One solution, Go optical  bfilipiak@... | 12/12/08
RE: The many-core performance wall  bhansalibg@... | 12/12/08
Answer to the problem is on the graph  P. Douglas | 12/12/08
One more thing ...  P. Douglas | 12/13/08
Except that....  TheTruthisOutThere@... | 12/14/08
Parallel computing will add complexity  P. Douglas | 12/14/08
Yes, your knowledge is limited.  TheTruthisOutThere@... | 12/14/08
Okay, okay  P. Douglas | 12/14/08
Stop trying to polish a turd. It's a dead end.  V@... | 12/14/08
Re: Blah - Blah - Blah Have You Any Sheep?  The Rifleman | 12/12/08
SSD Multi Simultaneous READ/Write  VONDRASHEK@... | 12/13/08
Why worry?  cornpie | 12/14/08
RE: The many-core performance wall  afcas86@... | 01/07/09

What do you think?

SponsoredWhite Papers, Webcasts, and Downloads

Click Here
advertisement

Recent Entries

advertisement

Archives

Favorite Links

ZDNet Blogs

White Papers, Webcasts, and Downloads

SmartPlanet

  • Thought-provoking progressive ideas on diverse topics that intersect with technology, business, and life, and matter to the world at large. Visit SmartPlanet
  • More from IBM
  • Innovate your business' process model, play against the market, compete against others on our scoreboards and WIN! Try INNOV8 2.0: A BPM Simulator
  • Enabling Real-World Business Transformation through IBM Service Management Read the EMA Analyst Report
Click Here