On mySimon: LG Electronics 32LH40 32" LCD TV
BNET Business Network:
BNET
TechRepublic
ZDNet

February 20th, 2008

How do you benchmark real-world work?

Posted by Ed Bott @ 8:00 pm

Categories: Windows Vista, Windows XP

Tags: Usability, PC, Microsoft Windows Vista, Ed Bott, Usability Professional

Adrian Kingsley-Hughes and I have been focusing lately on a tiny aspect of PC performance. He ran two sets of file management benchmarks on a test PC in his lab, I performed similar tests on a machine in my lab. Results? Inconclusive.

But are both of us missing the real point of owning and using a PC? Can any stopwatch-based measurement of isolated tasks as performed by individual hardware and software components really measure the worth of a technology investment? I don’t think so.

This is not a new question for me. Back in the early 1990s, when I was editor of the late, lamented PC Computing, we differentiated our product reviews from those of sister public PC Magazine by focusing on usability. The highly regarded PC Magazine Labs was the quintessential “speeds and feeds” shop. We focused on usability, going to the extreme of spending a small fortune (I still remember the budget battles) building a state-of-the-art usability lab and hiring usability professionals to run it.

I liked our reviews better than the ones at PC Mag because we didn’t have a one-size-fits-all conclusion. Instead, using the usability data, we tried to determine which product was a better fit for readers and prospective buyers with different needs. I think that approach still works today.

In the Talkback section of my earlier post, there’s a lively discussion of what sort of benchmarking would work better than flawed speed tests that don’t map to real world activities. The short version, from commenter frgough, says that Adrian and I should

simply do stopwatch tests on their normal daily workflow and see how the two operating systems compare, because, at the end of the day, that’s what it comes down to.

Easier said than done. Here’s a short list of lessons I learned from the PC Computing usability lab that are still valuable today:

Preconceptions affect perceptions. In the case of Windows Vista, that’s a double whammy. The relentless drumbeat of “Vista sucks” press coverage is pretty hard to ignore. Try to find a usability tester who hasn’t read any of that coverage and doesn’t already have a bias going in.

Bad experiences affect perceptions too. The negative reviews of Vista are in many cases grounded in painful reality. There’s no doubt that bad drivers, bugs in Vista itself, and crappy OEM hardware configurations caused a lot of early adopters to have unpleasant experiences with Windows Vista. Those initial impressions affect perceptions in a fundamental, hard-to-shake way. Even a minor problem can be painful if you don’t know the solution. If it requires indeterminate amounts of troubleshooting to figure out why something doesn’t work the way it’s supposed to, that can be a deal-breaker.

The older, established system has a built-in advantage. Switching to a new computing platform involves unlearning old ways and learning new procedures (just look at the advice offered to people switching from Windows to a Mac). Initial productivity will be lower on the new system.

Are you testing learnability or usability? One trap that usability professionals warn about is the danger of disproportionately crediting a product that has a great out-of-box experience but doesn’t deliver over the long haul. Jeff Atwood offers an excellent summary of the issues, capped by this great quote from Joel Spolsky:

If you did a usability test of cars, you would be forced to conclude that they are simply unusable.

Faster isn’t always better. Simply measuring productivity by seeing who finishes first doesn’t necessarily give you the right answer either. In the hands of someone who knows a system well, even a terrible design can be highly efficient. I can be tremendously productive at a command prompt and can probably finish many tasks faster with command-line tools. But if you forced me to choose between a command-line interface and a GUI for daily work I would choose the latter every time. I don’t miss MS-DOS.

Sometimes there is no right answer. I talked with a usability professional at Microsoft recently who described an all-too-common real-world dilemma. The interface designers had to decide how the up arrow should work in a particular feature. There were only two possible choices. The trouble is, usability testing proved conclusively that 50% of the test subjects thought it should work one way, and 50% thought it should work the other way. No matter which design you choose, half of your customers will think you designed an unintuitive interface.

Ultimately, for mainstream business use and everyday consumer scenarios, I think usability is the key to measuring how well a piece of hardware performs. The trouble is finding the metrics to measure usability.

I’m interested in your thoughts. Regardless of which computing platform you use, what aspects of usability are important to you? Leave your thoughts in the Talkback section.

Ed BottEd Bott is an award-winning technology writer with more than two decades' experience writing for mainstream media outlets and online publications. See his full profile and disclosure of his industry affiliations.

Email Ed Bott

Subscribe to Ed Bott's Microsoft Report via Email alerts or RSS.

  • Talkback
  • Most Recent of 77 Talkback(s)
Professional Versus Personal
Hi everybody! I work as a computer technician and spend a lot of time working with consumers as they deploy technology both on personal and professional levels.

While I have read many of the c... (Read the rest)
Posted by: acedelmar Posted on: 04/29/08 You are currently: a Guest | | Terms of Use
Usability  Frank from Holland | 02/21/08
I agree with you...  Ben_E | 02/21/08
agree ?  aussieblnd@... | 02/21/08
Personal preference  Ben_E | 02/21/08
Professional Versus Personal  acedelmar | 04/29/08
Real-world vs. Synthetic  Adrian Kingsley-HughesZDNet Moderator | 02/21/08
I think it's ashame you benchmarked with a Pentium-D  DonBurnett | 04/10/08
Benchmarks are definitely not...  bjbrock | 02/21/08
Spot on in an immature market...  jasonp@... | 02/21/08
Agreed but...  bjbrock | 02/21/08
My in-laws love Vista  Spats30 | 02/21/08
Useability improvements equals....  brewakeg | 02/21/08
So what you're telling us...  jasonp@... | 02/21/08
It's a response  coffeeshark | 02/21/08
Subjective vs. objective  frgough | 02/21/08
To beat the tired old car analogy  frgough | 02/21/08
Quick and dirty example  frgough | 02/21/08
But do these translate into increased/decreased productivity?  ye | 02/21/08
Don't underestimate the annoyance factor  frgough | 02/21/08
I agree. But every platform has its annoyances.  ye | 02/21/08
All that really matters to the user...  John L. Ries | 02/21/08
While how much you like software is important to you it...  ye | 02/21/08
Kicking the tires  muzhik | 02/21/08
Useless analogy  scott@... | 02/21/08
yes and no  Ed BottZDNet Moderator | 02/21/08
For most it's all pointless.  ye | 02/21/08
Objective View - streaming MCE video  davidadkins1@... | 02/21/08
Objectivity and subjectivity are both important  John L. Ries | 02/21/08
Subjectivity  Ed BottZDNet Moderator | 02/21/08
Responsibillity  Ed BottZDNet Moderator | 02/21/08
Certification  t_mohajir | 02/21/08
Certified for what?  muzhik | 02/21/08
I beg to differ  John L. Ries | 02/21/08
You miss the point  Ed BottZDNet Moderator | 02/21/08
No, it's not  John L. Ries | 02/21/08
Risk with any OS change  Ed BottZDNet Moderator | 02/21/08
Sorry about the formatting  Ed BottZDNet Moderator | 02/21/08
Missed Points  Harry Bardal | 02/21/08
Feudalism  John L. Ries | 02/21/08
Let me get this straight:  muzhik | 02/21/08
I don't fully agree with you there...  Ben_E | 02/23/08
Benchmarking Switchers  Harry Bardal | 02/21/08
Of course they compete  Ed BottZDNet Moderator | 02/21/08
They Most Certainly Do Not  Harry Bardal | 02/21/08
We just have to disagree  Ed BottZDNet Moderator | 02/21/08
Competition  Harry Bardal | 02/21/08
You don't seem to understand the business world.  rtk | 02/21/08
rtk  Harry Bardal | 02/21/08
Harry  rtk | 02/21/08
Muscle Memory vs Intellect  Harry Bardal | 02/22/08
Another distortion from Harry  Ed BottZDNet Moderator | 02/22/08
Distortions  Harry Bardal | 02/24/08
On the other hand ...  Adrian Kingsley-HughesZDNet Moderator | 02/21/08
Transition Tool  Harry Bardal | 02/21/08
You're stretching, bud.  rtk | 02/21/08
Virtualization  Harry Bardal | 02/21/08
re: Virtualization  rtk | 02/21/08
it's all in the user's perception  coffeeshark | 02/21/08
Once burned is enough!  jim.denny@... | 02/21/08
RE: How do you benchmark real-world work?  atari8bit@... | 02/21/08
Huh?  Ed BottZDNet Moderator | 02/21/08
configuration  svv1999@... | 02/21/08
usability.  rtk | 02/21/08
Step 1  dfolk | 02/21/08
Fact checking  Ed BottZDNet Moderator | 02/21/08
You are correct Ed  dfolk | 02/21/08
What? Pogue?  rtk | 02/21/08
there are loads of proper pre-prepared benchmark tests out there  james.faction | 02/21/08
Yes ... but ...  Adrian Kingsley-HughesZDNet Moderator | 02/21/08
RE: How do you benchmark real-world work?  colin.hutchison@... | 02/21/08
Enough Excuses, Vista Deserves Its Bad Reputation  chessmen | 02/22/08
In the interests of fairness...  Ben_E | 02/23/08
Not fairness - realism  colin.hutchison@... | 02/23/08
Paranoid much?  Ben_E | 02/24/08
Think of it as a movie review  Kerry from BC | 02/25/08
Why start now?  Ole Man | 02/25/08
RE: How do you benchmark real-world work?  DonBurnett | 04/10/08

What do you think?

SponsoredWhite Papers, Webcasts, and Downloads

Click Here
advertisement

Recent Entries

advertisement

Archives

ZDNet Blogs

White Papers, Webcasts, and Downloads

SmartPlanet

Click Here