
October 20th, 2005

PDF vs. Office XML

Posted by John Carroll @ 9:19 am

Categories: General, ODF

Continuing the theme started in my last post, Massachusetts recently ratified a digital document standard that excludes technology backed by the maker of the dominant Office automation solution on the market - Microsoft. In my last post, I questioned whether that outcome was truly a demonstration of the power of democracy rather than proof of the power of interest group politics, if not a bullet item in the case against letting governments - who don’t operate according to the same procurement rules as private citizens - override technology decisions made by the open marketplace.

Besides ODF, a standard for digital documents ratified last May by the OASIS group, Adobe’s PDF was also included on the list. PDF has qualities that justify its inclusion, but that inclusion raises a number of questions: many of those justifications also apply to Office XML, and Office XML has merits in its own right that make it better, in some ways, than PDF.

First, a few misconceptions need to be cleared up. Office XML is offered under a royalty-free license to anyone who wishes to implement it.

In other words, Office XML does NOT require Windows, or even a Microsoft product, to read or write it. Therefore, it satisfies the requirement for an "open" standard outlined by Marc Wagner in a recent blog post:

no citizen (or government agency) of the Commonwealth of Massachusetts should be compelled to buy any Microsoft product (or products from any other single vendor) in order to have access to public records.

That is already the case with Office XML. Therefore, it’s false to claim that Microsoft needs to publish their formats - or otherwise open their licenses to development and transfer without royalty payments. Publication and lack of royalty payments already exist.

That’s the same situation with PDF, a format developed by Adobe but offered to third parties for implementation. That open status has certainly existed for longer than it has for Office XML, and the result - the presence of PDF readers on most computer platforms in existence and near-universal support for the format - is testimony to Adobe’s careful nurturing of the technology.

That isn’t an argument, however, for PDF being more "open" from a specification standpoint. Microsoft may have been late to make its document formats open, but that doesn’t change the fact that Office XML is now an open specification that can be implemented on other platforms. Besides, consider the alternative, a format named ODF that was ratified in May of 2005 and has a far smaller installed base than Office XML (a format developed for Office 2003).

Some might argue that PDF’s standardization by third party standardization groups (i.e. "joint stewardship") makes it more "open" than Office XML. PDF/A (ISO 19005-1) has been ratified by the ISO for long term document preservation and archiving, and PDF/X (ISO 15930-1) is for the reliable exchange of press-ready, high end graphic information such as high-end color advertisements.

If Massachusetts had chosen the ISO-ratified PDF variant, then we’d have something to talk about. Unfortunately, they didn’t. Rather, they chose PDF version 1.5, a specification completely controlled by Adobe. Just to put that in perspective, that would be like the Massachusetts Department for the Promotion of Video Arts (which doesn’t exist) standardizing on Windows Media 9 (WM9) as opposed to VC-1. WM9 serves as the foundation of VC-1, but it is not the same thing as VC-1, which is a standard in the final stages of approval by SMPTE and into which third parties can have input.

The choice calls into question the notion that one of the "standards" of openness was that "it must be subject to joint stewardship." If that were so, they would have specified the officially sanctioned variant of PDF, not the one owned completely by Adobe.

On that note, Microsoft isn’t averse to placing its technology under "joint stewardship." It did that with the .NET CLI, as well as with VC-1 (which, as noted, is based on WM9). The question is whether anyone asked them to do that. Again, though, I would have expected the Massachusetts standardization group to insist on the ISO variant of PDF and not one controlled by Adobe if "joint stewardship" were a critical requirement.

Berlind noted that Microsoft chose an open-patent licensing policy for its Office XML specification. This means that anyone implementing a reader/writer could use any Microsoft patent to do so. In contrast, Adobe opted for the "patent list" approach to licensing, which means a specific list of patents was licensed out for use by implementers.

This means the possibility exists (however small) that someone implementing the PDF specification might run afoul of a patent not on the list. That matters because non-specific patent grants "future proof" a specification: you can never predict what patents may arise in the future that could be relevant to its implementation. Therefore, it’s worth marking that one down on Microsoft’s side of the scorecard.

An area that Berlind claimed makes PDF more open, however, is in the ability to make derivative works:

What Engelhardt basically said is that developers are free to do whatever they want with Adobe’s PDF specification. For example, they can break it apart or remix it with other specifications. The only restriction on this activity is that if the final output of the software isn’t 100 percent compliant with the PDF specification, the developer cannot say that the software or the documents it produces are "PDF." The freedom to remix Adobe’s work lies in stark contrast to Microsoft’s license which says: "A ‘Licensed Implementation’ means only those specific portions of a software product that read and write files."

That’s all true. That position is inconsistent, however, with the point of the Massachusetts standardization effort, which was to settle on a stable document format for long-term document archive purposes, among other things.

Placing the ability to make incompatible variants of PDF on a list of reasons defending its status as an approved long-term document storage format is like listing the ability to write English-language poetry that no one but the author understands as proof of the consistency of the English language. If the situations were swapped and Microsoft were the one with a policy that defended the ability to make incompatible derivative works, the open source world would be up in arms about the potential lack of consistency.

An advantage Office XML has over PDF is its status as an XML grammar. PDF was only allowed for documents whose content and structure will not undergo further modifications and need to be preserved. ODF was the real foundation of the Massachusetts policy, as it was the only document format that allowed modification. ODF is also an XML grammar, and according to Berlind’s article, everyone involved in the standardization effort was enthusiastic about the use of an XML format for digital documents.

If XML is the goal, then Office XML makes a better standard than PDF, particularly given that a) it better ensures standard implementations through legal enforcement of compatibility, b) it allows any and all Microsoft patents to be used in the implementation, which defends the specification against future patent grants, c) joint stewardship doesn’t appear to be that important, given that the approved PDF specification is not "jointly managed," and d) even if it were, Microsoft has already demonstrated willingness to standardize technology through third party standardization groups.

The inclusion of PDF is an Achilles heel to arguments that Massachusetts wasn’t swayed by the predictable hue and cry of open source advocates and Microsoft competitors in response to the suggestion that Microsoft’s format would be included on its approved list. If PDF was included, it at LEAST makes sense for Office XML to be included at the same level as PDF…which is for read-only documents. I argue, however, that Office XML should have been allowed to do more.

Unfortunately, that argument will have to wait for my next post.

John Carroll has delivered his opinion on ZDNet since the last millennium. He has not been a Microsoft employee since May 2008, and is currently working at a unified messaging-related startup.


