The XML Paradox

I have been working on my tutorial for the O'Reilly Tools of Change conference. I'm presenting PDF, as an alternative to XML, as a cost-effective way to create revenue from the backlist. As a dedicated markup advocate from the days of SGML, and someone who helped simplify SGML down to XML, I still find it odd to be talking about other kinds of solutions, but I think I learned something from my custom web site customers...

The XML Paradox starts from a reasonable expectation: XML is a high-quality archival medium, so obviously books and scholarly content would make the jump first. It just makes sense that everyone would use the high-value format for the longest-lived, highest-value content. Wrong! The economics of publishing have played out the opposite way. The more ephemeral the content, the faster production methods can change. So newspapers were doing full-text databases from very early on. In the scholarly markets, journals are now almost all electronic. Books, however, are only starting to move fitfully in the XML direction, and are mostly not digital at all.

So the least archivable stuff moves to the best archival format fastest. Serial content does not have a legacy that needs conversion to make a new channel profitable, so the payoff from a production change can be pretty fast. A publisher with a rich backfile, by contrast, has items that can earn for 20 years or more, as long as costs can be controlled. So any change to the book production process has to pay off immediately on new books. And for any large-scale change across a publisher's line to be successful, it must be very cheap for old books. And that's where e-books stand: revenue unearned because there's not a clear path to get it.

XML is great, and enables the production of an optimized presentation for a new media format, but it's not cheap at all. It's an expensive and tricky management challenge to change editorial production processes for new content, and data-conversion costs for old content are very high.
Once the data is in hand, the development cost to create a new output format (print, web, handheld, or whatever) is not cheap either. Problems like typesetting, layout and display all have to be solved anew for each output format. It takes work to optimize presentation, especially starting from the level of abstraction that gives good XML its power. So page images (and especially PDF) get a big boost from the XML paradox, because they capture a lot of the production value of the existing process and they're the cheapest searchable format to produce from paper.

So here I am, a guy who courted his wife over conversations about markup, working with page images. We are managing them with very rich metadata at a fine level, to capture much of the commercial benefit of XML, but still, I'm enabling something I used to rail against. And it's not easy to make page images work over the web, let publishers control the presentation, and still be good to readers.

In this discussion I am leaving out the small number of crown-jewel properties that earn large amounts quickly in a new channel, and thus merit technology investment. Projects like that are important, but they don't shift the business as a whole. And their emphasis on frequent updates makes them similar to serials in their need for continuous editorial management.

Coming soon: I used to think that page scanning projects were a waste of money in terms of long-term investment, and I hope to post soon about why I no longer believe that either.
