Skip to main content

Tizra gets faster

Non-technical summary: things are lots faster at Tizra sites and admin tools. There's certainly more to do, but we've got more tricks up our sleeves! Because the big current speed boost is related to one cause, and it took me a while to track down, the geek appendage to this post describes what we found and how we fixed it.

Geekly details

I spent a bunch of time last week looking at system performance. As we've been adding customers and usage, we were beginning to feel the pinch. Performance always varies, but the range of response times was getting wider as things slowed, leading me to think that there might be some systemic issues that would give us a quick improvement (and indeed there was some Linux tuning that helped a bit). But data access seemed to be the real issue, so I spent a bunch of time looking into hibernate, and our caching and querying, and then wound up spending a day or so basically watching all the queries go through Postgres. And you know what? most of them seemed much slower than they should be, even though they are pretty hairy.

Of course, the next step was to check for database indexes, and how the query plans were using them. But in hand testing the plans looked good, and the indexes were sensible. But when run by hand the queries were also significantly faster than when hibernate ran them! This was much easier to see now that we have a live load, which is inevitably different from a test setup. So why the difference? Postgres was ignoring our indexes only when Tizra publisher made the queries.

Turns out that there's an old bug in Postgres where it would ignore indexes on bigint fields in prepared statements unless there was an explicit data type cast. (That type confusion was an obscure result of skew between Postgresql and the SQL standard.) And that was the behavior I was seeing, even though we were using a much more recent vintage of all the software. This was terrible for us, because we have a multi-tenant publishing system for large document collections and we use bigints as primary object identifiers!

So, why the old problem if the bug is gone, and we are not using postgres 7? It turns out that we dynamically build those hairy queries, in HQL (hibernate query language), using the String trick. But nowadays instead of making your indexes work, it breaks them! The differences are invisible in the SQL. It turned out that we were in a version "donut hole." Our database was recent enough so the String trick worked the opposite way (preventing fast queries for our prepared statements), but the JDBC driver wasn't making the calls in the right way to make the old trick work. End result: we're now running the latest JDBC driver with compatibility options set while we update our hairy query generator. And now we can really start tuning our setup!

If the web had not provided the history of the old bug, I would have had a much worse time even knowing where to look to find our somewhat subtle configuration issue. So enjoy the speedup, I sure am!

Comments

Anonymous said…
This is pretty wierd, any chance you can post the exact version numbers of Postgres, Hibernate, and JDBC involved?

Popular posts from this blog

The importance of XML is real, but practicality of PDF gets short shrift

Publisher's weekly seems to have missed a key part of my message during Rebecca's and my backlist tutorial, which is that the long-term term payoff of XML is sufficiently expensive and disruptive that it can't happen quickly for publishers with significantly smaller resources than Thomson's, and that image based solutions like PDF can meet a lot of needs very quickly, for publishers that don't want to postpone full entry into online markets another 2-5 years. The Adobe announcements (especially integration of new e-book formats into print-oriented production tools) seems to present a more practical way for smaller publishers to change their workflows than the "big-bang" conversion project. But that kind of incremental strategy leaves existing PDF and image backlists just the way they are, and means that PDF will be a key part of all solutions for online marketing and product definition for the foreseeable future. Sometimes the future's so bright tha...

Tizra Upgrade Provides a Crisper, More Interactive E-Reading Experience

In the print world, when you think about a reader’s user experience, you consider factors like the size and weight of a book, paper quality, typeface, layout and design.  Moving to digital, some of these factors still hold true, but others are replaced with concerns such as speed, intuitive controls, cross-platform compatibility, plus as with any human interface, a host of intangibles.  We’re always working to make the Tizra reading experience crisper, easier, and less distracting, because happier readers mean happier publishers. Tizra reader upgrade makes it easy to enhance content with interactive lightbox effects. The update builds on Tizra’s ability to provide usability and compatibility across all the most popular web browsers and viewing devices, and is now available to all Tizra customers. Enhancements include:   Speed -- e-reading should be as crisp, fast and simple as turning a page. Your readers are not going to tolerate delays waiting for cont...

Announcing the Tizra Publishing Webinar Series 2017

This September Tizra is launching a new educational program for its association publishing clients and others interested in digital publication best practices. The free series of three monthly webinars led by publishing expert Thad McIlroy is directed to help you become a more effective publishing manager by examining best practices and new trends in online publishing. The first webinar in the series is " The 5 Top Issues Facing Association Publishing Management and How ToTackle Them! " and it takes place on Thursday, September 28 from 1pm - 2pm ET. Registration is free. Description: It’s never been a better time to be an association publisher. The tools, technologies and formats bring information and knowledge to members in record time and in multiple formats. There are challenges, but, as the saying goes, challenge brings opportunity. When Tizra talks with association publishing managers these are the top issues we hear about: 1. Enabling discovery via se...