The xcalloc/xfree/xalloc/XNFprintf/... functions are deprecated. Replace them with the new ones, and fix some other minor problems.
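A minimal sketch of this kind of replacement, assuming the new calls are the plain libc allocators plus an allocating printf (the commit does not name them). `alloc_counters` and `alloc_printf` are hypothetical helpers written for illustration, not X server API.

```c
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Before (deprecated wrappers): v = xcalloc(n, sizeof(int)); ... xfree(v);
 * After (assumed replacement):  plain calloc()/free(). */
static int *alloc_counters(size_t n)
{
    return calloc(n, sizeof(int)); /* zero-initialized, NULL on failure */
}

/* Hypothetical stand-in for an allocating printf-style call.
 * Returns a malloc()ed string the caller must free(), or NULL on failure. */
static char *alloc_printf(const char *fmt, ...)
{
    va_list ap, ap2;
    char *buf;
    int len;

    va_start(ap, fmt);
    va_copy(ap2, ap);
    len = vsnprintf(NULL, 0, fmt, ap); /* first pass: measure the output */
    va_end(ap);
    if (len < 0) {
        va_end(ap2);
        return NULL;
    }

    buf = malloc((size_t)len + 1);
    if (buf != NULL)
        vsnprintf(buf, (size_t)len + 1, fmt, ap2); /* second pass: format */
    va_end(ap2);
    return buf;
}
```

Callers release both results with free(), so no wrapper-specific cleanup call is left behind.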
This doesn't yet have an optimized glamor_composite_rects() implementation, but it does triple the speed of x11perf -aa10text.