Nice. And now we wait for negative comments from the same naysayers that thwarted Chris's efforts. ;-)
Re: legacy MEMF_REVERSE usage, if you observed that your allocations were in the upper half of memory (either physically or with respect to memory lists), wouldn't using MEMF_REVERSE have decreased allocation time by at least a factor of two, barring fragmentation?
It also seems to me that using an allocaligned-like function would have increased fragmentation with very little benefit, particularly for alignments <= 16, by unnecessarily increasng the size of the system memory lists. I'm speaking from the perspective of someone with no real knowledge of the internals of Amiga OS memory allocation, though.