I don't understand the results of the tests: all of them seem to show that the new library takes LONGER than the old library to copy the same amount of data.
For example:
Copying 65536 bytes 282 times (long -> long offset)
Old CopyMem : 1.46 secs
New CopyMem : 1.51 secs (+ 3.4%)
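
To double-check the reading of that figure, here is a rough sketch of the arithmetic. It assumes the percentage is the change in elapsed time relative to the old library, which reproduces the 3.4% shown above, so a positive value would mean the new library is slower. The function names are made up for illustration and are not part of the test program.

```python
# Assumption: the benchmark reports (new - old) / old as a percentage.
# Function names here are hypothetical, chosen only for this sketch.

def percent_change(old_secs, new_secs):
    """Relative change in elapsed time; positive means the new run took longer."""
    return (new_secs - old_secs) / old_secs * 100.0

def throughput(bytes_per_copy, copies, secs):
    """Bytes copied per second over the whole run."""
    return bytes_per_copy * copies / secs

old_secs, new_secs = 1.46, 1.51   # timings from the example above
block, count = 65536, 282         # 65536 bytes copied 282 times

print(f"change: {percent_change(old_secs, new_secs):+.1f}%")
print(f"old: {throughput(block, count, old_secs):,.0f} bytes/sec")
print(f"new: {throughput(block, count, new_secs):,.0f} bytes/sec")
```

If that assumption holds, the "+" sign is extra time, not extra speed, and the new CopyMem moves fewer bytes per second in this test.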