im not sure it this is still the case but initial implementation or rtg on vampire was likely simple framebuffer, without any accel laike blitter, as thomas says. now it might be they have implemented some accelerated memory copy functionality into the core, maybe even masked and such, but i dont know it.
another issue is that vampire shares the same bus to memory for cpu and rtg, so it can starve with higher resolutions and frequencies, while cpu is doing much memory access. the solution to this is as far as im aware a lower rtg frequency, as long as the hdmi display device supports it.