One thing the FPGAArcade has going for it is far faster memory, so even if the AGA implementation has no enhancements, the 256 colour modes should perform a lot faster simply because there will be far less contention on the memory bus.
Wonder if you can put faster chip RAM in an A4000 to help the performance in AGA?