One of the biggest issues with Amiga emulation as I understood it was the graphics. Would it not be possible to have the CUDA processor deal more directly with it rather then needing the cpu to handle that particular facet?
This isn't my area of expertise so I could be viewing this in completely the wrong way.
Ah right, now I get you. I've wondered about that too. You could probably use it to speed up planar to chunky (yes, that's right, not chunky 2 planar) and rendering of HAM framebuffers to the host display format, you could probably use it to speed up blitter emulation too.
However, the thing is that on current CPU's these aren't really too taxing. A more widely supported solution I think would be to leverage multiple core support on the CPU. Since AmigaOS 3.x exec just isn't cut out for SMP (something else I've pondered for UAE is a hacked exec that can use multiple concurrent JIT instances for handling as many AmigaOS "ready to run" processes as the host feasibly allows), you can have the JIT on one core and offload things like display emulation to another core. Worth noting too is that if you are using UAE for anything beyond a spot of retro gaming you are probably going to be using RTG anyway and not even worried about "native" amiga modes.
Also, anything you do with the GPU is probably best left until OpenCL becomes more widespread as it'll work on more systems than CUDA.