This time we'll write a CUDA kernel that implements the C++ algorithm we looked at in the last tute. The kernel is quite simple, but it still gives us an excellent s...