Cleaning up CUDA code base. Got rid of useless device syncs
4 files changed