Added copying form of CudaTensor.cmul

i.e.
	x:cmul(y,z)
Previously only
	x:cmul(y)
was implemented.
2 files changed