Generic support for SpatialFullConvolution and SpatialDilatedConvolution.

Uses matrix multiple for matrix vector multiply for half (no matrix vector
implementation exists).
7 files changed