| commit | 56a6054dc4d46a79409902ef6adfeb333052f92c | [log] [tgz] |
|---|---|---|
| author | Sam Gross <sgross@fb.com> | Thu Jun 11 13:38:22 2015 -0700 |
| committer | Sam Gross <sgross@fb.com> | Mon Jun 15 08:03:21 2015 -0700 |
| tree | f8beb84ef68b26afe52210c439c7f288734e9b44 | |
| parent | ff1384d12ddf9c5b29aaee630bfa0ddad09ca872 [diff] |
Optimized indexSelect kernel for contiguous inputs Adds an optimized indexSelect kernel for contiguous inputs indexed in the first dimension. This will remove the need for a separate CUDA LookupTable forward pass, since it can just use indexSelect.