| commit | a23b378052095e62740aabf095ecb77d28463fce | [log] [tgz] |
|---|---|---|
| author | Aapo Kyrola <akyrola@fb.com> | Wed May 10 23:19:27 2017 -0700 |
| committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Wed May 10 23:32:44 2017 -0700 |
| tree | 63eb2afe9b411007538c782388c9a8178250ea7c | |
| parent | e16ea46013c76fa31b2a997cf8448c6f3d44a175 [diff] |
set cuda stream for cub::DeviceReduce in SumReduceLike Summary: After a long and painful debugging of indeterministic behavior on Machine Translation team's attention model, I found that in certain cases SumReduceLike will use cub::DeviceReduce, and it lacked the stream param. Reviewed By: jamesr66a, asaadaldien Differential Revision: D5043347 fbshipit-source-id: bb91aacfc6786cc2b85ebc4e432c67e5f876e235
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Please use Github issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the BSD 2-Clause license.
Detailed build matrix (hit refresh if you see icons not showing up due to heroku):
| Target | Status |
|---|---|
| Linux | |
| Mac (CPU) | |
| Android | |
| iOS | |
| Linux + MKL | |
| Windows |