commit | 1aa6300696ea603057c432aac5462e04bcfb7370 | [log] [tgz] |
---|---|---|
author | Simon Layton <slayton58@gmail.com> | Tue May 30 16:43:02 2017 -0700 |
committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Tue May 30 16:46:38 2017 -0700 |
tree | 80b3a7f77c17c4becde5dd3a70baf99bbabe2c77 | |
parent | 47e921ba496b19057358766390e8c199efbf1682 [diff] |
Option to use NCCL for broadcast Summary: Fixes some performance issues when `broadcast_computed_params=True` is passed to Parallelize_GPU. Enabled via the same `use_nccl` flag as AllReduce Closes https://github.com/caffe2/caffe2/pull/630 Differential Revision: D5149828 Pulled By: akyrola fbshipit-source-id: 12c9714c7fa078811f1cde61c8523dca8f7f968f
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Please use Github issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the BSD 2-Clause license.
Detailed build matrix (hit refresh if you see icons not showing up due to heroku):
Target | Status |
---|---|
Linux | |
Mac (CPU) | |
Android | |
iOS | |
Linux + MKL | |
Windows |