commit | 98cf176baadd1cd67572e3065858884f634e4da6 | |
---|---|---|
author | Aapo Kyrola <akyrola@fb.com> | Sat May 06 19:26:24 2017 -0700 |
committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Sun May 07 01:08:42 2017 -0700 |
tree | fe17f95a9e64028b9ad978caa6eaae12b783ad78 | |
parent | 8f692b5642db3a139b7b2e5d8f3857881d1fe21e | |
improve style + a bit of perf for ScatterWeightedSum CUDA

Summary: For perf, it is better to check weight0 inside the kernel and avoid host synchronization when copying to a stack variable. Improved style a bit (github does not have Lint, so contributed code may not conform to our style).

Differential Revision: D5011668

fbshipit-source-id: 1eb85912f6f499acd3190cfcb59e7e39c2220d89
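The performance point in the summary can be illustrated with a minimal CUDA sketch. This is not the Caffe2 code from this commit. The names `ScaleSlicesKernel` and `ScaleSlices` are hypothetical, and the sketch assumes float data, unique and non-empty indices, and a weight that lives in device memory. It shows only the idea of reading `weight0` through a device pointer and doing the `== 1` check inside the kernel, so the host never copies the weight back and never has to synchronize.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical kernel: scales the rows of `data` listed in `indices` by *weight0.
// weight0 is a device pointer, so the comparison against 1 happens on the GPU.
__global__ void ScaleSlicesKernel(float* data, const float* weight0,
                                  const int* indices, int num_indices,
                                  int slice_size) {
  const float w0 = *weight0;  // read on the device, no host copy needed
  if (w0 == 1.0f) return;     // scaling by 1 is a no-op, decided in-kernel
  const int total = num_indices * slice_size;
  for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < total;
       i += blockDim.x * gridDim.x) {
    const int row = indices[i / slice_size];  // assumes indices are unique
    data[row * slice_size + i % slice_size] *= w0;
  }
}

// Hypothetical host helper for demonstration. It uploads w0 only because this
// standalone example starts from host data; in an operator the weight would
// already be a device tensor. Assumes `data` and `indices` are non-empty.
std::vector<float> ScaleSlices(std::vector<float> data, float w0,
                               const std::vector<int>& indices,
                               int slice_size) {
  float* d_data = nullptr;
  float* d_w0 = nullptr;
  int* d_idx = nullptr;
  const size_t data_bytes = data.size() * sizeof(float);
  const size_t idx_bytes = indices.size() * sizeof(int);
  cudaMalloc(&d_data, data_bytes);
  cudaMalloc(&d_w0, sizeof(float));
  cudaMalloc(&d_idx, idx_bytes);
  cudaMemcpy(d_data, data.data(), data_bytes, cudaMemcpyHostToDevice);
  cudaMemcpy(d_w0, &w0, sizeof(float), cudaMemcpyHostToDevice);
  cudaMemcpy(d_idx, indices.data(), idx_bytes, cudaMemcpyHostToDevice);

  const int threads = 128;
  const int total = static_cast<int>(indices.size()) * slice_size;
  const int blocks = (total + threads - 1) / threads;
  ScaleSlicesKernel<<<blocks, threads>>>(
      d_data, d_w0, d_idx, static_cast<int>(indices.size()), slice_size);

  // Copying the result back is the only point where the host waits.
  cudaMemcpy(data.data(), d_data, data_bytes, cudaMemcpyDeviceToHost);
  cudaFree(d_data);
  cudaFree(d_w0);
  cudaFree(d_idx);
  return data;
}
```

The alternative, copying `weight0` into a host stack variable before the launch so the host can decide whether to scale, forces the host to wait for the device at that point. Passing the pointer through keeps the launch asynchronous.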
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Please use Github issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the BSD 2-Clause license.
Detailed build matrix (hit refresh if you see icons not showing up due to heroku):
Target | Status |
---|---|
Linux | |
Mac (CPU) | |
Android | |
iOS | |
Linux + MKL | |
Windows | |