commit | b070197e8a44bd77d778bf153935e5d58fce239d | |
---|---|---|
author | Yiming Wu <wyiming@fb.com> | Thu May 11 20:58:04 2017 -0700 |
committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Thu May 11 21:08:10 2017 -0700 |
tree | ffa5f46324a8e603d7342dd1c51c557f0a55a66f | |
parent | 942f53b5a6efd9ab147be5f4b5abc2bb71a0e437 | |
cuda unique op

Summary: cuda unique op, unittest provided, will provide benchmark against CPU.

SpeedUp results for synthetic real data. Input of size 20k, range [1, 10 million], **~5x** speedup:

- CPU 9.05795 (ms) Unique
- GPU 1.79434 (ms) Unique

SpeedUp results for 5x synthetic data. Input of size 1 million, range [1, 10 million], **~13.7x** speedup:

- CPU 54.7539 (ms) Unique
- GPU 3.99473 (ms) Unique

Reviewed By: akyrola

Differential Revision: D5007726

fbshipit-source-id: 0a00c518fd1809d0ae8c6cfcba09b0bd982ffaff
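For readers unfamiliar with the operation, the following is a minimal Python sketch of what a unique op computes and how the quoted speedup figures follow from the timings. It is not the code in this commit. The sort-then-compact approach is a common pattern for GPU deduplication and is assumed here, as is the ascending output order. The names `unique_with_remapping` and `speedup` are hypothetical.

```python
def unique_with_remapping(values):
    """Sort-based deduplication (illustrative only, not the commit's kernel).

    Returns (unique, remapping) such that unique[remapping[i]] == values[i].
    """
    # Step 1: order the input positions by value.
    # A GPU version would typically use a parallel sort here (assumption).
    order = sorted(range(len(values)), key=lambda i: values[i])

    unique = []
    remapping = [0] * len(values)
    # Step 2: walk the sorted positions and open a new output slot
    # each time the value changes.
    for i in order:
        if not unique or unique[-1] != values[i]:
            unique.append(values[i])
        remapping[i] = len(unique) - 1
    return unique, remapping


def speedup(cpu_ms, gpu_ms):
    """Ratio of CPU time to GPU time for the same workload."""
    return cpu_ms / gpu_ms


if __name__ == "__main__":
    uniq, remap = unique_with_remapping([5, 2, 5, 9, 2])
    print(uniq)   # [2, 5, 9]
    print(remap)  # [1, 0, 1, 2, 0]

    # Timings quoted in the commit message above.
    print(speedup(9.05795, 1.79434))   # 20k-element input
    print(speedup(54.7539, 3.99473))   # 1-million-element input
```

The two ratios come out slightly above 5 and close to 13.7, which matches the speedups stated in the commit message.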
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Please use GitHub issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the BSD 2-Clause license.
Detailed build matrix (hit refresh if you see icons not showing up due to heroku):
Target | Status |
---|---|
Linux | |
Mac (CPU) | |
Android | |
iOS | |
Linux + MKL | |
Windows | |