commit | 85f1d947dd5324925e59d00e248f3f2b299b2e9f | [log] [tgz] |
---|---|---|
author | James Reed <jamesreed@fb.com> | Wed May 17 20:26:40 2017 -0700 |
committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Wed May 17 20:33:36 2017 -0700 |
tree | 225b936ec1aa17099d09fec39ee78728c404cb32 | |
parent | 12edbcb154076b7dadd741aa7ff0e4d516829368 [diff] |
Vectorize SigmoidOp on CPU Summary: I noticed that Sigmoid was taking an inordinate amount of time in our NMT benchmark, so I looked at the implementation and it didn't seem optimal. I replaced the implementation with an Eigen version so that when the Eigen update goes through, we will get proper AVX(2) vectorization. Differential Revision: D5082464 fbshipit-source-id: aa951f7d730fc05198f7dd04076ec58d471b74c8
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Please use Github issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the BSD 2-Clause license.
Detailed build matrix (hit refresh if you see icons not showing up due to heroku):
Target | Status |
---|---|
Linux | |
Mac (CPU) | |
Android | |
iOS | |
Linux + MKL | |
Windows |