| commit | 01c76bf830d050995314bf2ae1a88ab35ce1edbc | |
|---|---|---|
| author | James Reed <jamesreed@fb.com> | Thu Apr 20 18:20:49 2017 -0700 |
| committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Thu Apr 20 18:31:40 2017 -0700 |
| tree | aae8d8587c5144dae11453128291ee4000e5dc41 | |
| parent | 9f86de2dc76e072a409bd97ceb6cd444206ceaa8 | |

Optimize TransposeOp by using strided access pattern, bulk memory transfer, and other profile-guided optimizations

Summary: Work in progress for improving the performance of the TransposeOp on CPU. This is used extensively for inference in several neural MT systems, so optimizing this function is worthwhile and will reduce request latency.

Differential Revision: D4913075

fbshipit-source-id: fa2742829291d91f3eba00fdfe7d6c0dae83e206

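
The commit message names two techniques: strided access and bulk memory transfer. The following is a minimal sketch of how they can combine in a transpose. It is not the code from this commit, which is not shown here, and the real operator is written in C++. The helper names `row_major_strides` and `transpose` are hypothetical. The sketch assumes flat row-major data and uses a Python slice assignment to stand in for a bulk copy: when the trailing axes are left unpermuted, they form a contiguous block in both source and destination, so the block can be moved in one operation instead of element by element.

```python
from itertools import product
from math import prod


def row_major_strides(shape):
    """Element strides of a contiguous row-major array with this shape."""
    strides = [1] * len(shape)
    for i in range(len(shape) - 2, -1, -1):
        strides[i] = strides[i + 1] * shape[i + 1]
    return strides


def transpose(src, shape, axes):
    """Transpose flat row-major data `src` of `shape` by the permutation `axes`.

    Illustrative only: returns (flat_output, output_shape).
    """
    ndim = len(shape)
    out_shape = [shape[a] for a in axes]
    in_strides = row_major_strides(shape)
    # Source stride to apply for one step along each output axis.
    perm_strides = [in_strides[a] for a in axes]

    # Count trailing axes that stay in place; together they form a block
    # that is contiguous in both the source and the destination.
    keep = 0
    while keep < ndim and axes[ndim - 1 - keep] == ndim - 1 - keep:
        keep += 1
    block = prod(shape[ndim - keep:])
    outer = ndim - keep

    dst = [None] * len(src)
    pos = 0
    # Walk the permuted leading axes in output order.
    for idx in product(*(range(n) for n in out_shape[:outer])):
        # Strided lookup of where this block starts in the source.
        start = sum(i * s for i, s in zip(idx, perm_strides))
        # Slice assignment plays the role of a bulk memory copy.
        dst[pos:pos + block] = src[start:start + block]
        pos += block
    return dst, out_shape
```

For example, `transpose(list(range(24)), [2, 3, 4], [1, 0, 2])` swaps the first two axes and copies the last axis in blocks of 4 elements. When every axis is permuted, the block length falls back to 1 and the loop degenerates to a plain strided element copy.
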
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Please use GitHub issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the BSD 2-Clause license.
| Target | Status |
|---|---|
| Linux | |
| Android | |
| iOS | |
| Linux + MKL | |