commit | 3f860af0509de000590e86ae62309295a0114cac | [log] [tgz] |
---|---|---|
author | Jeff Johnson <jhj@fb.com> | Sat Jun 17 08:42:29 2017 -0700 |
committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Sat Jun 17 08:47:38 2017 -0700 |
tree | 105e58826c2e0d80e9d7c55ecd0a64c662d2b7ca | |
parent | 044679ca7e329a4b280a50e9cf44f73015d99eb6 [diff] |
Implement TopKOp for GPU Summary: This is a real implementation (not GPUFallbackOp) of the TopKOp for GPU. There are two algorithm implementations: -for k <= 512, it maps to a warp-wide min-heap implementation, which requires only a single scan of the input data. -for k > 512, it maps to a multi-pass radix selection algorithm that I originally wrote in cutorch. I took the recent cutorch code and removed some cutorch-specific things as it made sense. Also added several utility files that one or the other implementations use, some from the Faiss library and some from the cutorch library. Reviewed By: jamesr66a Differential Revision: D5248206 fbshipit-source-id: ae5fa3451473264293516c2838f1f40688781cf3
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Caffe2 Bay Area Meetup at NVIDIA, May 31 6-8:30, Santa Clara, CA: https://www.meetup.com/Caffe2-Bay-Area/events/239836290/
Caffe2 Community Facebook Group: join to ask questions, talk to other users, and keep informed of important Caffe2 updates.
Please use Github issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the BSD 2-Clause license.