commit | 61356cbadca26965087dc21c0f578892a080d4ac | [log] [tgz] |
---|---|---|
author | Frank Jiang <ffjiang@fb.com> | Tue Jan 16 19:23:25 2018 -0800 |
committer | Facebook Github Bot <facebook-github-bot@users.noreply.github.com> | Tue Jan 16 19:39:31 2018 -0800 |
tree | 188a2739c32bc049a3cddcb4e9be5437a396e733 | |
parent | 6ba96952a6c7b9a6a495f333427c15c7ba18fa22 [diff] |
RowWiseSparseAdam operator Summary: Added the RowWise functionality for SparseAdam, which saves roughly 2/3 memory usage by only keeping one first and second moment term for each row of the parameter tensor, rather than one for each individual parameter. Differential Revision: D6679342 fbshipit-source-id: ce6fb27e35ce41a890c66f6089cd2748d10e7a44
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Please use Github issues (https://github.com/caffe2/caffe2/issues) to ask questions, report bugs, and request new features.
Please participate in our survey (https://www.surveymonkey.com/r/caffe2). We will send you information about new releases and special developer events/webinars.
Caffe2 is released under the Apache 2.0 license. See the NOTICE file for details.