fix CuDNN RecurrentOp Gradient init
Summary: The CuDNN RecurrentNet GradientOp did not pass the dropout information (the DROPOUT_STATES blob) to the initializer, so it estimated an incorrect scratch space size. We have an assertion enforcing that the scratch space is the same for the forward and backward ops, so this tripped that assertion. We currently hard-code dropout to 1.0, so this has had no effect on correctness in our tests. For reasons unclear, there was no issue with num_layers=1, but with num_layers>=2 the scratch space sizes differed.
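
For context, a minimal standalone sketch (not the operator code; shapes, names, and the 0.5 dropout value are made up for illustration, and it assumes the cuDNN 5/6-era cudnnSetRNNDescriptor signature) of why the dropout descriptor has to match: two RNN descriptors that differ only in their dropout descriptor can report different reserve ("scratch") sizes from cudnnGetRNNTrainingReserveSize, which is exactly the forward/backward mismatch the assertion catches.

  #include <cudnn.h>
  #include <cuda_runtime.h>
  #include <cstdio>
  #include <cstdlib>
  #include <vector>

  // Abort-on-error helper for this sketch; the real operator uses CUDNN_ENFORCE.
  #define CHECK_CUDNN(expr)                                               \
    do {                                                                  \
      cudnnStatus_t _s = (expr);                                          \
      if (_s != CUDNN_STATUS_SUCCESS) {                                   \
        std::fprintf(stderr, "cuDNN error %d at %s\n", (int)_s, #expr);   \
        std::exit(1);                                                     \
      }                                                                   \
    } while (0)

  int main() {
    // Illustrative shapes; num_layers >= 2 is where the mismatch showed up.
    const int seqLength = 4, batch = 2, inputSize = 8, hiddenSize = 8, numLayers = 2;

    cudnnHandle_t handle;
    CHECK_CUDNN(cudnnCreate(&handle));

    // Per-timestep input descriptors (batch x inputSize x 1), shared by both queries.
    std::vector<cudnnTensorDescriptor_t> xDesc(seqLength);
    for (int t = 0; t < seqLength; ++t) {
      CHECK_CUDNN(cudnnCreateTensorDescriptor(&xDesc[t]));
      const int dims[3] = {batch, inputSize, 1};
      const int strides[3] = {inputSize, 1, 1};
      CHECK_CUDNN(cudnnSetTensorNdDescriptor(xDesc[t], CUDNN_DATA_FLOAT, 3, dims, strides));
    }

    // Dropout descriptor A: configured with real state memory, like the forward
    // op's initialize() that receives the DROPOUT_STATES blob.
    cudnnDropoutDescriptor_t dropWithStates;
    CHECK_CUDNN(cudnnCreateDropoutDescriptor(&dropWithStates));
    size_t stateSize = 0;
    CHECK_CUDNN(cudnnDropoutGetStatesSize(handle, &stateSize));
    void* states = nullptr;
    if (cudaMalloc(&states, stateSize) != cudaSuccess) return 1;
    CHECK_CUDNN(cudnnSetDropoutDescriptor(
        dropWithStates, handle, /*dropout=*/0.5f, states, stateSize, /*seed=*/0));

    // Dropout descriptor B: no dropout state information, like the gradient
    // op's initialize() before this fix.
    cudnnDropoutDescriptor_t dropWithoutStates;
    CHECK_CUDNN(cudnnCreateDropoutDescriptor(&dropWithoutStates));
    CHECK_CUDNN(cudnnSetDropoutDescriptor(
        dropWithoutStates, handle, /*dropout=*/0.0f, nullptr, 0, /*seed=*/0));

    // Query the training reserve ("scratch") size for an RNN descriptor built
    // around a given dropout descriptor.
    auto reserveSize = [&](cudnnDropoutDescriptor_t drop) {
      cudnnRNNDescriptor_t rnn;
      CHECK_CUDNN(cudnnCreateRNNDescriptor(&rnn));
      CHECK_CUDNN(cudnnSetRNNDescriptor(  // cuDNN 5/6-era signature
          rnn, hiddenSize, numLayers, drop,
          CUDNN_LINEAR_INPUT, CUDNN_UNIDIRECTIONAL, CUDNN_LSTM, CUDNN_DATA_FLOAT));
      size_t bytes = 0;
      CHECK_CUDNN(cudnnGetRNNTrainingReserveSize(handle, rnn, seqLength, xDesc.data(), &bytes));
      CHECK_CUDNN(cudnnDestroyRNNDescriptor(rnn));
      return bytes;
    };

    // Forward and backward must agree on this number; if their dropout
    // descriptors differ, the sizes can differ and the assertion fires.
    std::printf("reserve with dropout states:    %zu bytes\n", reserveSize(dropWithStates));
    std::printf("reserve without dropout states: %zu bytes\n", reserveSize(dropWithoutStates));
    return 0;
  }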
Reviewed By: salexspb
Differential Revision: D4904715
fbshipit-source-id: 780266c5ecf1f7a32387edcb6fc498a13ac782ac
diff --git a/caffe2/operators/recurrent_op_cudnn.cc b/caffe2/operators/recurrent_op_cudnn.cc
index 6951852..e36c2e3 100644
--- a/caffe2/operators/recurrent_op_cudnn.cc
+++ b/caffe2/operators/recurrent_op_cudnn.cc
@@ -295,7 +295,7 @@
bool RecurrentGradientOp<T>::RunOnDevice() {
const int seqLength = Input(INPUT).dim32(0);
if (Input(INPUT).dims() != cachedInputDims_) {
- initialize(Input(INPUT));
+ initialize(Input(INPUT), Output(DROPOUT_STATES));
cachedInputDims_ = Input(INPUT).dims();
}
CUDNN_ENFORCE(cudnnGetRNNTrainingReserveSize(