Adding the implementation for rmsprop operator #4565
kavyasrinet merged 6 commits into PaddlePaddle:develop from
Conversation
dzhwinter
left a comment
Good job! There are a few small pieces that need fixing.
paddle/operators/rmsprop_op.cc
Outdated
```cpp
    : OpProtoAndCheckerMaker(proto, op_checker) {
  AddInput("Param", "Input parameter");
  AddInput("Grad", "Input gradient");
  AddInput("Moment", "Second moment");
```
There is a typo here: it should be the momentum. Also, the comment is not helpful; we should make comments self-explanatory, in the format (type): comment, e.g. (tensor): blabla.
This is a good point, will fix this.
paddle/operators/rmsprop_op.cc
Outdated
```
RMSprop

MomentOut = decayRate * Moment + (1 - decayRate) * Grad * Grad
ParamOut = Param - learningRate * Grad / (sqrt(MomentOut) + epsilon)
```
The old version of Paddle, TensorFlow, and Caffe2 have all implemented the RMSProp algorithm, and they all follow the parameter names from the paper's formulas. Users are used to seeing the same names across different frameworks.
https://caffe2.ai/docs/operators-catalogue.html#rmsprop
tensorflow
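As a rough sketch, the no-momentum variant quoted in the diff above can be reproduced in NumPy. This mirrors the two formulas from the diff; the function name and the default hyperparameter values are illustrative choices, not part of the operator:

```python
import numpy as np

def rmsprop_step(param, grad, moment, decay_rate=0.9,
                 learning_rate=0.01, epsilon=1e-6):
    """One RMSProp update (no momentum). Defaults are illustrative."""
    # MomentOut = decayRate * Moment + (1 - decayRate) * Grad * Grad
    moment_out = decay_rate * moment + (1 - decay_rate) * grad * grad
    # ParamOut = Param - learningRate * Grad / (sqrt(MomentOut) + epsilon)
    param_out = param - learning_rate * grad / (np.sqrt(moment_out) + epsilon)
    return param_out, moment_out
```

With a zero-initialized moment, the first step divides by roughly `(1 - decay_rate) ** 0.5 * |grad|`, which is why epsilon is needed for stability when gradients are near zero.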
paddle/operators/rmsprop_op.h
Outdated
```cpp
class RmspropOpKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
    auto param_out = ctx.Output<Tensor>("ParamOut");
```
Here please use auto* since param_out is a pointer. The auto keyword always hides the real type; making the pointer explicit is clearer for users who read the code.
I see. I had assumed for now that auto would resolve this by itself, but I see the point of making it more understandable for readers. Will fix.
```python
param = np.random.random((123, 321)).astype("float32")
grad = np.random.random((123, 321)).astype("float32")
moment = np.zeros((123, 321)).astype("float32")
learning_rate = np.array([0.01]).astype("float32")
```
why is 'learning_rate' not an attribute?
As Abhinav commented above, I think I will retain this as an input for now, given that we are doing the same in all other PRs too.
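For context, test inputs like the ones quoted above are typically paired with reference outputs computed directly in NumPy. The snippet below is a sketch of that pattern; the epsilon and decayRate values here are illustrative, not taken from the actual test file:

```python
import numpy as np

# Inputs mirroring the quoted test setup
param = np.random.random((123, 321)).astype("float32")
grad = np.random.random((123, 321)).astype("float32")
moment = np.zeros((123, 321)).astype("float32")
learning_rate = np.array([0.01]).astype("float32")

# Illustrative attribute values (assumptions, not from the PR)
epsilon = 1e-6
decay_rate = 0.9

# Reference outputs computed with the operator's formulas
moment_out = decay_rate * moment + (1 - decay_rate) * grad * grad
param_out = param - learning_rate * grad / (np.sqrt(moment_out) + epsilon)
```

The operator's outputs are then compared element-wise against `param_out` and `moment_out`.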
```
http://www.cs.toronto.edu/~tijmen/csc321/slides/lecture_slides_lec6.pdf)
does not have the epsilon attribute. It is added here for numerical stability
to avoid division by zero.
```
We'd better use the commonly used parameter names, since this is a popular optimizer and users are used to the same names across frameworks.
https://github.com/tensorflow/tensorflow/blob/994226a4a992c4a0205bca9e2f394cb644775ad7/tensorflow/core/ops/training_ops.cc#L1281
https://caffe2.ai/docs/operators-catalogue.html#rmsprop
Adding the implementation for RMSprop:

```
MeanSquareOut = decay * MeanSquare + (1 - decay) * Grad * Grad
MomentOut = momentum * Moment + LearningRate * Grad / sqrt(MeanSquareOut + epsilon)
ParamOut = Param - MomentOut
```
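The three update equations in the PR description can be sketched in NumPy as follows. Variable names mirror the formulas; the function name and default hyperparameter values are illustrative assumptions:

```python
import numpy as np

def rmsprop_momentum_step(param, grad, mean_square, moment,
                          decay=0.9, momentum=0.0,
                          learning_rate=0.01, epsilon=1e-6):
    """One RMSProp-with-momentum update. Defaults are illustrative."""
    # MeanSquareOut = decay * MeanSquare + (1 - decay) * Grad * Grad
    mean_square_out = decay * mean_square + (1 - decay) * grad * grad
    # MomentOut = momentum * Moment + LearningRate * Grad / sqrt(MeanSquareOut + epsilon)
    moment_out = momentum * moment + \
        learning_rate * grad / np.sqrt(mean_square_out + epsilon)
    # ParamOut = Param - MomentOut
    param_out = param - moment_out
    return param_out, mean_square_out, moment_out
```

With momentum set to 0, this reduces to the plain RMSProp rule discussed earlier in the thread, except that epsilon is added inside the square root rather than outside, matching the description above.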