[BUGFIX] fix log_sigmoid bugs #20372
Conversation
Hey @Adnios, thanks for submitting the PR.
CI supported jobs: [edge, centos-gpu, sanity, unix-gpu, windows-gpu, miscellaneous, unix-cpu, centos-cpu, clang, windows-cpu, website]
@mxnet-bot run ci [unix-gpu]

Jenkins CI successfully triggered: [unix-gpu]

@mxnet-bot run ci [unix-gpu]

Jenkins CI successfully triggered: [unix-gpu]

@szha @bartekkuncer @bgawrych could you help review?
```cpp
)code" ADD_FILELINE)
.set_attr<FCompute>("FCompute<cpu>", UnaryOp::Compute<cpu, mshadow_op::log_sigmoid>)
.set_attr<nnvm::FGradient>("FGradient", ElemwiseGradUseIn{"_backward_log_sigmoid"});
```
The previous version looks correct: `ElemwiseGradUseIn` makes the input of the elementwise function the input to the gradient function. Could you elaborate on which cases would fail, and why you need to change it to `ElemwiseGradUseOut` and change the definition?
I'm not sure how a scalar array would trigger the problem yet.
Hi @szha,
The reason a "scalar array would trigger the problem" is:
https://github.com/apache/incubator-mxnet/blob/835e25031f847b80277b6d11db0519723d26a80a/src/operator/nn/activation.cc#L126-L140
- If the input `x` is a scalar array, `SupportMKLDNNAct(param, inputs[0])` will return `false`.
- If the input `x` is a vector array, `SupportMKLDNNAct(param, inputs[0])` will return `true`, and the function `MKLDNNActivationBackward` will make it work. Maybe the following code takes effect (a simplified sketch of this dispatch follows below):
https://github.com/apache/incubator-mxnet/blob/835e25031f847b80277b6d11db0519723d26a80a/src/operator/nn/mkldnn/mkldnn_act.cc#L266-L272
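To make that dispatch concrete, here is a minimal standalone C++ sketch. Everything in it is illustrative: the `Tensor` stand-in, the `ndim >= 1` rule, and the function bodies are my simplification of `activation.cc`, not the real implementation.

```cpp
#include <cstdio>

// Stand-in for the real NDArray; only the rank matters here.
struct Tensor { int ndim; };

// Simplified stand-in for SupportMKLDNNAct: oneDNN activations
// reject 0-dim (scalar) arrays, so those fall back to the generic kernel.
bool SupportMKLDNNAct(const Tensor& in) { return in.ndim >= 1; }

// Simplified shape of the backward dispatch described above.
void ActivationGradComputeExCPU(const Tensor& in) {
  if (SupportMKLDNNAct(in)) {
    std::printf("MKLDNNActivationBackward path (uses y, gradient correct)\n");
  } else {
    std::printf("generic fallback path (was fed x, gradient wrong)\n");
  }
}

int main() {
  ActivationGradComputeExCPU(Tensor{0});  // scalar input -> buggy fallback
  ActivationGradComputeExCPU(Tensor{1});  // vector input -> oneDNN path
}
```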
There are two solutions to make scalar array input work.
- The input of `log_sigmoid_grad` should be `y`, so we can modify the following code, which takes `x` as its input. This is also what I am doing in this PR (see the sanity check after this list).
  https://github.com/apache/incubator-mxnet/blob/835e25031f847b80277b6d11db0519723d26a80a/src/operator/mshadow_op.h#L416
  ```cpp
  MXNET_UNARY_MATH_OP(log_sigmoid_grad, 1.0f - math::exp(a));
  ```
- Since `log_sigmoid_grad` takes `x` as input, we can also change the following code; with this change, `x` becomes the input to `log_sigmoid_grad`.
  https://github.com/apache/incubator-mxnet/blob/835e25031f847b80277b6d11db0519723d26a80a/src/operator/nn/activation-inl.h#L207-L210
  ```cpp
  case activation::kLogSigmoid:
    ActivationBackward<xpu, mshadow_op::log_sigmoid, mshadow_op::log_sigmoid_grad>(
        ctx, inputs[0], inputs[2], req[0], outputs[0]);
    break;
  ```
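As a sanity check on solution_1 (a standalone sketch with made-up values, independent of MXNet): since `y = log(sigmoid(x))`, the derivative is `dy/dx = 1 - sigmoid(x) = 1 - exp(y)`, so the formula `1.0f - math::exp(a)` is only correct when `a` is the output `y`, not the input `x`:

```cpp
#include <cmath>
#include <cstdio>

double log_sigmoid(double x) { return -std::log1p(std::exp(-x)); }

int main() {
  const double x = 1.5, eps = 1e-6;
  const double y = log_sigmoid(x);
  // Central finite difference as the reference gradient.
  const double numeric = (log_sigmoid(x + eps) - log_sigmoid(x - eps)) / (2 * eps);
  std::printf("numeric   : %.8f\n", numeric);            // ~0.18242552
  std::printf("1 - exp(y): %.8f\n", 1.0 - std::exp(y));  // matches the numeric value
  std::printf("1 - exp(x): %.8f\n", 1.0 - std::exp(x));  // wrong (~ -3.48)
}
```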
I think solution_1 is better. For `y = log_sigmoid(x)`, it calculates `dx` based on `(dy, y)` instead of `(dy, x)`, which enables in-place operation during `y = log_sigmoid(x)` (i.e. `y` and `x` share the same memory).
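To illustrate the in-place point with a standalone sketch (again illustrative, not MXNet code): when backward only reads `y`, the forward pass is free to overwrite `x`'s buffer with `y`, and the gradient can still be computed from that single buffer:

```cpp
#include <cmath>
#include <cstdio>

int main() {
  double buf = 1.5;                              // buf holds x
  buf = -std::log1p(std::exp(-buf));             // forward in-place: buf now holds y
  const double dy = 1.0;                         // incoming gradient
  const double dx = dy * (1.0 - std::exp(buf));  // backward needs only y, not x
  std::printf("dx = %.8f\n", dx);                // ~0.18242552, x no longer needed
}
```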
Another problem arose when I adopted solution_1: the gradient of `sym.log_sigmoid()` became wrong. The reason is that the input of `_backward_log_sigmoid` was `x`, while with solution_1 the input of `_backward_log_sigmoid` should be `y`. The source code of `sym.log_sigmoid()` is the following:
https://github.com/apache/incubator-mxnet/blob/835e25031f847b80277b6d11db0519723d26a80a/src/operator/tensor/elemwise_unary_op_basic.cc#L152-L167
So I changed it to `ElemwiseGradUseOut`, following the source code of `sym.sigmoid()`.
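For reference, the switch amounts to a one-line change in the registration quoted at the top of this thread (sketched here, not the literal diff from the PR):

```diff
-.set_attr<nnvm::FGradient>("FGradient", ElemwiseGradUseIn{"_backward_log_sigmoid"});
+.set_attr<nnvm::FGradient>("FGradient", ElemwiseGradUseOut{"_backward_log_sigmoid"});
```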
Thanks for the detailed analysis. The proposed change looks good and I have no further concerns.
Description
Solves #20371
Checklist
Essentials
Comments
- When the input's shape is `()`, the backward log_sigmoid activation is incorrect. For `y = log_sigmoid(x)`, the inputs for `log_sigmoid_grad` should be `(dy, y)`. This problem is similar to "_backward_softsign activation is incorrect" #10868.
- There is an error when running log_sigmoid on GPU; see `cudnn_activation-inl.h`:
  https://github.com/apache/incubator-mxnet/blob/da4ff3a4dc0bd6a54af3d75c492021d18ba1867b/src/operator/nn/cudnn/cudnn_activation-inl.h#L48-L65