1. 06 Feb, 2020 1 commit
  2. 17 Jan, 2020 1 commit
  3. 16 Jan, 2020 1 commit
  4. 14 Jan, 2020 1 commit
  5. 13 Jan, 2020 1 commit
  6. 09 Jan, 2020 1 commit
  7. 08 Jan, 2020 4 commits
  8. 07 Jan, 2020 2 commits
  9. 06 Jan, 2020 2 commits
  10. 04 Jan, 2020 2 commits
    • control flow: support optimizer called (#21851) · 7d8d4599
      liym27 committed
      * append optimize op in the grad block of current block if current block is in control flow. test=develop
      
      * add conditional grad op when optimizer used in control flow. test=develop
      
      * add comment and fix typo. test=develop
      
      * fix append_backward to support control flow. test=develop
      
      * add test. test=develop
      
      * fix copy_var_to_parent_block and conditional_block_grad. test=develop
      
      * fix bug: revert to append conditional_block_grad vars to sub grad block. test=develop
      
      * fix bug: revert to assign var to parent block even if var already is in parent block
      
      * fix bug: handle the case where outputs is empty. test=develop
      
      * move _rename_grad_ out. test=develop
      
      * modify code according to reviews from Huihuang. test=develop
      
      * modify code according to reviews from Jinle. test=develop
    • add package for centerloss (#21806) · 3fab86ff
      ShenLiang committed
  11. 03 Jan, 2020 3 commits
  12. 02 Jan, 2020 1 commit
  13. 31 Dec, 2019 2 commits
  14. 26 Dec, 2019 1 commit
  15. 24 Dec, 2019 3 commits
  16. 23 Dec, 2019 1 commit
  17. 20 Dec, 2019 1 commit
  18. 18 Dec, 2019 1 commit
  19. 17 Dec, 2019 1 commit
  20. 16 Dec, 2019 3 commits
  21. 11 Dec, 2019 1 commit
  22. 09 Dec, 2019 2 commits
  23. 06 Dec, 2019 3 commits
    • Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72
      Huihuang Zheng committed
      Add tests that use dy/dx to verify that the gradient values computed by the control flow backward pass are correct. Also fix the bugs detected by those tests.
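
A dy/dx test of this kind can be sketched in plain Python, without Paddle. The snippet below is only an illustration under stated assumptions: `cond_fn`, `analytic_grad`, `numeric_grad`, and `check` are hypothetical names that do not come from the PR, and the piecewise function merely stands in for a `cond` with two branches. It compares a hand-written gradient against a central finite difference.

```python
def cond_fn(x):
    """Hypothetical stand-in for a two-branch cond: x*x if x > 1, else 3*x."""
    return x * x if x > 1.0 else 3.0 * x


def analytic_grad(x):
    """Gradient the backward pass is expected to produce for cond_fn."""
    return 2.0 * x if x > 1.0 else 3.0


def numeric_grad(f, x, eps=1e-6):
    """Central finite difference approximation of dy/dx."""
    return (f(x + eps) - f(x - eps)) / (2.0 * eps)


def check(x, tol=1e-4):
    """True when the analytic and numeric gradients agree within tol."""
    return abs(numeric_grad(cond_fn, x) - analytic_grad(x)) < tol
```

The check is only meaningful away from the branch boundary (here x = 1), where the function is not differentiable and the finite difference would straddle both branches.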
      
      Fix bugs:
      
      1. Unlike sum_op, optimizer ops don't allow uninitialized input tensors. But in conditional_block_grad_op, the conditional_block may not run, so the output gradient tensor may be uninitialized, which causes an error in the optimizer op. To fix this, we should either let optimizer ops support uninitialized input, as sum_op does, or assign 0 to the uninitialized gradient when the conditional_block_grad_op doesn't run. I found that there are more than 10 optimizer ops. **To keep it simple, I just assign 0 to the output gradient of the conditional_block_grad_op in this PR**. It can be explored further whether optimizer ops can support uninitialized input tensors the way sum_op does, because in theory skipping the assignment in conditional_block_grad_op would be faster.
      
      2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in the global block. When op_desc infers shapes in a sub-block, it may not know the shapes of the gradients of parameters whose shape information is in the global block. I fixed it by inferring the shapes of gradients from the forward vars.
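
The two fixes above can be illustrated together with a small plain-Python sketch. It is not Paddle's implementation: `zeros`, `conditional_block_grad`, and `sgd_step` are hypothetical helpers, and the `"@GRAD"` suffix is used here only as an illustrative naming scheme. When the branch did not run, the gradient is filled with zeros whose shape is taken from the forward variable, so a downstream optimizer step always receives an initialized input.

```python
def zeros(shape):
    """Build a nested list of 0.0 values with the given shape tuple."""
    if len(shape) == 0:
        return 0.0
    return [zeros(shape[1:]) for _ in range(shape[0])]


def conditional_block_grad(pred, forward_var_shapes, branch_grad_fn):
    """Hypothetical grad op for a conditional block.

    If the branch ran (pred is True), return its real gradients.
    Otherwise return zero gradients shaped like the forward vars.
    """
    if pred:
        return branch_grad_fn()
    return {name + "@GRAD": zeros(shape)
            for name, shape in forward_var_shapes.items()}


def sgd_step(param, grad, lr):
    """Plain SGD update on a 1-D parameter; needs an initialized grad."""
    return [p - lr * g for p, g in zip(param, grad)]
```

With a zero gradient the SGD step leaves the parameter unchanged, which matches the intended behavior when the conditional block is skipped. The cost is the extra assignment that the commit message mentions.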
      
      This PR also does some code cleanup:
      1. Print the var name when sgd_op catches a shape error, so that it is easier to debug
      2. Fix a typo: dicta -> dict
    • add fluid.layers.gelu & doc (#21515) · 2057df7a
      Feiyu Chan committed
      Add a Python interface for Gelu.
      Add documentation for fluid.layers.gelu.
      2057df7a
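
For reference, the GELU activation in its standard erf-based form is gelu(x) = 0.5 * x * (1 + erf(x / sqrt(2))). The sketch below implements that formula with the Python standard library only. It is not the code added by this commit, and whether `fluid.layers.gelu` also exposes an approximate variant is not stated here.

```python
import math


def gelu(x):
    """Standard erf-based GELU: x times the standard normal CDF at x."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))
```

For large positive inputs the function approaches the identity, and for large negative inputs it approaches zero.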
    • fix doc typo test=develop (#21566) · 29c38445
      wangchaochaohu committed
  24. 05 Dec, 2019 1 commit