1. 06 Jan, 2020 1 commit
  2. 04 Jan, 2020 2 commits
    • control flow: support optimizer called (#21851) · 7d8d4599
      liym27 committed
      * append optimize op in the grad block of current block if current block is in control flow. test=develop
      
      * add conditional grad op when optimizer used in control flow. test=develop
      
      * add comment and fix typo. test=develop
      
      * fix append_backward to support control flow. test=develop
      
      * add test. test=develop
      
      * fix copy_var_to_parent_block and conditional_block_grad. test=develop
      
      * fix bug: revert to append conditional_block_grad vars to sub grad block. test=develop
      
      * fix bug: revert to assign var to parent block even if var already is in parent block
      
      * fix bug: handle the case where outputs is empty. test=develop
      
      * move _rename_grad_ out. test=develop
      
      * modify code according to reviews from Huihuang. test=develop
      
      * modify code according to reviews from Jinle. test=develop
    • add package for centerloss (#21806) · 3fab86ff
      ShenLiang committed
  3. 03 Jan, 2020 3 commits
  4. 02 Jan, 2020 1 commit
  5. 31 Dec, 2019 2 commits
  6. 26 Dec, 2019 1 commit
  7. 24 Dec, 2019 3 commits
  8. 23 Dec, 2019 1 commit
  9. 20 Dec, 2019 1 commit
  10. 18 Dec, 2019 1 commit
  11. 17 Dec, 2019 1 commit
  12. 16 Dec, 2019 3 commits
  13. 11 Dec, 2019 1 commit
  14. 09 Dec, 2019 2 commits
  15. 06 Dec, 2019 3 commits
    • Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72
      Huihuang Zheng committed
      Add tests that use dy/dx to make sure the gradient values calculated by the control flow backward pass are correct. Also fix the bugs detected by those tests.
      
      Fix bugs:
      
      1. Unlike sum_op, optimizer ops do not accept uninitialized input tensors. In conditional_block_grad_op, however, the conditional_block may not run, so the output gradient tensor may be left uninitialized, which makes the optimizer op fail. There are two possible fixes: let optimizer ops accept uninitialized input as sum_op does, or set the uninitialized gradient to 0 when conditional_block_grad_op does not run. There are more than 10 optimizer ops, so **to keep things simple, this PR just sets the output gradient of conditional_block_grad_op to 0**. Whether optimizer ops can support uninitialized input tensors like sum_op is worth exploring further, because in theory skipping the assignment in conditional_block_grad_op would be faster.
      
      2. Infer parameter shapes during append_backward. I had not realized that all our parameters live in the global block. When op_desc infers shapes in a sub-block, it may not know the shapes of parameter gradients, because that shape information is in the global block. I fixed this by inferring the gradient shapes from the forward vars.
      
      This PR also does some code cleanup:
      1. Print the var name when sgd_op catches a shape error, so that it is easier to debug.
      2. Fix a typo: dicta -> dict
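
The first fix above (zero-filling a gradient that was never written) can be illustrated with a minimal pure-Python sketch. It does not use PaddlePaddle; `zero_fill_missing_grads` and `sgd_step` are hypothetical helpers, plain lists stand in for tensors, and `None` stands in for an uninitialized gradient tensor.

```python
from typing import Dict, List, Optional

Params = Dict[str, List[float]]
Grads = Dict[str, Optional[List[float]]]


def zero_fill_missing_grads(params: Params, grads: Grads) -> Dict[str, List[float]]:
    """Replace missing (None) gradients with zeros shaped like the parameter."""
    filled = {}
    for name, value in params.items():
        grad = grads.get(name)
        filled[name] = [0.0] * len(value) if grad is None else list(grad)
    return filled


def sgd_step(params: Params, grads: Dict[str, List[float]], lr: float) -> Params:
    """Plain SGD update; it needs an initialized gradient for every parameter."""
    return {
        name: [p - lr * g for p, g in zip(value, grads[name])]
        for name, value in params.items()
    }


params = {"w": [1.0, 2.0], "b": [0.5]}
# Assumed scenario: the branch that uses "b" did not run,
# so its gradient was never written.
grads = {"w": [0.1, -0.2], "b": None}

updated = sgd_step(params, zero_fill_missing_grads(params, grads), lr=0.1)
```

With the zero gradient in place, the parameter whose branch did not run is left unchanged by the update, at the cost of one extra assignment.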
    • add fluid.layers.gelu & doc (#21515) · 2057df7a
      Feiyu Chan committed
      Add a Python interface for Gelu.
      Add documentation for fluid.layers.gelu.
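
For reference, the standard (exact) GELU definition is gelu(x) = 0.5 * x * (1 + erf(x / sqrt(2))). The sketch below evaluates it with the Python standard library only; it is not the fluid.layers.gelu implementation, and the local `gelu` function is a hypothetical helper for illustration.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x times the standard normal CDF evaluated at x."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


# Evaluate at a few points; negative inputs are damped rather than zeroed.
samples = [gelu(v) for v in (-1.0, 0.0, 1.0)]
```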
    • fix doc typo test=develop (#21566) · 29c38445
      wangchaochaohu committed
  16. 05 Dec, 2019 4 commits
    • Add dygraph linear warm up decay (#21186) · 08483a68
      hong committed
      * dygraph mode support linear lr warm up; test=develop
      
      * add unit test for linear warmup; test=develop
      
      * add input type check; test=develop
      
      * fix type check assert error; test=develop
      
      * change type error; test=develop
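
A linear warm-up schedule with an input type check, as the bullets above describe, can be sketched generically as follows. This is not the code from the commit: `linear_warmup_lr` and all of its parameter names are assumptions made for illustration, and the rule shown (interpolate from `start_lr` to `end_lr` over `warmup_steps`, then use `base_lr`) is the common textbook form.

```python
def linear_warmup_lr(step: int, warmup_steps: int,
                     start_lr: float, end_lr: float, base_lr: float) -> float:
    """Return the learning rate for `step` under linear warm-up."""
    # Input type check: step counters must be integers.
    if not isinstance(step, int) or not isinstance(warmup_steps, int):
        raise TypeError("step and warmup_steps must be integers")
    if warmup_steps <= 0:
        raise ValueError("warmup_steps must be positive")
    if step < warmup_steps:
        # Linear interpolation between start_lr and end_lr.
        return start_lr + (end_lr - start_lr) * step / warmup_steps
    # After warm-up, fall back to the base learning rate.
    return base_lr


schedule = [linear_warmup_lr(s, 10, 0.0, 0.1, 0.1) for s in range(12)]
```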
    • Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e
      Leo Chen committed
      * test=develop, fix docker with paddle nccl problem
      
      * don't expose numerous Tensor.set(), test=develop
      
      * fix condition, test=develop
      
      * fix float16 bug, test=develop
      
      * feed should be Tensor or np.array, not Variable or number, test=develop
      
      * use forcecast to copy numpy slice to new array, test=develop
      
      * remove float16-uint16 hacking, test=develop
      
      * add variable method to varbase and refactor to_variable to support return varbase
      
      * support kwargs in varbase constructor
      
      * add VarBase constructor to support default python args
      
      * refine varbase initial method
      
      * reset branch
      
      * fix ut for change VarBase error info to PaddleEnforce
      
      * cherry is parameter change before
      
      * overload isinstance to replace too many change of is_variable
      
      * rm useless files
      
      * rm useless code merged by git
      
      * test=develop, fix some ut failed error
      
      * test=develop, fix test_graph_wrapper
      
      * add some tests, test=develop
      
      * refine __getitem__, test=develop
      
      * add tests, test=develop
      
      * fix err_msg, test=develop
  17. 04 Dec, 2019 1 commit
  18. 03 Dec, 2019 5 commits
  19. 02 Dec, 2019 4 commits