1. 06 9月, 2021 1 次提交
  2. 30 7月, 2021 1 次提交
    • J
      Added reshape, reshape2, squeeze and squeeze2 BF16/FP32 FWD/BWD kernels (#34219) · 22c4c189
      jakpiase 提交于
      * test version of matmul_v2
      
      * added matmul_v2 grad kernel
      
      * minor changes
      
      * minor changes
      
      * minor change for CI approval
      
      * CI fix
      
      * CI fix
      
      * added squeeze and squeeze2 kernels
      
      * CI fix
      
      * CI fix
      
      * CI fix
      
      * disabled tests when compiled with cuda
      
      * added setting format_tag by strides
      
      * added sigmoid BF16 FWD/BWD and gelu BF16 BWD
      
      * changes after review
      
      * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD"
      
      This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op"
      
      This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing
      changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * minor change
      
      * added reshape1/2 kernels
      
      * moved some functions into private block
      
      * CI fix
      
      * CI fix
      
      * CI fix
      22c4c189
  3. 20 11月, 2020 1 次提交
  4. 12 10月, 2020 1 次提交
  5. 08 8月, 2020 1 次提交
    • L
      refine unsqueeze, test=develop (#25470) · 5258d53d
      Leo Chen 提交于
      * refine unsqueeze, test=develop
      
      * update unsqueeze, test=develop
      
      * refine unsqueeze, test=develop
      
      * refine unsqueeze, test=develop
      
      * update
      
      * remove None, test=develop
      
      * follow comments
      
      * support bool
      
      * update doc
      
      * follow comments
      
      * merge develop
      5258d53d
  6. 23 7月, 2020 1 次提交
    • L
      Refine squeeze, test=develop (#25281) · 4ec1251a
      Leo Chen 提交于
      * refine squeeze, test=develop
      
      * update squeeze, test=develop
      
      * refine compile-time infershape, test=develop
      
      * add more unittest, test=develop
      
      * follow comments, test=develop
      
      * add update_api, test=develop
      
      * follow comments, test=develop
      4ec1251a
  7. 27 5月, 2020 1 次提交
  8. 13 5月, 2020 1 次提交
  9. 25 3月, 2020 1 次提交
  10. 09 3月, 2020 1 次提交
  11. 29 11月, 2019 1 次提交
    • L
      add unused input vars check for OpWithKernel, test=develop (#21169) · e0c9d856
      Leo Chen 提交于
      * add unused input vars check for OpWithKernel, test=develop
      
      * remove unused vars in some ops, test=develop
      
      * fix batch_norm, test=develop
      
      * add white list, test=develop
      
      * add CI check for white list, test=develop
      
      * :ove white list to c++, test=develop
      
      * solve failure of CI, test=develop
      
      * add unittest for unused_var_check, test=develop
      
      * refine code, enable check in operator_test, test=develop
      
      * skip mkldnn, test=develop
      
      * extend white list, test=develop
      
      * refine condition of mkldnn, test=develop
      
      * fix paddle_build, test=develop
      
      * follow comments, test=develop
      
      * fix GetExpectedKernelType
      
      * add wiki ref to err_msg, test=develop
      
      * follow comment, test=develop
      e0c9d856
  12. 31 10月, 2019 1 次提交
    • H
      GradMaker for dygraph (#19706) · 8c4573a3
      hong 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add impertive type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix infernce_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
      8c4573a3
  13. 28 10月, 2019 1 次提交
  14. 09 10月, 2019 1 次提交
  15. 16 9月, 2019 1 次提交
  16. 05 9月, 2019 2 次提交
  17. 21 3月, 2019 2 次提交
  18. 19 3月, 2019 1 次提交
  19. 15 3月, 2019 1 次提交
  20. 06 9月, 2018 1 次提交
  21. 20 8月, 2018 1 次提交
  22. 11 7月, 2018 1 次提交
  23. 10 7月, 2018 2 次提交
  24. 09 7月, 2018 1 次提交
  25. 03 7月, 2018 2 次提交
  26. 02 7月, 2018 1 次提交
  27. 29 6月, 2018 1 次提交