1. 04 4月, 2022 1 次提交
    • H
      Add expand as sigmoid api (#41311) · fa250aa1
      hong 提交于
      * update epxand and sigmoid with cross entropy
      
      * skip expand as infrt check
      
      * fix sigmoid cross entropy bug
      
      * remove no grad set white list
      
      * remove no grad set
      
      * fix bug
      
      * fix sigmoid error
      
      * fix bug
      fa250aa1
  2. 03 4月, 2022 2 次提交
  3. 02 4月, 2022 1 次提交
  4. 01 4月, 2022 2 次提交
  5. 30 3月, 2022 2 次提交
  6. 27 3月, 2022 1 次提交
  7. 25 3月, 2022 2 次提交
    • J
      Refactor Dygraph Flags (#40786) · 3085d5e4
      Jiabin Yang 提交于
      * refactor eager flags
      
      * fix flags error when we switch from eager to dygraph
      
      * fix ci problem
      
      * fix ci
      
      * fix ci
      
      * merge develop and fix code style
      
      * merge develop and fix code style
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * merge develop
      3085d5e4
    • X
      [OpTest] Polish optest (#40879) · d43e8433
      xiongkun 提交于
      * 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic
      
      * add python api or close the check_eager=True
      
      * fix the compatibility
      
      * matmul
      
      * disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op
      
      * refine the logic of prepara_parameter logic
      
      * fix Tensor(gpu) 2 Scalar segment fault.
      
      * add multi-attribute. (test_unsqueeze_op); add python_sig_out for customizing op sig out
      
      * fix some bugs, support python_out_sig
      d43e8433
  8. 24 3月, 2022 1 次提交
    • X
      Polish optest: refine the optest parameter logic. support name, dtype, out,... · a8df3901
      xiongkun 提交于
      Polish optest: refine the optest parameter logic. support name, dtype, out, output in arbitrary position (#40824)
      
      * 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic
      
      * add python api or close the check_eager=True
      
      * fix the compatibility
      
      * matmul
      
      * disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op
      
      * refine the logic of prepara_parameter logic
      
      * fix Tensor(gpu) 2 Scalar segment fault.
      a8df3901
  9. 22 3月, 2022 1 次提交
    • X
      polish python api logic and add backward python api check (#40666) · c29f85b6
      xiongkun 提交于
      * 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic
      
      * add python api or close the check_eager=True
      
      * fix the compatibility
      
      * matmul
      
      * disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op
      c29f85b6
  10. 15 3月, 2022 1 次提交
  11. 09 3月, 2022 1 次提交
  12. 08 3月, 2022 1 次提交
  13. 07 3月, 2022 1 次提交
  14. 01 3月, 2022 2 次提交
    • Z
      [bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843) · ce8ed978
      zhangbo9674 提交于
      * add layer norm
      
      * add p norm
      
      * add reduce sum
      
      * refine layer norm register bf16 for cudnn811
      
      * add bf16 cast for hip
      
      * add unittest
      
      * refine rocm
      
      * refine layer_norm unittest
      
      * refine reduce op
      
      * refine unittest
      
      * enhance atol for reduce unittest
      ce8ed978
    • Z
      [bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332
      zhangbo9674 提交于
      * add scale gather sum
      
      * refine CUDA_ATOMIC_WRAPPER ADD for bf16
      
      * add gather unittest
      
      * solve conflict
      
      * add scale uinttest
      
      * add sum unittest
      
      * solve conflict
      
      * refine gather unittest
      
      * refine unittest
      6d26b332
  15. 24 2月, 2022 1 次提交
  16. 23 2月, 2022 1 次提交
  17. 21 2月, 2022 1 次提交
  18. 16 2月, 2022 1 次提交
    • L
      [bf16] pten matmul cuda kernel support bf16 (#39485) · d5a0d31a
      Leo Chen 提交于
      * pten matmul cuda kernel support bf16
      
      * fix pten kernel name
      
      * add matmul_grad bf16 kernel
      
      * add emptylike bf16 kernel
      
      * fix compile
      
      * suppport rocm
      
      * fix error
      
      * fix rocm
      
      * add bf16 header file
      
      * fix compile
      d5a0d31a
  19. 28 1月, 2022 1 次提交
  20. 24 1月, 2022 1 次提交
    • Z
      Refactored python-level trace_op to call through _C_ops instead of... · c3796061
      Zhanlue Yang 提交于
      Refactored python-level trace_op to call through _C_ops instead of Tracer::TraceOp, under eager_mode (#38338)
      
      * Replaced core.ops with _C_ops
      
      * Refactored python-level trace_op to call through _C_ops instead of Tracer::TraceOp, under eager_mode
      
      * Modified trace_op interface
      
      * Refactored trace_op logic for eager mode
      
      * Added Eager Dygraph support for OpTest
      
      * Fixed ci issues
      
      * Fixed CI failures
      
      * Fixed Coverage CI Issues
      
      * Fixed XPU CI Issues
      c3796061
  21. 14 1月, 2022 1 次提交
    • Q
      [MLU]Add mean and reduce_mean op (#38872) · 7f8d5bc8
      qipengh 提交于
      * [MLU]: add mean and reduce mean op
      
      * [MLU]add mlu pytest dir in CMakeLists.txt
      
      * [MLU]fix tensor data
      
      * [MLU]fix TensorToPyArray and license
      7f8d5bc8
  22. 31 12月, 2021 1 次提交
  23. 10 11月, 2021 1 次提交
  24. 09 10月, 2021 1 次提交
  25. 28 9月, 2021 1 次提交
    • L
      add API paddle.linalg.eig (#35674) · bc7e2b92
      Lijunhui 提交于
      * Add paddle.linalg.eig op
      
      * remove comments
      
      * remove comments
      
      * extend batch_size to the origin
      
      * add real times complex functor & destroy the backward complex output bug
      
      * terminate output diff when input real tensors
      
      * correct tiny doc errors
      
      * move functions from eig_helper to svd_helper and remove eig_helper
      
      * remove tensor.Resize
      
      * remove no longer used code
      
      * use existing lapack functions
      
      * reply review comments 21/27
      
      * remove .cu as this op is only executed on CPU
      
      * remove const_cast & add const in argument list for read-only references
      
      * fix sample code error in CI
      
      * remove template typename Tbase and more
      
      * remove eig exposure in paddle.*
      
      * add 'name=None' in eig python implementation
      
      * handle the unittest
      
      * try to solve the unittest
      
      * solve CI coverage
      
      * remove no longer used code
      
      * polish API doc and more
      
      * reply review comments
      
      * polish unittest, commit plan B
      
      * polish unittest
      bc7e2b92
  26. 18 9月, 2021 1 次提交
    • F
      Add new API "eigvals" in linalg (#35720) · d411a038
      From00 提交于
      * Add linalg.eigvals API
      
      * pre-commit check
      
      * Adjust code style
      
      * Fix conflict
      
      * Improve code style
      
      * Modify the test code to ignore testing CUDA kernel
      
      * Sort ouput data before checking in test code
      
      * Set timeout value for UT
      
      * Improve API example code to pass CI
      
      * Fix bug for None fetch_list in Windows
      
      * Delete grad Op
      d411a038
  27. 14 9月, 2021 1 次提交
  28. 08 9月, 2021 2 次提交
    • F
      merge CMakeList.txt manual (#35378) · c4a3e8b4
      feng_shuai 提交于
      * merge CMakeList.txt manual
      
      * add platform for changethreadnum
      
      * repair some bugs according to make error
      
      * do nothing just flush CI
      
      * forget change thread num
      
      * add inplace_atol param for check_output_with_place
      
      * Windows
      
      * std:min and std::max should be change because of windows
      c4a3e8b4
    • L
      [NPU] release gil before op run (#35370) · db6242e9
      Leo Chen 提交于
      * release gil before op run
      
      * support npu grad test
      
      * fix op_test
      db6242e9
  29. 14 7月, 2021 1 次提交
    • A
      Added OpTestTool for BF16 (#33977) · 171ed2cf
      arlesniak 提交于
      * Added OpTestTool for BF16 convenience
      
      * fixes after review, names changed to snake case.
      
      * fixes after review, naming reflects cpu.
      171ed2cf
  30. 07 7月, 2021 2 次提交
  31. 24 6月, 2021 1 次提交
  32. 02 6月, 2021 1 次提交
  33. 26 5月, 2021 1 次提交