1. 16 6月, 2022 1 次提交
    • Q
      [Cherry-pick] Fix numpy 1.20+ deprecation warnings (#43513) · 689e0999
      Qi Li 提交于
      * Fix numpy 1.20+ deprecation warnings (#42929)
      
      * Replace np.bool/np.bool8 with np.bool_
      
      * Replace np.object with np.object_
      
      * Replace np.complex with np.complex128
      
      * Replace np.float with np.float64
      
      * Replace np.int with np.int_
      
      * Rerun pre-commit for newer pre-commit configuration
      
      * Use builtin bool instead of np.bool_ based on the context
      
      * fix mode dtype
      Co-authored-by: Nzlsh80826 <rewang@nvidia.com>
      689e0999
  2. 04 4月, 2022 1 次提交
  3. 03 4月, 2022 1 次提交
  4. 02 4月, 2022 2 次提交
  5. 01 3月, 2022 1 次提交
    • Z
      [bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843) · ce8ed978
      zhangbo9674 提交于
      * add layer norm
      
      * add p norm
      
      * add reduce sum
      
      * refine layer norm register bf16 for cudnn811
      
      * add bf16 cast for hip
      
      * add unittest
      
      * refine rocm
      
      * refine layer_norm unittest
      
      * refine reduce op
      
      * refine unittest
      
      * enhance atol for reduce unittest
      ce8ed978
  6. 08 2月, 2022 1 次提交
  7. 16 8月, 2021 1 次提交
  8. 05 8月, 2021 1 次提交
  9. 15 6月, 2021 1 次提交
    • J
      Support reduce_sum_op float16 (#32966) · 606939de
      jiangcheng 提交于
      * add reduce_sum_op by add self-kernel
      
      * set all ReduceKernel MPType for accuracy
      
      * add float16 test script which input is integer number
      
      * solve reduce sum float16 check_grad problem
      
      * solve conflict and change test script for CI
      
      * change kernel register for CI
      
      * remove all useless template
      606939de
  10. 09 3月, 2021 1 次提交
  11. 26 10月, 2020 1 次提交
    • C
      [cherry pick ] cherry pick 28108 28198 28199 from release2.0rc (#28215) · 99408718
      cnn 提交于
      * Release 2.0rc cherry pick api rename #28108 (#28184)
      
      * rename count_include_pad-->exclusive  return_indices-->return_mask
      
      * remove track_running_stats
      
      * fix typo.
      
      * rename xxxd-->xxxxD
      
      * solve conflicts
      
      * 2.0rc api add all any (#28199)
      
      * reduce trt warning message (#28011)
      
      add paddle.enable_static() on sample code
      
      alias recude_all-->all, reduce_any-->any
      
      add import reduce_all and reduce_any in python/paddle/tensor/math.py
      
      import all and any in python/paddle/tensor/__init__.py
      
      remove all and any OP in python/paddle/tensor/logic.py, add all and any OP in python/paddle/tensor/math.py
      
      fix import error
      
      remove TestAllAPI temporary
      
      * fix doc of recdue_all and reduce_any, test=document_fix
      
      * fix typo
      
      * fix unittest for all and any API
      Co-authored-by: NPei Yang <peiyang@baidu.com>
      
      * rename conv_transposeXd-->convXd_transpose (#28198)
      
      * fix sample code of reduce_all and reduce_any
      Co-authored-by: NPei Yang <peiyang@baidu.com>
      99408718
  12. 17 9月, 2020 1 次提交
  13. 04 9月, 2020 1 次提交
  14. 25 8月, 2020 1 次提交
  15. 19 8月, 2020 1 次提交
  16. 12 8月, 2020 2 次提交
  17. 09 6月, 2020 1 次提交
  18. 17 4月, 2020 1 次提交
  19. 14 4月, 2020 1 次提交
  20. 05 4月, 2020 1 次提交
  21. 04 1月, 2020 2 次提交
  22. 12 12月, 2019 1 次提交
    • Z
      fix input shape of op tests (#21682) · 4c987a60
      zhupengyang 提交于
      * fix input shape of op tests for elementwise_sub,
      gather, pad2d, transpose, softmax, scale,
      elementwise_max, hierarchical_sigmoid, reshape2,
      sign, squeeze, reduce_sum, sum, squeeze2,
      unsqueeze, unsqueeze2, cast, reverse
      
      test=develop
      
      * fix cast, elementwise_mul, gather, scale, sign,
      softmax, transpose
      
      test=develop
      4c987a60
  23. 04 12月, 2019 1 次提交
  24. 03 10月, 2019 1 次提交
  25. 26 9月, 2019 1 次提交
  26. 25 3月, 2019 1 次提交
    • Z
      test=develop · 664c342c
      zhoukunsheng 提交于
      split reduce_all_any_op.h into two files
      add unit test for reduce_all, reduce_any
      664c342c
  27. 09 10月, 2018 1 次提交
  28. 15 8月, 2018 1 次提交
  29. 26 7月, 2018 2 次提交
  30. 22 7月, 2018 2 次提交
  31. 23 5月, 2018 1 次提交
    • W
      Enhance reduce op (#10708) · 8655904b
      whs 提交于
      * Enhance reduce op for multi dims.
      
      * Uncomment some unitest.
      
      * Uncomment unitest.
      
      * Remove unused code.
      
      * Fix infershape and python wrapper.
      
      * Add more examples.
      
      * Fix l2_normalize.
      
      * Fix normalization_wrapper.
      
      * Polish code.
      1. Rename unitest function.
      2. Rename const variable.
      8655904b
  32. 07 3月, 2018 1 次提交
  33. 24 2月, 2018 1 次提交
  34. 13 2月, 2018 1 次提交
    • X
      Run Python OP tests in a single Python process to improve test time. (#8362) · cde6241a
      Xin Pan 提交于
      Currently, our tests run with 2 GPUs, the init time is absurdly long:
      about 4s for each process.  Currently, we run each OP test on
      different processes. This PR:
      
      1. create cmake function py_test_modules which will generate the
      Makefile that runs a list of Python unittest module in a single Python
      process.
      
      2. move all "python unittest compatible" (e.g., used the unittest
      package, not just a regular python file). from fluid/tests to
      fluid/tests/unittests.
      
      3. cmake now will run all OP tests in fluid/tests/unittests in a
      single process, except the time-consuming tests, they are separated
      into different processes to utilize parallelism. Please make sure to
      use the unittest package if you put the python test file in
      fluid/tests/unittests
      
      4. remove all exit(0) from fluid/tests/unittests/*.py, exit(0) is used
      to disable unittest, we can not do it when running all tests in a
      single process since it will terminate the process without running the
      other tests. Instead, the test is disabled in
      fluid/tests/unittests/CMakeLists.txt. FIXME is added for each disabled
      item. Please disable the unittest from
      fluid/tests/unittests/CMakeLists.txt, instead of adding exit(0) to the
      Python file, for all Python file in fluid/tests/unittests/.
      
      5. add an option WITH_FAST_BUNDLE_TEST. When OFF, will run the unit
      tests in separate process so that they can be tested individually.
      cde6241a
  35. 12 2月, 2018 1 次提交