1. 01 3月, 2022 1 次提交
    • Z
      [bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332
      zhangbo9674 提交于
      * add scale gather sum
      
      * refine CUDA_ATOMIC_WRAPPER ADD for bf16
      
      * add gather unittest
      
      * solve conflict
      
      * add scale uinttest
      
      * add sum unittest
      
      * solve conflict
      
      * refine gather unittest
      
      * refine unittest
      6d26b332
  2. 22 12月, 2021 1 次提交
  3. 16 8月, 2021 1 次提交
  4. 21 6月, 2021 1 次提交
    • L
      Add AXPY oneDNN handler (#33632) · 773aabc7
      lidanqing 提交于
      * Add oneDNN AXPY handler.
      
      * Add fallback for small tensors.
      
      * Fix ifdefs
      
      * Remove unnecessary namespace prefixes and add missing headers.
      
      * Guard handler_axpy with proper ifdefs.
      
      * Compilation of this function is possible only when Paddle is not build
      with CUDA nor HIP.
      
      * Move AXPY handler code to separate files.
      
      * Use oneDNN AXPY handler in SGD op.
      
      * Use axpy handler only when Paddle is built with oneDNN.
      
      * Add test for SUM BF16 with big rows.
      
      * Fix SFINAE rules for elementwise_add_to.
      
      * Add test case for SGD with big rows.
      
      * update
      
      * update
      Co-authored-by: NAdam Osewski <adam.osewski@intel.com>
      773aabc7
  5. 06 5月, 2021 1 次提交
  6. 13 10月, 2020 1 次提交
    • S
      Update api 2.0 for some ops · 92b3a717
      Steffy-zxf 提交于
      * 1. remove paddle.unique_with_counts api, which counts as unique api
      2. add paddle.math.increment(x, value=1.0, name=None) api
      3. replace paddle.sums with paddle.add_n api
      4. update paddle.metric.accuracy api (add name parameter)
      92b3a717
  7. 15 5月, 2020 1 次提交
  8. 12 4月, 2020 1 次提交
  9. 05 4月, 2020 1 次提交
  10. 20 12月, 2019 1 次提交
  11. 12 12月, 2019 2 次提交
    • Z
      fix input shape of op tests (#21682) · 4c987a60
      zhupengyang 提交于
      * fix input shape of op tests for elementwise_sub,
      gather, pad2d, transpose, softmax, scale,
      elementwise_max, hierarchical_sigmoid, reshape2,
      sign, squeeze, reduce_sum, sum, squeeze2,
      unsqueeze, unsqueeze2, cast, reverse
      
      test=develop
      
      * fix cast, elementwise_mul, gather, scale, sign,
      softmax, transpose
      
      test=develop
      4c987a60
    • J
      Change several tests to inherit the right parent class, test=develop (#21652) · f64d0066
      juncaipeng 提交于
      * change several tests to use the right parent class, test=develop
      
      * add dtype for TestLoDTensorAndSelectedRowsOp, test=develop
      f64d0066
  12. 08 11月, 2018 1 次提交
    • C
      Fix input<tensor> (#14208) · c5b6573a
      chengduo 提交于
      * fix input<tensor>
      test=develop
      
      * fix split_ids
      test=develop
      
      * ElementwiseMul should not support SelectedRows
      
      * fix scale op
      test=develop
      
      * change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
      
      * fix operator
      
      * refine MultiOutput
      
      * fix MultiOutput
      test=develop
      
      * disable test_dist_save_load
      test=develop
      
      * fix elementwise_op
      test=develop
      
      * add get_sparse_as_op
      test=develop
      
      * add info for check
      test=develop
      
      * rename get_sparse_as_op with extract_rows_as_op.
      test=develop
      
      * elementwise doesn't support selected_rows
      
      * fix regularizer
      
      * remove extract_rows_as
      test=develop
      
      * fix ci
      test=develop
      
      * add test for sum_op
      
      * fix regularizer
      test=develop
      
      *  test=develop
      
      * fix pserver weight decay multi inputs test=develop
      c5b6573a
  13. 07 11月, 2018 1 次提交
    • C
      Add fp16 backward support (#14202) · a9b5d42d
      chengduo 提交于
      * add fp16 backward support
      test=develop
      
      * add sum_op fp16 test
      
      * disable test_dist_save_load
      test=develop
      
      * add check_grad for sum
      
      * add unit test for softmax_grad fp16
      test=develop
      
      * add scale_op unit test
      
      * add mul_grad_op unit test for fp16
      
      * add cross_entropy_grad and eman_grad unit test for fp16
      test=develop
      
      * fix cross_entropy unit test
      
      * add pool2d fp16 unit test
      
      * refine conv2d fp16 unit test
      test=develop
      
      * refine activation unit test
      test=develop
      
      * fix ci
      test=develop
      
      * follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
      test=develop
      a9b5d42d
  14. 27 10月, 2018 1 次提交
  15. 25 10月, 2018 1 次提交
  16. 17 10月, 2018 1 次提交
  17. 08 10月, 2018 1 次提交
  18. 16 8月, 2018 1 次提交
  19. 15 8月, 2018 1 次提交
  20. 01 8月, 2018 1 次提交
  21. 26 7月, 2018 2 次提交
  22. 21 6月, 2018 2 次提交
  23. 19 6月, 2018 1 次提交
  24. 24 2月, 2018 1 次提交
  25. 13 2月, 2018 1 次提交
    • X
      Run Python OP tests in a single Python process to improve test time. (#8362) · cde6241a
      Xin Pan 提交于
      Currently, our tests run with 2 GPUs, the init time is absurdly long:
      about 4s for each process.  Currently, we run each OP test on
      different processes. This PR:
      
      1. create cmake function py_test_modules which will generate the
      Makefile that runs a list of Python unittest module in a single Python
      process.
      
      2. move all "python unittest compatible" (e.g., used the unittest
      package, not just a regular python file). from fluid/tests to
      fluid/tests/unittests.
      
      3. cmake now will run all OP tests in fluid/tests/unittests in a
      single process, except the time-consuming tests, they are separated
      into different processes to utilize parallelism. Please make sure to
      use the unittest package if you put the python test file in
      fluid/tests/unittests
      
      4. remove all exit(0) from fluid/tests/unittests/*.py, exit(0) is used
      to disable unittest, we can not do it when running all tests in a
      single process since it will terminate the process without running the
      other tests. Instead, the test is disabled in
      fluid/tests/unittests/CMakeLists.txt. FIXME is added for each disabled
      item. Please disable the unittest from
      fluid/tests/unittests/CMakeLists.txt, instead of adding exit(0) to the
      Python file, for all Python file in fluid/tests/unittests/.
      
      5. add an option WITH_FAST_BUNDLE_TEST. When OFF, will run the unit
      tests in separate process so that they can be tested individually.
      cde6241a
  26. 12 2月, 2018 1 次提交
  27. 21 1月, 2018 1 次提交
    • D
      "fix decode bug" (#7711) · e983cc90
      dzhwinter 提交于
      * "fix decode bug"
      
      * "follow commnet"
      
      * "fix error"
      
      * "fix hook bug"
      
      * fix based comment
      
      * fix copyright
      
      * fix based on comment
      e983cc90
  28. 15 1月, 2018 1 次提交
    • D
      Feature/hooks (#7513) · b9b75377
      dzhwinter 提交于
      * add copyright hook
      
      * add copyright hook
      
      * refine copyright hook
      
      * "test copyright hook"
      
      * fix check style
      
      * fix ci
      b9b75377
  29. 14 11月, 2017 1 次提交
  30. 11 9月, 2017 2 次提交
  31. 08 9月, 2017 1 次提交
  32. 05 9月, 2017 1 次提交