1. 31 3月, 2021 4 次提交
    • W
      Update eigen version to f612df27 (#31832) · 495e7f9c
      wuhuanzhou 提交于
      * update eigen version to f612df27, test=develop
      
      * fix compilation error, test=develop
      
      * remove patch command in eigen, test=develop
      
      * fix compilation error caused by call Eigen function with float16 and bfloat16, test=develop
      
      * fix unittest error, test=develop
      
      * fix unittest error caused by precision, test=develop
      
      * remove patch files used by old version eigen, test=develop
      495e7f9c
    • T
      fix some bug in transformer training in xpu (#31918) · 52b05bac
      taixiurong 提交于
      52b05bac
    • W
      support minus-int idx to LayerList (#31750) · 5394194e
      Wenyu 提交于
      * support minus-int idx to LayerList
      * update layerlist test
      5394194e
    • F
      [ROCM] Add ROCm support for warpctc op (#31817) · ef8323d4
      furnace 提交于
      * bugfix for warpctc
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix WARPCTC_WITH_HIP invalid
      
      * Add logs to find out why can not dlopen libwarpctc.so
      
      * fix warpctc commit id
      
      * fix unit test test_warpctc_op
      
      * Optime failed log for dlopen
      
      * Optime failed log for dlopen
      
      * Delete extra changes
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * Add is_compiled_with_rocm for test_warpctc_op
      
      * fix warpctc commit id
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * fix code style problems
      ef8323d4
  2. 30 3月, 2021 8 次提交
    • L
    • J
      Added int8 kernel for oneDNN LSTM op (#31894) · 6dca7a1d
      jakpiase 提交于
      6dca7a1d
    • Z
      fix bug when dtype of to_tensor is core.VarType (#31931) · 245252b8
      Zhou Wei 提交于
      245252b8
    • W
      fe284868
    • C
      add deprecated for softmax_with_cross_entropy (#31722) · 73a6fa3e
      chajchaj 提交于
      * add deprecated for softmax_with_cross_entropy, test=develop
      
      * test for deprecated in english doc, test=develop
      
      * test deprecated for softmax_with_cross_entropy in english doc, test=develop
      
      * fix readme and English doc for cross_entropy, test=develop
      
      * rm test for softmax_with_cross_entropy deprecated, test=develop
      
      * update readme for CrossEntropyLoss, test=develop
      
      * fix readme format, test=develop
      
      * fix readme format, test=develop
      
      * fix readme format for cross_entropy, test=develop
      
      * add softmax_switch and fix softlabel for cross_entropy, test=develop
      
      * 1)recovery softmax_with_cross_entropy in fluid 2) change softmax_switch to use_softmax 3) add example for softlabel for cross_entropy, test=develop
      
      * fix Example number for cross_entropy, test=develop
      
      * fix code format, test=develop
      
      * fix for CI-Coverage, test=develop
      
      * fix for CI-Coverage, test=develop
      
      * fix ci-coverage for Non-ASCII character '\xe2' in file, test=develop
      
      * fix ci-coverage for Non-ASCII character '\xe2' in nn.layer.loss.py, test=develop
      
      * update description for doc when use_softmax=Fasle, test=develop
      
      * fix some docs and code example for cross_entropy, test=develop
      
      * delete redundant description for soft_label parameter of cross_entropy, test=develop
      
      * fix some comment for test_cross_entropy_loss.py, test=develop
      73a6fa3e
    • S
      fix batchnorm when inpu dims < 3 (#31933) · 8084b759
      Shang Zhizhou 提交于
      * fix batchnorm when inpu dims < 3
      
      * add unittest for batchnorm dims = 2
      8084b759
    • Z
      [Paddle-TRT] yolobox (#31755) · 64ee255f
      zlsh80826 提交于
      * yolobox converter and plugin
      
      * yolobox unittest
      
      * add dynamic shape restriction
      
      * fix git merge log
      64ee255f
    • A
      Fix segment Fault from set_value (#31891) · c4b60efa
      Aurelius84 提交于
      * Avoid raising warning while import paddle
      
      * fix segment fault of set_value
      
      * fix code style
      c4b60efa
  3. 29 3月, 2021 4 次提交
    • L
      525c32e3
    • R
      123949eb
    • Z
      [Paddle-TRT] roi_align_plugin (#31732) · e3a38d79
      zlsh80826 提交于
      * add roi_align_plugin
      
      * add roi align unit_test
      
      * add roi align serialization
      
      * remove roi align static plugin because of batch dim issue
      
      * refine roi align unittest and add fp16/serialization
      
      * add trt roi align condition to op_teller
      
      * refine error message
      
      * remove unnecessary reshape layer
      e3a38d79
    • Z
      [Paddle-TRT] trt affine channel converter (#31628) · bfb5cf55
      zlsh80826 提交于
      * trt affine channel converter
      
      * add trt affine channel base test
      
      * add trt affine channel NHWC
      
      * remove asterisk for python2 compatibility
      
      * trt affine channel converter
      
      * add trt affine channel base test
      
      * add trt affine channel NHWC
      
      * remove asterisk for python2 compatibility
      
      * fix rebase
      
      * move LodTensor to Tensor
      
      * add dbg info
      
      * affine channel converter only support NCHW
      
      * scale,bias are parameters, use create_parameters api
      
      * reduce test input size to not exceed the timelimit of ci
      
      * refine affine channel unittest and add serialization/dynamic test
      
      * change super to InferencePassTest for python2 compatibility
      
      * change super to InferencePassTest for python2 compatibility
      
      * fix affine channel fp16 serialize setting
      bfb5cf55
  4. 26 3月, 2021 3 次提交
  5. 25 3月, 2021 2 次提交
  6. 24 3月, 2021 2 次提交
  7. 23 3月, 2021 2 次提交
  8. 22 3月, 2021 3 次提交
    • Z
      [Paddle-TRT] nearest_interp op (#31626) · bfced39e
      zlsh80826 提交于
      * nearest_interp op converter w/ dynamic/static
      
      * fix data_layout include
      
      * add trt nearest unit_test
      
      * add nearest_interp NHWC test
      
      * update trt nearest interp nhwc testcase
      
      * remove asterisk for python2 compatibility
      
      * add empty line to prevent conflict
      
      * nearest_interp op converter w/ dynamic/static
      
      * fix data_layout include
      
      * add trt nearest unit_test
      
      * add nearest_interp NHWC test
      
      * update trt nearest interp nhwc testcase
      
      * remove asterisk for python2 compatibility
      
      * add empty line to prevent conflict
      
      * change the priority of out_h, out_w
      bfced39e
    • A
      [oneDNN] Initial bf16 amp integration (#31093) · 7ccf6b60
      arlesniak 提交于
      7ccf6b60
    • L
      [3D-parallel] add 1f1b scheduler for pipeline (#31566) · a501a7b0
      lilong12 提交于
      * add 1f1b scheduler for pp, test=develop
      a501a7b0
  9. 21 3月, 2021 1 次提交
  10. 19 3月, 2021 4 次提交
  11. 18 3月, 2021 2 次提交
  12. 17 3月, 2021 1 次提交
  13. 16 3月, 2021 2 次提交
  14. 15 3月, 2021 1 次提交
  15. 12 3月, 2021 1 次提交