1. 01 Apr, 2021 7 commits
    • add custom init grad for backward function (#31540) · 83b953f5
      chentianyu03 committed
      * add custom init grad for backward function
      
      * add custom init grad for backward function
      
      * handle when the grad_tensor is none
      
      * handle when the grad_tensor is none
      
      * fix the args type error on windows platform
      
      * modify the args order and doc
      
      * format code
      
      * add grad_tensor to xpu
      
      * modify the grad_tensor type check
      
      * add paddle.backward api to support multi tensors gradient compute
      
      * add paddle.backward api to support multi tensors gradient compute
      
      * add paddle.autograd module and backward api
      
      * change tensor.backward func args
      
      * modify tensor backward api
      
      * remove create_graph inputs args

      * add doc and example code for backward api
      
      * when have the same tensor, throw error
      
      * modify test Init func args
      
      * modify the execute.Init func args in test files
      
      * add paddle.autograd package in setup.py.in
      
      * modify error msg, remove _run_backward method in class Tensor
      
      * add test cases for backward api
    • LOG CLEAN (#31819) · 0589ed21
      tangwei12 committed
      * upgrade vlog
      
      * train from dataset fetch optimize
    • [Paddle-TRT] add anchor generator op plugin (#31730) · b807e408
      zlsh80826 committed
      * add anchor generator op plugin
      
      * add anchor generator unit_test
      
      * remove dbg info
      
      * remove redundant line
      
      * replace assertion with paddle enforce
      
      * dynamic plugin replaces assertion with paddle enforce
      
      * anchor generator support dynamic shape on spatial axis
      
      * anchor generator test with fp16, dynamic shape
      
      * add anchor generator test all
      
      * add back main
      
      * reduce test input size to not exceed the timelimit of ci
      
      * change super to InferencePassTest for python2 compatibility
      
      * reuse paddle operator anchor generator
      
      * move creator construct to header with default
      
      * add cuda ifdef
      
      * reduce line
      
      * change super to InferencePassTest for python2 compatibility
      
      * fix anchor generator fp16 serialize setting
      
      * split unittest from test_all
      
      * restrict anchor generator input format before version 7234
      
      * anchor generator only support greater than trt7.1
      
      * change min_graph_size to 2
      
      * min_graph size to 3 if dynamic shape
      
      * reduce dynamic shape size to avoid trt search tactic too long to exceed time limit
      
      * remove anchor from fetch list
      
      * anchor generator support all trt version
      
      * fix memory not allocated but if serialized
    • Support uint8_t for fill_constant_op (#31911) · 980227f9
      Zhang Zheng committed
    • new group (#31682) · 07741593
      kuizhiqing committed
      * new group
      
      * ci compatible fix
      
      * assert nccl
    • Refactor and simplify hook design & add Tensor.register_hook API (#31775) · dbeb3ea4
      Chen Weihang committed
      * refactor and simplify hook design
      
      * fix reducer add hook error
      
      * add Tensor.register_hook basic impl
      
      * refine prepare data impl
      
      * revert prepare data change
      
      * support register_hook for Tensor
      
      * add hook test in model
      
      * polish tests and doc example
      
      * fix double grad test failed
      
      * remove reduce hook func
      
      * fix set empty error
      
      * polish code by comments
      
      * change reduce_hook to mutable_hook
      
      * remove useless tmp_ins
      
      * fix shape code format error
      
      * fix shape code format error
    • fix en doc for emb (#31980) · 6b744866
      tangwei12 committed
      * fix en doc for emb, test=document_fix;
      Change-Id: I4757e67caacd7189f068493ed45a7445f87ffb40
  2. 31 Mar, 2021 7 commits
    • Adjust pipeline optimizer for 3d parallelism (#31939) · 695dd371
      lilong12 committed
      * update, test=develop
    • Polish tensor pipeline (#31701) · e973bd73
      Kaipeng Deng committed
      * polish tensor pipeline. test=develop
    • Update eigen version to f612df27 (#31832) · 495e7f9c
      wuhuanzhou committed
      * update eigen version to f612df27, test=develop
      
      * fix compilation error, test=develop
      
      * remove patch command in eigen, test=develop
      
      * fix compilation error caused by call Eigen function with float16 and bfloat16, test=develop
      
      * fix unittest error, test=develop
      
      * fix unittest error caused by precision, test=develop
      
      * remove patch files used by old version eigen, test=develop
    • fix whl package push pypi (#31585) · b09c1ce0
      tianshuo78520a committed
      * fix whl package push pypi
      
      * add rst
    • fix some bug in transformer training in xpu (#31918) · 52b05bac
      taixiurong committed
    • support minus-int idx to LayerList (#31750) · 5394194e
      Wenyu committed
      * support minus-int idx to LayerList
      * update layerlist test
    • [ROCM] Add ROCm support for warpctc op (#31817) · ef8323d4
      furnace committed
      * bugfix for warpctc
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * fix WARPCTC_WITH_HIP invalid
      
      * Add logs to find out why can not dlopen libwarpctc.so
      
      * fix warpctc commit id
      
      * fix unit test test_warpctc_op
      
      * Optimize failed log for dlopen

      * Optimize failed log for dlopen
      
      * Delete extra changes
      
      * fix warpctc commit id
      
      * fix warpctc commit id
      
      * Add is_compiled_with_rocm for test_warpctc_op
      
      * fix warpctc commit id
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed
      
      * fix code style problems
  3. 30 Mar, 2021 10 commits
  4. 29 Mar, 2021 6 commits
    • modify API nn.Bilinear's doc (#31889) · b48841ba
      wanghuancoder committed
      * modify API nn.Bilinear's doc, test=develop
      
      * modify API nn.Bilinear's doc, test=develop
    • 525c32e3
    • 123949eb
    • [CustomOP] Add shape related constructor for Tensor (#31681) · 51eb29de
      Jiabin Yang committed
      * give shape related constructor and reshape warning
      
      * change line num to fit ut
      
      * change ut to fit
      
      * remove useless code
      
      * call resize directly in constructor
    • [Paddle-TRT] roi_align_plugin (#31732) · e3a38d79
      zlsh80826 committed
      * add roi_align_plugin
      
      * add roi align unit_test
      
      * add roi align serialization
      
      * remove roi align static plugin because of batch dim issue
      
      * refine roi align unittest and add fp16/serialization
      
      * add trt roi align condition to op_teller
      
      * refine error message
      
      * remove unnecessary reshape layer
    • [Paddle-TRT] trt affine channel converter (#31628) · bfb5cf55
      zlsh80826 committed
      * trt affine channel converter
      
      * add trt affine channel base test
      
      * add trt affine channel NHWC
      
      * remove asterisk for python2 compatibility
      
      * trt affine channel converter
      
      * add trt affine channel base test
      
      * add trt affine channel NHWC
      
      * remove asterisk for python2 compatibility
      
      * fix rebase
      
      * move LodTensor to Tensor
      
      * add dbg info
      
      * affine channel converter only support NCHW
      
      * scale,bias are parameters, use create_parameters api
      
      * reduce test input size to not exceed the timelimit of ci
      
      * refine affine channel unittest and add serialization/dynamic test
      
      * change super to InferencePassTest for python2 compatibility
      
      * change super to InferencePassTest for python2 compatibility
      
      * fix affine channel fp16 serialize setting
  5. 26 Mar, 2021 3 commits
  6. 25 Mar, 2021 3 commits
  7. 24 Mar, 2021 4 commits