提交 · 0e52cdfc02a9d1666b5b1b05fba941455c5f7015 · Crayon鑫 / Paddle

01 4月, 2021 10 次提交

Q

[ROCM] fix depthwise conv failure on ROCM, test=develop (#31998) · a4b30a12
由 Qi Li 提交于 4月 01, 2021

a4b30a12
S
Support control flow in DataParallel (#31625) · 8460698b
由 ShenLiang 提交于 4月 01, 2021
```
* support control flow

* supoort sync_parameters_buffers

* fix the bug of sparse embedding
```
8460698b

add custom init grad for backward function (#31540) · 83b953f5

由 chentianyu03 提交于 4月 01, 2021

* add custom init grad for backward function

* add custom init grad for backward function

* handle when the grad_tensor is none

* handle when the grad_tensor is none

* fix the args type error on windows platform

* modify the args order and doc

* format code

* add grad_tensor to xpu

* modify the grad_tensor type check

* add paddle.backward api to support multi tensors gradient compute

* add paddle.backward api to support multi tensors gradient compute

* add paddle.atuograd module and backward api

* change tensor.backward func args

* modify tensor backward api

* remove create_graph intputs args

* add doc and examplex code for backward api

* when have the same tensor, throw error

* modify test Init func args

* modify the execute.Init func args in test files

* add paddle.autograd package in setup.py.in

* modify error msg, remove _run_backward method in class Tensor

* add test cases for backward api

83b953f5

H

remove useless code (#32001) · 9c5d0286
由 hutuxian 提交于 4月 01, 2021

9c5d0286
T
LOG CLEAN (#31819) · 0589ed21
由 tangwei12 提交于 4月 01, 2021
```
* upgrade vlog

* train from dataset fetch optimize
```
0589ed21

[Paddle-TRT] add anchor generator op plugin (#31730) · b807e408

由 zlsh80826 提交于 4月 01, 2021

* add anchor generator op plugin

* add anchor generator unit_test

* remove dbg info

* remove redundant line

* replace assertion with paddle enforce

* dynamic plugin replaces assertion with paddle enforce

* anchor generator support dynamic shape on spatial axis

* anchor generator test with fp16, dynamic shape

* add anchor generator test all

* add back main

* reduce test input size to not exceed the timelimit of ci

* change super to InferencePassTest for python2 compatibility

* reuse paddle operator anchor generator

* move creator construct to header with default

* add cuda ifdef

* reduce line

* change super to InferencePassTest for python2 compatibility

* fix anchor generator fp16 serialize setting

* split unittest from test_all

* restrict anchor generator input format before version 7234

* anchor generator only support greater than trt7.1

* change min_graph_size to 2

* min_graph size to 3 if dynamic shape

* reduce dynamic shape size to avoid trt search tactic too long to exceed time limit

* remove anchor from fetch list

* anchor generator support all trt version

* fix memory not allocated but if serialized

b807e408

Z

Optimize the perf of SameDimsAdd CUDA Kernel (#31872) · 4acc87be
由 Zhang Zheng 提交于 4月 01, 2021

4acc87be
Z

Support uint8_t for fill_constant_op (#31911) · 980227f9
由 Zhang Zheng 提交于 4月 01, 2021

980227f9
K
new group (#31682) · 07741593
由 kuizhiqing 提交于 4月 01, 2021
```
* new group

* ci compatible fix

* assert nccl
```
07741593

Refactor and simplify hook design & add Tensor.register_hook API (#31775) · dbeb3ea4

由 Chen Weihang 提交于 3月 31, 2021

* refactor and simplify hook design

* fix reducer add hook error

* add Tensor.register_hook basic impl

* refine prepare data impl

* revert prepare data change

* support register_hook for Tensor

* add hook test in model

* polish tests and doc example

* fix double grad test failed

* remove reduce hook func

* fix set empty error

* polish code by comments

* change reduce_hook to mutable_hook

* remove useless tmp_ins

* fix shape code format error

* fix shape code format error

dbeb3ea4

31 3月, 2021 11 次提交

T
Delete legacy C++ training user-interface (#31949) · d5b5004b
由 tianshuo78520a 提交于 3月 31, 2021
```
* delete include framework.pb.h

* fix error

* delete fluid_train
```
d5b5004b

[Parallel UT]Improve Parallel UT level on Windows/Linux (#31377) · b05f6142

由 Zhou Wei 提交于 3月 31, 2021

* [Parallel UT]improve Parallel UT level on Windows/Linux

* [Parallel UT]improve Parallel UT level on Windows/Linux

* [Parallel UT]Improve Parallel UT level on Windows/Linux

* [Parallel UT]Improve Parallel UT level on Windows/Linux

* fix CI

b05f6142

L
Adjust pipeline optimizer for 3d parallelism (#31939) · 695dd371
由 lilong12 提交于 3月 31, 2021
```
* update, test=develop
```
695dd371

fix one error massage (#31904) · 6f85e241

由 Kqnonrime 提交于 3月 31, 2021

* fix one error massage

* fix a error message

* new fix three error messages

* new fix three error messages

* new fix some error

* new fix one error message

6f85e241

T

delete cuda9 code (#31883) · ea738dda
由 tianshuo78520a 提交于 3月 31, 2021

ea738dda
K
Polish tensor pipeline (#31701) · e973bd73
由 Kaipeng Deng 提交于 3月 31, 2021
```
* polish tensor pipeline. test=develop
```
e973bd73

Update eigen version to f612df27 (#31832) · 495e7f9c

由 wuhuanzhou 提交于 3月 31, 2021

* update eigen version to f612df27, test=develop

* fix compilation error, test=develop

* remove patch command in eigen, test=develop

* fix compilation error caused by call Eigen function with float16 and bfloat16, test=develop

* fix unittest error, test=develop

* fix unittest error caused by precision, test=develop

* remove patch files used by old version eigen, test=develop

495e7f9c

update compilation with C++14 (#31815) · 587d99ae

由 wuhuanzhou 提交于 3月 31, 2021

* update compilation with C++14, test=develop

* fix compilation error in eigen, test=develop

587d99ae

T
fix split core (#31892) · 393b3bd6
由 Thunderbrook 提交于 3月 31, 2021
```
* fix split core

* format
```
393b3bd6
T

fix some bug in transformer training in xpu (#31918) · 52b05bac
由 taixiurong 提交于 3月 31, 2021

52b05bac

[ROCM] Add ROCm support for warpctc op (#31817) · ef8323d4

由 furnace 提交于 3月 31, 2021

* bugfix for warpctc

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix warpctc commit id

* fix WARPCTC_WITH_HIP invalid

* Add logs to find out why can not dlopen libwarpctc.so

* fix warpctc commit id

* fix unit test test_warpctc_op

* Optime failed log for dlopen

* Optime failed log for dlopen

* Delete extra changes

* fix warpctc commit id

* fix warpctc commit id

* Add is_compiled_with_rocm for test_warpctc_op

* fix warpctc commit id

* Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed

* Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed

* Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed

* fix code style problems

ef8323d4

30 3月, 2021 10 次提交
- J
  
  fix stack op grad nullptr (#31962) · 95f808c8
  由 Jiawei Wang 提交于 3月 30, 2021
  
  95f808c8
- L
  
  [dynamic setitem] Fix bug of dynamic setitem: Decerease axes to do right broadcast (#31960) · 57d4288a
  由 liym27 提交于 3月 30, 2021
  
  57d4288a
- 石
  
  fix a syntax error, test=develop (#31930) · 0fa6c8a3
  由石晓伟提交于 3月 30, 2021
  
  0fa6c8a3
- P
  
  map_matmul_to_mul_pass support 3dim (#31958) · 98e803e0
  由 Pei Yang 提交于 3月 30, 2021
  
  98e803e0
- W
  
  modify CI recommend information (#31395) · a37a7f67
  由 wuhuanzhou 提交于 3月 30, 2021
  
  a37a7f67
- J
  
  Added int8 kernel for oneDNN LSTM op (#31894) · 6dca7a1d
  由 jakpiase 提交于 3月 30, 2021
  
  6dca7a1d
- P
  [Paddle-TRT] TRT inference support for BERT/Transformer in paddle 2.0 api (#31744) · 14b7e3cf
  由 Pei Yang 提交于 3月 30, 2021
```
* support multihead_matmul_fuse_pass_v3

* fix compile problems

* embedding_eltwise_ln pass support lookup_table_v2

* suppoort matmul and matmul_v2 in qkv matmul
```
  14b7e3cf
- Z
  [Custom OP]Remove old custom OP and reduce whl package volume (#31813) · 04a49b09
  由 Zhou Wei 提交于 3月 30, 2021
```
* Remove old custom OP to reduce whl package volume

* [Custom OP]Remove old custom OP to reduce whl package volume
```
  04a49b09
- S
  fix batchnorm when inpu dims < 3 (#31933) · 8084b759
  由 Shang Zhizhou 提交于 3月 30, 2021
```
* fix batchnorm when inpu dims < 3

* add unittest for batchnorm dims = 2
```
  8084b759
- Z
  [Paddle-TRT] yolobox (#31755) · 64ee255f
  由 zlsh80826 提交于 3月 30, 2021
```
* yolobox converter and plugin

* yolobox unittest

* add dynamic shape restriction

* fix git merge log
```
  64ee255f
29 3月, 2021 8 次提交

N

relu forward and backward with vectortype (#31869) · a71d72d9
由 niuliling123 提交于 3月 29, 2021

a71d72d9
T

Delete cudnn6 code (#31835) · 8829a309
由 tianshuo78520a 提交于 3月 29, 2021

8829a309
L

Fix bug of set_value op：Decerease axes to do right broadcast (#31875) · 525c32e3
由 liym27 提交于 3月 29, 2021

525c32e3
R

[ROCM] added a cudnn switch of conv2d for rocm platform (#31836) · 123949eb
由 ronnywang 提交于 3月 29, 2021

123949eb
S
fix cmake model path (#31866) · 61805d8f
由 Shang Zhizhou 提交于 3月 29, 2021
```
* fix cmake model path

* update cmake

* fix unittest

* fix unittest
```
61805d8f

[CustomOP] Add shape related constructor for Tensor (#31681) · 51eb29de

由 Jiabin Yang 提交于 3月 29, 2021

* give shape related contructor and reshape warning

* change line num to fit ut

* change ut to fit

* remove useless code

* call resize directly in constructor

51eb29de

[Paddle-TRT] roi_align_plugin (#31732) · e3a38d79

由 zlsh80826 提交于 3月 29, 2021

* add roi_align_plugin

* add roi align unit_test

* add roi align serialization

* remove roi align static plugin because of batch dim issue

* refine roi align unittest and add fp16/serialization

* add trt roi align condition to op_teller

* refine error message

* remove unnecessary reshape layer

e3a38d79

[Paddle-TRT] trt affine channel converter (#31628) · bfb5cf55

由 zlsh80826 提交于 3月 29, 2021

* trt affine channel converter

* add trt affine channel base test

* add trt affine channel NHWC

* remove asterisk for python2 compatibility

* trt affine channel converter

* add trt affine channel base test

* add trt affine channel NHWC

* remove asterisk for python2 compatibility

* fix rebase

* move LodTensor to Tensor

* add dbg info

* affine channel converter only support NCHW

* scale,bias are parameters, use create_parameters api

* reduce test input size to not exceed the timelimit of ci

* refine affine channel unittest and add serialization/dynamic test

* change super to InferencePassTest for python2 compatibility

* change super to InferencePassTest for python2 compatibility

* fix affine channel fp16 serialize setting

bfb5cf55

26 3月, 2021 1 次提交

[dygraph qat] Use layer to calculate output scale (#31861) · b47478ef

由 cc 提交于 3月 26, 2021

* Use layer to calculate output scale
* add backward for moving_average_abs_max_scale and save output scales to op's attr

b47478ef

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致