Commits · a60d93fb77a055540fe239d97055975ba7dc8e2f · PaddlePaddle / Paddle

23 Feb, 2021 1 commit
- [ROCM] update fluid framework for rocm (part2), test=develop (#31010) · a60d93fb
  Authored by Qi Li on Feb 23, 2021
  
  a60d93fb
22 Feb, 2021 11 commits

support save multi sparse table in one path (#31108) · 565354f6
Authored by Thunderbrook on Feb 22, 2021
```
* save multi table one path

* format
```
565354f6
[ROCM] update fluid framework for rocm (part3), test=develop (#31011) · 50967135
Authored by Qi Li on Feb 22, 2021

50967135

[Dy2stat] Refactoring tensor_shape_transformer.py to Fix Change after Assign Bug (#31082) · cf43a321

Authored by Huihuang Zheng on Feb 22, 2021

**Problem**
In our old shape transformer logic, if the user writes:
```
s = tensor.shape
...
y = paddle.some_api(s)
```
Dy2stat will change it to
```
...
y = paddle.some_api(convert_var_shape(tensor))
```
However, this causes a fatal bug if the user changes the shape of `tensor` after the assignment. For example:
```
s = tensor.shape
...
tensor = paddle.some_change_shape_api(tensor)
...
y = paddle.some_api(s)
```
Dy2stat then produces a wrong result because the code is translated into:
```
tensor = paddle.some_change_shape_api(tensor)
...
y = paddle.some_api(convert_var_shape(tensor)) # the tensor's shape has changed; this is no longer the original value of `s`
```
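The failure mode can be reproduced without Paddle. The following is a minimal sketch in plain Python: `FakeTensor` and `flatten` are hypothetical stand-ins for a tensor and a shape-changing API, and `old_translation` only imitates what the old transformer is described as doing (re-reading the shape at the use site).

```python
class FakeTensor:
    """Hypothetical stand-in for a tensor; it only carries a shape."""

    def __init__(self, shape):
        self.shape = list(shape)


def flatten(tensor):
    """Hypothetical shape-changing op: collapses all dims into one."""
    size = 1
    for dim in tensor.shape:
        size *= dim
    return FakeTensor([size])


def original_code(tensor):
    s = tensor.shape            # shape captured before the change
    tensor = flatten(tensor)
    return s                    # still the original shape


def old_translation(tensor):
    # Imitates the old rewrite: `s` is dropped and the shape is
    # re-read from `tensor` at the point where `s` was used.
    tensor = flatten(tensor)
    return tensor.shape         # shape read after the change


before = original_code(FakeTensor([2, 3]))
after = old_translation(FakeTensor([2, 3]))
print(before, after)
```

The two functions disagree as soon as `tensor` is rebound between the assignment and the use, which is the change-after-assign case described above.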

**Solution Logic**

This cannot be solved within the old logic, so I refactored the tensor_shape_transformer logic. Now we use `s` to store the shape attribute and generate a variable `s__STATIC_CONVERT_VAR_SHAPE_SUFFIX` to store the result of the static shape API `shape(tensor)`. Given:
```
s = tensor.shape
...
y = paddle.some_api(s)
```
Dy2stat will change it to
```
s = tensor.shape
s__STATIC_CONVERT_VAR_SHAPE_SUFFIX = shape(tensor)
...
y = paddle.some_api(choose_shape_attr_or_api(s, s__STATIC_CONVERT_VAR_SHAPE_SUFFIX))
```
In this case, the code is consistent with the original dygraph meaning, and the change-after-assign bug is fixed.
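A rough sketch of why the refactored translation keeps the original meaning is given below, again in plain Python. Everything here is hypothetical: `FakeTensor`, `shape_api`, and `flatten` are stand-ins, and `pick_shape` uses an assumed selection rule (prefer the attribute when every dimension is known, otherwise use the API result). It is not the implementation of `choose_shape_attr_or_api`, which this commit message does not show.

```python
class FakeTensor:
    """Hypothetical tensor stand-in with a static and a runtime shape."""

    def __init__(self, static_shape, runtime_shape):
        self.shape = list(static_shape)          # may contain -1 for unknown dims
        self.runtime_shape = list(runtime_shape)


def shape_api(tensor):
    """Stands in for the static `shape(tensor)` API call."""
    return list(tensor.runtime_shape)


def pick_shape(attr_shape, api_shape):
    """Assumed rule: use the attribute unless a dimension is unknown (-1)."""
    if all(dim >= 0 for dim in attr_shape):
        return attr_shape
    return api_shape


def flatten(tensor):
    """Hypothetical shape-changing op."""
    size = 1
    for dim in tensor.runtime_shape:
        size *= dim
    return FakeTensor([size], [size])


def new_translation(tensor):
    s = tensor.shape
    s_static = shape_api(tensor)   # plays the role of s__STATIC_CONVERT_VAR_SHAPE_SUFFIX
    tensor = flatten(tensor)       # a later shape change no longer affects s or s_static
    return pick_shape(s, s_static)


known = new_translation(FakeTensor([2, 3], [2, 3]))
unknown = new_translation(FakeTensor([-1, 3], [2, 3]))
print(known, unknown)
```

Because both values are captured at the assignment, rebinding `tensor` afterwards cannot change what the use site sees.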

**Code Key Note**

To help reviewers: the key change of this PR is changing `self.name_to_var_shape` from "mapping a name to its shape node" to "mapping a name to its STATIC_CONVERT_VAR_SHAPE_SUFFIX name". If a variable name has the SUFFIX, we can then choose between the shape attribute and the shape API. The other changes follow from this key change.

**Consideration**
The concern with this PR is that we store an extra static `shape` API result. Will it harm the speed of Dy2stat? In some cases it will, but we argue that the benefit outweighs the cost.

1. The extra calls to the static `shape` API happen when the coder assigns between shape variables. Take the following dygraph code as an example:
```
s1 = tensor.shape
s2 = s1
s3 = s2
...
```
Here we call the extra static `shape` API again and again; however, users seldom write code like this.

2. If the shape variable is used a lot, for example:
```
s = tensor.shape
y1 = paddle.some_api1(s)
y2 = paddle.some_api2(s)
y3 = paddle.some_api3(s)
```
Our old logic creates 3 shape API calls, but the new logic creates just 1. This is the more common user code pattern. In fact, if reviewers look at the current unit tests in this PR, they can see that the op count decreases after this PR. So we argue that this PR can also improve speed for this code pattern.
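The trade-off in the second case can be illustrated with a small call counter. This is only a sketch of the counting argument: `shape_api` is a hypothetical stand-in for the static `shape` API, and the two functions model "one call per use site" (old logic) against "one call at the assignment" (new logic), as described above.

```python
calls = {"shape_api": 0}


def shape_api(tensor_shape):
    """Hypothetical stand-in for the static `shape` API; counts its calls."""
    calls["shape_api"] += 1
    return list(tensor_shape)


def old_logic(tensor_shape, n_uses):
    """Old behaviour as described: one shape API call per use of `s`."""
    calls["shape_api"] = 0
    for _ in range(n_uses):
        shape_api(tensor_shape)
    return calls["shape_api"]


def new_logic(tensor_shape, n_uses):
    """New behaviour as described: one call at the assignment, then reuse."""
    calls["shape_api"] = 0
    s_static = shape_api(tensor_shape)
    results = [s_static for _ in range(n_uses)]  # every use reads the stored value
    return calls["shape_api"]


old_count = old_logic([2, 3], n_uses=3)
new_count = new_logic([2, 3], n_uses=3)
print(old_count, new_count)
```

With three uses of `s`, the old model issues three calls and the new model issues one, matching the numbers quoted in the text.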

cf43a321

fix dist fleet ctr ut (#31087) · 0e4b1542

Authored by tangwei12 on Feb 22, 2021

* fix dist fleet ctr ut

Change-Id: I59bf5123c7bd47bd0e8f1ca2a26295257597c0f5

* fix dist fleet ctr ut

Change-Id: Iafcdd172364be47fe67b753774ce09af050bcbce

* Update CMakeLists.txt

0e4b1542

[ROCM] update fluid framework for rocm (part1), test=develop (#31009) · 8fe09faf
Authored by Qi Li on Feb 22, 2021

8fe09faf
[ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) · 33429630
Authored by Qi Li on Feb 22, 2021

33429630
update trt int8 calibrator to IEntropyCalibratorV2 (#31060) · a5c56d83
Authored by Shang Zhizhou on Feb 22, 2021
```
* update trt int8 calibrator to IEntropyCalibratorV2

* add delele opt_cache for trt_split_converter_test
```
a5c56d83

[2.0Custom OP]Support New Custom OP on Windows (#31063) · adaec007

Authored by Zhou Wei on Feb 22, 2021

* [2.0.1]Support New Custom OP on windows

* fix CI

* fix code style

* fix CI

* fix CI

* fix coverage

* fix CI

* fix CI

adaec007

add optional for param attr args, test=document_fix (#31105) · 2168f08a
Authored by Chen Weihang on Feb 22, 2021

2168f08a

[ROCM] update fluid imperative for rocm (part1), test=develop (#31017) · 1d996637

Authored by Qi Li on Feb 22, 2021

* [ROCM] update fluid imperative for rocm (part1), test=develop

* [ROCM] update reducer.cc after merge, test=develop

* update reducer cmake after merge, test=develop

1d996637

fix the bug in backward OP of index_sample. (#31026) · b95eb38b
Authored by JamesLim on Feb 22, 2021

b95eb38b

20 Feb, 2021 12 commits
- Remove PE special profiler (#30886) · 6b3371e0
  Authored by Chengmo on Feb 20, 2021
```
* remove pe special profiler

* add profiler info
```
  6b3371e0
- [CustomOp] Add more dispatch marco for users (#31058) · 6beeafe7
  Authored by Chen Weihang on Feb 20, 2021
```
* add more dispatch marco

* add more dispatch marco

* add more tests

* revert unneeded change

* add timeout for test dispatch

* add float and complex test

* remove and marco
```
  6beeafe7
- add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel... · d5323dab
  Authored by TTerror on Feb 20, 2021
```
add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056)

* add squeeze_op/unsqueeze_op on kunlun; fix conv op and parallel executor on kunlun; optimize lookup_table op on kunlun

* update squeeze/unsqueeze op
```
  d5323dab
- test=develop, save/load, shrink (#30625) · 16b4260b
  Authored by 123malin on Feb 20, 2021
```
* test=develop, save/load, shrink
Co-authored-by: NseiriosPlus <tangwei12@baidu.com>
```
  16b4260b
- export paddle.static.normalize_program method. (#31072) · 4424aac6
  Authored by Shibo Tao on Feb 20, 2021
```
* export paddle.static.normalize_program method. test=develop

* fix ut coverage.test=develop
```
  4424aac6
- hide useless headers and add complex support (#31074) · 628451af
  Authored by Jiabin Yang on Feb 20, 2021
  
  628451af
- update paddle_fluid.so to paddle_inference.so (#30850) · 463eae03
  Authored by Wilber on Feb 20, 2021
```
* update paddle_fluid.so to paddle_inference.so
```
  463eae03
- change fleet reviewer (#31069) · a2170a08
  Authored by tangwei12 on Feb 20, 2021
```
* change reviewer, test=document

Change-Id: I7592ee5c93bd580300ce39df885b603597b09026

* Update check_file_diff_approvals.sh

test=document_fix
```
  a2170a08
- [static setitem] Support the index is Tensor; step>1; step<0 .(#30949) · 5b367dab
  Authored by liym27 on Feb 20, 2021
```
* [static setitem] support the index step > 1. tensor_a[::3] = value

* [static setitem] support the index step < 0. Eg: tensor_a[::-3] = value

* [static setitem] support the index is Tensor. eg: tensor_a[tensor_3:0:-1] = value

* Add op version.
```
  5b367dab
- [ROCM] update fluid inference for rocm (part1), test=develop (#31018) · eb3050fa
  Authored by Qi Li on Feb 20, 2021
  
  eb3050fa
- add detail about states index in rnn result, test=document_fix (#31048) · 6df1ca54
  Authored by Jack Zhou on Feb 20, 2021
  
  6df1ca54
- Fix that convert_var_shape doesn't support slice like [0:], test=develop (#31051) · ef627ac5
  Authored by Huihuang Zheng on Feb 20, 2021
```
As the title says, when a slice_node such as 1:3 is passed as the idx of convert_var_shape, it causes a syntax error because a function cannot take a bare slice as an argument. This PR fixes it.
```
  ef627ac5
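The syntax problem behind the `convert_var_shape` slice fix (#31051) is a general Python rule: slice notation such as `1:3` is only valid inside subscript brackets, not as a call argument. The sketch below shows this with a hypothetical helper, `convert_var_shape_sketch`, which is not Paddle's function. Passing an explicit `slice(...)` object is one way to carry a slice through a call; whether the PR uses exactly this approach is an assumption.

```python
def convert_var_shape_sketch(shape, idx=None):
    """Hypothetical helper: returns the whole shape, or shape[idx] if idx is given."""
    return shape if idx is None else shape[idx]


# A bare slice is not a valid call argument, so this source text fails to compile.
try:
    compile("convert_var_shape_sketch(shape, 1:3)", "<example>", "eval")
    bare_slice_ok = True
except SyntaxError:
    bare_slice_ok = False

shape = [4, 5, 6, 7]

# An explicit slice object is an ordinary value and can be passed as an argument.
middle = convert_var_shape_sketch(shape, slice(1, 3))     # same as shape[1:3]
tail = convert_var_shape_sketch(shape, slice(0, None))    # same as shape[0:]
print(bare_slice_ok, middle, tail)
```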
19 Feb, 2021 12 commits
- Added reshape grad bf16 (#31035) · f7465641
  Authored by Jacek Czaja on Feb 19, 2021
```
* - added Reshape grad bf16

* - Added reshape grad bf16

* - cosmetics in py
```
  f7465641
- [CustomOp] Refine name argument in setup (#31049) · 4dbe16c4
  Authored by Aurelius84 on Feb 19, 2021
```
* refine setup name usage

* fix unittest failed
```
  4dbe16c4
- [CustomOp] Support output dtypes in generated Python API (#31045) · f2dc29a9
  Authored by Aurelius84 on Feb 19, 2021
  
  f2dc29a9
- Modify relu native implementation 2 (#30996) · 615d8a22
  Authored by Wojciech Uss on Feb 18, 2021
```
* Modify relu native implementation

* fix GPU performance
```
  615d8a22
- Remove scale loss before reduce in dygraph (#30807) · 9401173e
  Authored by ShenLiang on Feb 19, 2021
  
  9401173e
- fix python pass builder error. (#30946) · 0020d915
  Authored by Wilber on Feb 18, 2021
  
  0020d915
- fix jetson problem (#30939) · 39aeaa16
  Authored by Wilber on Feb 18, 2021
  
  39aeaa16
- update trt error message when input height or width is -1 (#31019) · 01ccfbcd
  Authored by Wilber on Feb 18, 2021
  
  01ccfbcd
- resolve memory leak in cudnn8.0 (#31029) · cf8b8f9c
  Authored by Wilber on Feb 18, 2021
  
  cf8b8f9c
- fix dataloader collate return list mix tensor and numpy array (#30904) · c4ddc3ab
  Authored by Kaipeng Deng on Feb 19, 2021
```
* fix dataloader collate return list mix tensor and numpy array. test=develop
```
  c4ddc3ab
- add offset parameter in roi_align,generate_proposals.etc ops (#30864) · 5b267474
  Authored by Guanghua Yu on Feb 19, 2021
```
* add  parameter in roi_align op
```
  5b267474
- fix regex error & simplify marco name (#31031) · 75f81233
  Authored by Chen Weihang on Feb 18, 2021
  
  75f81233
18 Feb, 2021 4 commits

enable exhaustive_search for forward and backward algos when dtype is float16 (#30959) · f0ee1592
Authored by Zhang Ting on Feb 18, 2021
```
* enable exhaustive_search for input_grad when dtype is float16

* enable exhaustive_search for forward algos
```
f0ee1592
add trt transpose and flatten converter (#31022) · 9b54fe41
Authored by Pei Yang on Feb 18, 2021

9b54fe41

[CustomOp] Support Compile multi ops at same time (#30920) · 4c9f96c9

Authored by Aurelius84 on Feb 18, 2021


* add more unitest for ABI compatibility

* add more unittest

* refine warning style

* support compile multi custom ops in same time

* fix not import paddle in unittest

* fix typo

* add more unittest

* add comment for details

4c9f96c9

Add Conv Transpose BF16 (#30877) · caf9d398

Authored by joanna.wozna.intel on Feb 18, 2021

* Add conv transpose BF16

* Share function GetWeightsTz

* Adjust to review and fix op compatibility

* Add bias to unique handler name

* Remove errors related to paddle enforce

* Add conv2d_transpose to bf16 list and kernel refator

caf9d398

PaddlePaddle / Paddle: last synced successfully more than 1 year ago
