提交 · 6024488d3ae939fe11e43abc9921da042c708256 · BaiXuePrincess / Paddle

28 6月, 2021 2 次提交
- Q
  [ROCM] fix RNN miopen as weight need to permuted, test=develop (#33733) · 6024488d
  由 Qi Li 提交于 6月 28, 2021
```
* [ROCM] fix RNN miopen as weight need to permuted, test=develop

* [ROCM] fix data share when is_test, test=develop

* update, test=develop
```
  6024488d
- P
  [Paddle-TRT]Fix flatten converter when batch_size > 1 (#33768) · d91352c0
  由 Pei Yang 提交于 6月 28, 2021
```
* fix trt flatten converter when batch_size > 1

* change ut to same dynamic shape
```
  d91352c0
25 6月, 2021 1 次提交
- W
  
  static support mp_layers (#33700) · 91a0acdb
  由 WangXi 提交于 6月 25, 2021
  
  91a0acdb
24 6月, 2021 7 次提交
- H
  [NPU] support dygraph execution on npu place(#33579) · 6aea6be2
  由 houj04 提交于 6月 24, 2021
```
* in NPU environment, use CPUPlace for missing operators.

* in NPU environment, use CPUPlace for missing operators.

* fix TensorCopy bug and add unit test.

* fix code style.

* add more unit tests.
```
  6aea6be2
- J
  [oneDNN] Fix to #33282 , added support of X input broadcasting to oneDNN elementwise ops (#33549) · 049dd853
  由 Jacek Czaja 提交于 6月 24, 2021
```
* - fix to #33282

* - Increased threshold for elementwise_mul_bf16 grad

* -disabled faulty UT

* - fix to approval
```
  049dd853
- A
  [Dy2Stat]Support Python3 type hint (#33745) · c7797802
  由 Aurelius84 提交于 6月 24, 2021
```
* support type hint

* fix unittest
```
  c7797802
- W
  TestSaveLoadLargeParameters use cpu place. (#33756) · 1def9e05
  由 WeiXin 提交于 6月 24, 2021
```
* TestSaveLoadLargeParameters use cpu place.

* edit unittest
```
  1def9e05
- J
  
  fix undef var (#33692) · 68c1fe8c
  由 Jiangxinz 提交于 6月 24, 2021
  
  68c1fe8c
- J
  
  fix undef var (#33691) · 49638f25
  由 Jiangxinz 提交于 6月 24, 2021
  
  49638f25
- C
  supplet several interface of static Variable to consistent with dygraph Tensor (#33330) · af9dcb2d
  由 CtfGo 提交于 6月 24, 2021
```
As the title
```
  af9dcb2d
23 6月, 2021 4 次提交
- J
  Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509
  由 jakpiase 提交于 6月 23, 2021
```
* base changes for split op

* 90% of split functionality added

* full fp32 functionality

* added bf16 test

* added submemory caching

* added bf test to static mode whitelist

* minor change

* enabled split op for inference

* minor fix

* minor fix
```
  68106509
- K
  elastic unitest (#33728) · 9b58cbf1
  由 kuizhiqing 提交于 6月 23, 2021
```
* elastic unitest

* rename demo
```
  9b58cbf1
- B
  
  repair npu matmul_grad and comm_init_hccl (#33719) · 9bf00cd5
  由 Baibaifan 提交于 6月 23, 2021
  
  9bf00cd5
- Z
  
  Add new operation: BroadcastTensorsOp (#33294) · affddfaa
  由 Zhanlue Yang 提交于 6月 23, 2021
  
  affddfaa
22 6月, 2021 3 次提交

[API/OP]Add a new API paddle.diagonal (#33586) · ad106290

由 zhangbo9674 提交于 6月 22, 2021

* new api diagonal, test=develop

* add new api diagonal, test=develop

* new api diagonal, test=develop

* add new api paddle.diagonal, test=develop

* use framework::stride replace ComputeDimStride

* replace cudaMalloc/cudaMemcpy by TensorFormVector in cudaKernel and cudaGradKernel

* perfect funciton: when attr(offset) is exceed attr(axis1) or attr(axis2), set the diagonal dim is 0

* fix RP-Mac-CI bug: replace framework::stride() by ComputDimStride.

* perfect code-block

* perfect code of python API diagonal

* api supports dtype of float16 and bool

* api supports dtype of float16 and bool

* modify unittest code

* modify unittest code

* perfect dtype describe

* perfect code-block

ad106290

Z

Fix the save path problem of UT test_pass_builder. (#33717) · 8a5bbae6
由 Zhen Wang 提交于 6月 22, 2021

8a5bbae6
C
transform complex scale to tensor (#33699) · 5db0c84b
由 chentianyu03 提交于 6月 22, 2021
```
* transform complex scale to tensor

* add test_case for complex scalar

* modify import paddle
```
5db0c84b

21 6月, 2021 9 次提交

Add AXPY oneDNN handler (#33632) · 773aabc7

由 lidanqing 提交于 6月 21, 2021

* Add oneDNN AXPY handler.

* Add fallback for small tensors.

* Fix ifdefs

* Remove unnecessary namespace prefixes and add missing headers.

* Guard handler_axpy with proper ifdefs.

* Compilation of this function is possible only when Paddle is not build
with CUDA nor HIP.

* Move AXPY handler code to separate files.

* Use oneDNN AXPY handler in SGD op.

* Use axpy handler only when Paddle is built with oneDNN.

* Add test for SUM BF16 with big rows.

* Fix SFINAE rules for elementwise_add_to.

* Add test case for SGD with big rows.

* update

* update
Co-authored-by: NAdam Osewski <adam.osewski@intel.com>

773aabc7

Y

add sync calc stream and add ut for fuse on gpu (#33580) · e0e0c0fa
由 Yuang Liu 提交于 6月 21, 2021

e0e0c0fa

[NPU] optimize mul op, use BatchMatMul to realize (#33616) · f91dfe15

由 pangyoki 提交于 6月 21, 2021

* use BatchMatMul

* replace TensorCopy with ShareDataWith

* remove check fp16 grad

* fix format

* add grad_check

* fix grad check

f91dfe15

T
Del six.PY code2 (#33607) · 0f7187af
由 tianshuo78520a 提交于 6月 21, 2021
```
* del py2 code2

* fix test timeout
```
0f7187af
J

fix undef val (#33562) · 4b9430a1
由 Jiangxinz 提交于 6月 21, 2021

4b9430a1

[NPU] flatten params and grads, fuse grad_clip and optimizer op (#33461) · c269a160

由 Leo Chen 提交于 6月 21, 2021

* enable npu alignment

* support flatten_params/grads

* support clip by global norm

* remove memset in coalesce_tensor_op

* fix npu kernel of sum op when input is one tensor

* add ut for flatten_param_grads+regularizer

* fix ut

* fix typo

c269a160

J

fix lack of self arg (#33598) · fa821ef9
由 Jiangxinz 提交于 6月 21, 2021

fa821ef9
J

fix unexpected keyword arg (#33569) · a6ba016e
由 Jiangxinz 提交于 6月 21, 2021

a6ba016e
W

fix sgd unittest timeout (#33665) · fc7e3e99
由 WangXi 提交于 6月 21, 2021

fc7e3e99

17 6月, 2021 4 次提交

Add bf16 support for save and load ops (#33173) · 832a014c

由 joanna.wozna.intel 提交于 6月 17, 2021

* Add bf16 support for save and load ops

* Add bf16 test condition

* Add matmul and chagne fluid.io to paddle.static

* Reduce the test duration

832a014c

R
Add atan2 op and test (#33067) · 918aeb71
由 ronnywang 提交于 6月 16, 2021
```
* add atan2_op

* fix
```
918aeb71

[Dy2Stat]Support non-tensor type in `input_spec` (#33464) · 63b03cf5

由 Aurelius84 提交于 6月 17, 2021

* support non-tensor type

* fix unittest failed

* add unittest with prune

* rm unused code

* coverage

* fix two or

63b03cf5

Add lookup_table_v2 BF16 op (#33172) · 9d6c8bdf

由 joanna.wozna.intel 提交于 6月 16, 2021

* Add lookup_table_v2 BF16

* Reuse lookup table UT

* Change op_type to op_version

* Remove check_dygraph

* Remove skip_check_grad_ci

9d6c8bdf

16 6月, 2021 7 次提交
- J
  [oneDNN] Further ops refactoring of oneDNN cache access (#33515) · f9ce1b1a
  由 Jacek Czaja 提交于 6月 16, 2021
```
* - Draft of implementation of refactoring

- compilation fix

* - Fixes after review

* - Removed unnecessary comment
```
  f9ce1b1a
- J
  
  fix bad super call (#33533) · 78a9870f
  由 Jiangxinz 提交于 6月 16, 2021
  
  78a9870f
- Z
  
  Add bitwise_and/or/xor/not OP/API and unittest (#33524) · ecc05377
  由 Zhou Wei 提交于 6月 16, 2021
  
  ecc05377
- Z
  [Feature] add paddle.trunc (#33371) · 72d36970
  由 zhangbo9674 提交于 6月 16, 2021
```
* new api trunc, test=develop
```
  72d36970
- W
  fix output_padding in conv (#33585) · 78260ff3
  由 wangguanzhong 提交于 6月 16, 2021
```
* fix output padding conv

* add repr unittest for conv
```
  78260ff3
- J
  
  fix used before assign (#33519) · e6c5282e
  由 Jiangxinz 提交于 6月 16, 2021
  
  e6c5282e
- S
  [HybridParallel]Add SharedLayerDesc for PipelineParallel (#33578) · 294dfd23
  由 ShenLiang 提交于 6月 16, 2021
```
* add pplayer

* add sharedlayerdesc
```
  294dfd23
15 6月, 2021 3 次提交
- Z
  
  support convert core.Tensor to paddle.Tensor (#33430) · b7a54fc1
  由 Zhou Wei 提交于 6月 15, 2021
  
  b7a54fc1
- L
  [NPU] use SparseSoftmaxCrossEntropyWithLogits in npu kernel of softmax_with_cross_entropy (#32858) · ff825238
  由 Leo Chen 提交于 6月 15, 2021
```
* use SparseSoftmaxCrossEntropyWithLogits

* fix

* test_slice

* revert test_slice

* add backprob for npu kernel

* fix typo

* fix ut

* fix ut

* refine comments

* return softmax
```
  ff825238
- W
  Save all the information of 'ParamBase' in 'Layer'. (#33500) · 28521e0f
  由 WeiXin 提交于 6月 15, 2021
```
* Save all the information of 'ParamBase' in 'Layer'.

* edit unittest
```
  28521e0f

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致