提交 · af9dcb2da09d9940dffef355b3da99e0dbd8d55d · Crayon鑫 / Paddle

24 6月, 2021 1 次提交
- C
  supplet several interface of static Variable to consistent with dygraph Tensor (#33330) · af9dcb2d
  由 CtfGo 提交于 6月 24, 2021
```
As the title
```
  af9dcb2d
23 6月, 2021 4 次提交
- J
  Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509
  由 jakpiase 提交于 6月 23, 2021
```
* base changes for split op

* 90% of split functionality added

* full fp32 functionality

* added bf16 test

* added submemory caching

* added bf test to static mode whitelist

* minor change

* enabled split op for inference

* minor fix

* minor fix
```
  68106509
- K
  elastic unitest (#33728) · 9b58cbf1
  由 kuizhiqing 提交于 6月 23, 2021
```
* elastic unitest

* rename demo
```
  9b58cbf1
- B
  
  repair npu matmul_grad and comm_init_hccl (#33719) · 9bf00cd5
  由 Baibaifan 提交于 6月 23, 2021
  
  9bf00cd5
- Z
  
  Add new operation: BroadcastTensorsOp (#33294) · affddfaa
  由 Zhanlue Yang 提交于 6月 23, 2021
  
  affddfaa
22 6月, 2021 3 次提交

[API/OP]Add a new API paddle.diagonal (#33586) · ad106290

由 zhangbo9674 提交于 6月 22, 2021

* new api diagonal, test=develop

* add new api diagonal, test=develop

* new api diagonal, test=develop

* add new api paddle.diagonal, test=develop

* use framework::stride replace ComputeDimStride

* replace cudaMalloc/cudaMemcpy by TensorFormVector in cudaKernel and cudaGradKernel

* perfect funciton: when attr(offset) is exceed attr(axis1) or attr(axis2), set the diagonal dim is 0

* fix RP-Mac-CI bug: replace framework::stride() by ComputDimStride.

* perfect code-block

* perfect code of python API diagonal

* api supports dtype of float16 and bool

* api supports dtype of float16 and bool

* modify unittest code

* modify unittest code

* perfect dtype describe

* perfect code-block

ad106290

Z

Fix the save path problem of UT test_pass_builder. (#33717) · 8a5bbae6
由 Zhen Wang 提交于 6月 22, 2021

8a5bbae6
C
transform complex scale to tensor (#33699) · 5db0c84b
由 chentianyu03 提交于 6月 22, 2021
```
* transform complex scale to tensor

* add test_case for complex scalar

* modify import paddle
```
5db0c84b

21 6月, 2021 9 次提交

Add AXPY oneDNN handler (#33632) · 773aabc7

由 lidanqing 提交于 6月 21, 2021

* Add oneDNN AXPY handler.

* Add fallback for small tensors.

* Fix ifdefs

* Remove unnecessary namespace prefixes and add missing headers.

* Guard handler_axpy with proper ifdefs.

* Compilation of this function is possible only when Paddle is not build
with CUDA nor HIP.

* Move AXPY handler code to separate files.

* Use oneDNN AXPY handler in SGD op.

* Use axpy handler only when Paddle is built with oneDNN.

* Add test for SUM BF16 with big rows.

* Fix SFINAE rules for elementwise_add_to.

* Add test case for SGD with big rows.

* update

* update
Co-authored-by: NAdam Osewski <adam.osewski@intel.com>

773aabc7

Y

add sync calc stream and add ut for fuse on gpu (#33580) · e0e0c0fa
由 Yuang Liu 提交于 6月 21, 2021

e0e0c0fa

[NPU] optimize mul op, use BatchMatMul to realize (#33616) · f91dfe15

由 pangyoki 提交于 6月 21, 2021

* use BatchMatMul

* replace TensorCopy with ShareDataWith

* remove check fp16 grad

* fix format

* add grad_check

* fix grad check

f91dfe15

T
Del six.PY code2 (#33607) · 0f7187af
由 tianshuo78520a 提交于 6月 21, 2021
```
* del py2 code2

* fix test timeout
```
0f7187af
J

fix undef val (#33562) · 4b9430a1
由 Jiangxinz 提交于 6月 21, 2021

4b9430a1

[NPU] flatten params and grads, fuse grad_clip and optimizer op (#33461) · c269a160

由 Leo Chen 提交于 6月 21, 2021

* enable npu alignment

* support flatten_params/grads

* support clip by global norm

* remove memset in coalesce_tensor_op

* fix npu kernel of sum op when input is one tensor

* add ut for flatten_param_grads+regularizer

* fix ut

* fix typo

c269a160

J

fix lack of self arg (#33598) · fa821ef9
由 Jiangxinz 提交于 6月 21, 2021

fa821ef9
J

fix unexpected keyword arg (#33569) · a6ba016e
由 Jiangxinz 提交于 6月 21, 2021

a6ba016e
W

fix sgd unittest timeout (#33665) · fc7e3e99
由 WangXi 提交于 6月 21, 2021

fc7e3e99

17 6月, 2021 4 次提交

Add bf16 support for save and load ops (#33173) · 832a014c

由 joanna.wozna.intel 提交于 6月 17, 2021

* Add bf16 support for save and load ops

* Add bf16 test condition

* Add matmul and chagne fluid.io to paddle.static

* Reduce the test duration

832a014c

R
Add atan2 op and test (#33067) · 918aeb71
由 ronnywang 提交于 6月 16, 2021
```
* add atan2_op

* fix
```
918aeb71

[Dy2Stat]Support non-tensor type in `input_spec` (#33464) · 63b03cf5

由 Aurelius84 提交于 6月 17, 2021

* support non-tensor type

* fix unittest failed

* add unittest with prune

* rm unused code

* coverage

* fix two or

63b03cf5

Add lookup_table_v2 BF16 op (#33172) · 9d6c8bdf

由 joanna.wozna.intel 提交于 6月 16, 2021

* Add lookup_table_v2 BF16

* Reuse lookup table UT

* Change op_type to op_version

* Remove check_dygraph

* Remove skip_check_grad_ci

9d6c8bdf

16 6月, 2021 7 次提交
- J
  [oneDNN] Further ops refactoring of oneDNN cache access (#33515) · f9ce1b1a
  由 Jacek Czaja 提交于 6月 16, 2021
```
* - Draft of implementation of refactoring

- compilation fix

* - Fixes after review

* - Removed unnecessary comment
```
  f9ce1b1a
- J
  
  fix bad super call (#33533) · 78a9870f
  由 Jiangxinz 提交于 6月 16, 2021
  
  78a9870f
- Z
  
  Add bitwise_and/or/xor/not OP/API and unittest (#33524) · ecc05377
  由 Zhou Wei 提交于 6月 16, 2021
  
  ecc05377
- Z
  [Feature] add paddle.trunc (#33371) · 72d36970
  由 zhangbo9674 提交于 6月 16, 2021
```
* new api trunc, test=develop
```
  72d36970
- W
  fix output_padding in conv (#33585) · 78260ff3
  由 wangguanzhong 提交于 6月 16, 2021
```
* fix output padding conv

* add repr unittest for conv
```
  78260ff3
- J
  
  fix used before assign (#33519) · e6c5282e
  由 Jiangxinz 提交于 6月 16, 2021
  
  e6c5282e
- S
  [HybridParallel]Add SharedLayerDesc for PipelineParallel (#33578) · 294dfd23
  由 ShenLiang 提交于 6月 16, 2021
```
* add pplayer

* add sharedlayerdesc
```
  294dfd23
15 6月, 2021 7 次提交

Z

support convert core.Tensor to paddle.Tensor (#33430) · b7a54fc1
由 Zhou Wei 提交于 6月 15, 2021

b7a54fc1

[NPU] use SparseSoftmaxCrossEntropyWithLogits in npu kernel of softmax_with_cross_entropy (#32858) · ff825238

由 Leo Chen 提交于 6月 15, 2021

* use SparseSoftmaxCrossEntropyWithLogits

* fix

* test_slice

* revert test_slice

* add backprob for npu kernel

* fix typo

* fix ut

* fix ut

* refine comments

* return softmax

ff825238

W
Save all the information of 'ParamBase' in 'Layer'. (#33500) · 28521e0f
由 WeiXin 提交于 6月 15, 2021
```
* Save all the information of 'ParamBase' in 'Layer'.

* edit unittest
```
28521e0f
W
add the support for the bool in compare ops · 1f8de080
由 wawltor 提交于 6月 15, 2021
```
add the support for the bool in compare ops
```
1f8de080

Support reduce_sum_op float16 (#32966) · 606939de

由 jiangcheng 提交于 6月 15, 2021

* add reduce_sum_op by add self-kernel

* set all ReduceKernel MPType for accuracy

* add float16 test script which input is integer number

* solve reduce sum float16 check_grad problem

* solve conflict and change test script for CI

* change kernel register for CI

* remove all useless template

606939de

Add digamma_op and unittest (#33278) · 02a6d49a

由 zyfncg 提交于 6月 15, 2021

* Add digamma_op and unittest

* add digamma_op api

* remove special DigammaCudaKernel and correct some docs

* remove unused headers

* fix api doc error

02a6d49a

J
Revert "Fix some Bugs of Undefined Variable (#33488)" (#33538) · 18e71bdf
由 Jiangxinz 提交于 6月 15, 2021
```
This reverts commit b2afc8df.
```
18e71bdf

14 6月, 2021 1 次提交
- K
  Add warning for dataloader incompatable upgrade (#32967) · 308467c3
  由 Kaipeng Deng 提交于 6月 14, 2021
```
* add warning log for DataLoader output format imcompatible upgrade. test=develop
```
  308467c3
12 6月, 2021 3 次提交

Fix LayerNorm Problem (#33420) · fe94db6c

由 zhiboniu 提交于 6月 12, 2021

* Eliminate numerical differences of LayerNorm; fix LayerNorm Nan Bug while large data input

* fix bug while large shape of data input

fe94db6c

P
[Paddle-TRT] add support for trt dynamic shape flatten op (#33394) · 24bde98f
由 Pei Yang 提交于 6月 12, 2021
```
* add support for trt dynamic shape flatten op

* add version restriction

* add ut input dynamic shape
```
24bde98f

由 joanna.wozna.intel 提交于 6月 11, 2021

* Small changes related to BF16 fusion_gru and fusion_lstm

* Correct to pass arg by value

* Add conditions to rnn op

* Correct the spelling mistake

* Improving the test with checking activation

* Trigger CI

cd95ea82

11 6月, 2021 1 次提交
- S
  Fix gather infer shape using axis (#33413) · abc17ef7
  由 ShenLiang 提交于 6月 11, 2021
```
* fix gather shape bug

* fix None

* fix topo
```
  abc17ef7

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致