提交 · bdd3dde32322eee59d642d7d33169ffb382c78a6 · PaddlePaddle / Paddle

18 10月, 2022 1 次提交

[code-gen] Support code-gen for opmaker of sparse op (#46993) · bdd3dde3

由 zyfncg 提交于 10月 18, 2022

* support generating code of opmaker for backward op invoke forward op

* gsupport code-gen of opmaker for sparse op

* refind logic of choose phi kernrel

* fix complie budg

* fix code_gen bug

* fix bug

* fix kernel signature code-gen

* fix complie bug of VarType

* fix complie bug of VarType

* fix test_sparse_conv_op

* fix test_sparse_norm_op

bdd3dde3

17 10月, 2022 7 次提交

Support BF16 training for sharding (#46846) · 0b39b244

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* Support bfloat16 type for reducer and sharding.

* Fix some bug.

* Polish code.

* Polise code.

* Add bfloat16 datatype in fill_grad kernels.
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

0b39b244

O

delete maybe unused code in paddle\phi\infermeta\sparse\unary.h (#46844) · 776e80a6
由 OccupyMars2025 提交于 10月 17, 2022

776e80a6
Y
[PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
ec749398
R

Fix warning message format error (#47045) · 13284437
由 RedContritio 提交于 10月 17, 2022

13284437

[Hackathon 3rd No.22 ] add paddle.incubate.sparse.reshape (#46694) · abb38136

由 OccupyMars2025 提交于 10月 17, 2022

* add sparse reshape

* change the dtype in all test cases to int64

* just one test case

* modify comments

* Update test_sparse_reshape_op.py

* chang the type of "shape"  from  vector<int64_t>  to  IntArray

* check whether sp_out.to_dense() is the cause  of error

* print sp_out

* Update reshape_kernel.cc

* use numpy to generate the equal paddle tensor

* just check dense_tensor.numpy()

* check cpu and cuda versions

* Update test_sparse_reshape_op.py

* supply all test cases for cpu forward coo kernel

* test forward coo cuda kernel

* change configuration of cuda kernel

* keep only one test case

* test coo cpu kernel (forward and backward)

* row major or column major ???

* test cuda coo forward kernel

* complete declaration and registration

* Update __init__.py

* rebuild

* retrigger CI

* add cudaMalloc and cudaMemcpy  in  ReshapeCooKernel  and change back to row major order in a cuda dense tensor

* midify minor error

* test only cpu coo forward kernel

* add all test cases for coo forward kernel  (both cpu and gpu)

* test all forward kernels (coo, csr; cpu, gpu)

* add all test cases for all kinds of kernels

* just retrigger CI

* Update sparse_ops.yaml

* Update sparse_ops.yaml

* Update sparse_ops.yaml

* resolve conflicts

* Update sparse_ops.yaml

* don't specify tensor place

* new shape has -1 or 0 in it

* Update unary_grad_kernel.h

* correct lvalue error

* code style

* Update sparse_backward.yaml

* Update sparse_ops.yaml

* Update unary_kernel.h

* Update unary.py

* Update sparse_backward.yaml

* Update unary.py

* code style

* code style

* code style

* Update unary.py

* specify tensor place explicitly

* do not use numpy array

* use numpy array in unit test again

* modify example code in docstring

abb38136

L
Fix the bug of PHI kernel of reduce_sum in kunlun when using eager mode. (#47004) · f9c1cdc1
由 Leo Guo 提交于 10月 17, 2022
```
test=kunlun
```
f9c1cdc1
D
[Custom Device] Add singleton to custom device (#46963) · 73196e5a
由 duanyanhui 提交于 10月 17, 2022
```
* add singleton to custom device

* Update custom_device.cc

Init device_init_flag_ in default
```
73196e5a

14 10月, 2022 2 次提交
- R
  
  speed_up for deformable conv (#46997) · eee6b3a7
  由 Rayman 提交于 10月 14, 2022
  
  eee6b3a7
- W
  TRT pool2d adaptive mode bugfix (#46802) · eb32746a
  由 Wang Bojun 提交于 10月 14, 2022
```
* draft with debug print
```
  eb32746a
13 10月, 2022 7 次提交

Z
[Phi] Refactor logic of judging whether having a phi kernrel (#46920) · 8d797fd2
由 zyfncg 提交于 10月 13, 2022
```
* refind logic of choose phi kernrel

* fix complie budg
```
8d797fd2
X

logsumexp support fp16 (#45817) · 910e1b6a
由 xiaohemaikoo 提交于 10月 13, 2022

910e1b6a
[Zero-Dim] support 0D for paddle.transpose/reshape/stack/tile/unsqueeze (#46555) · 78add057
由 zhouweiwei2014 提交于 10月 13, 2022

78add057
C

fix softmax memory align (#46902) · 71748805
由 carryyu 提交于 10月 13, 2022

71748805

Revert #46111 (#46961) · cf9ca61d

由 Zhang Ting 提交于 10月 13, 2022

* Revert "【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111)"

cf9ca61d

Z
Correct the logic and remove unnecessary template param (#46623) · 450af30c
由 Zhang Zheng 提交于 10月 13, 2022
```
* Correct the logic and remove unnecessary template param

* fix error throw

* fix print format

* fix ci
```
450af30c

[Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (#46606) · ef1c8759

由 HongyuJia 提交于 10月 13, 2022

* remove PADDLE_WITH_MKLDNN, test white_list=abs

* fix unique_ptr

* fix op.Type()

* remove TODO in kernel_dispatch.h

* remove IndicateVarDataType function, update white_list

* remove mkldnn hard code

* add comments

* fix ==

* update mkldnn_op_list

* delete hard code of OPs

* update mkldnn_op_list

* update mkldnn_op_list, remove interp

* add error check for ExecutionContext

* update mkldnn_op_list, remove transpose2_grad

* remove interpolate mkldnn

* remove fill_constant mkldnn

* opt HasAttr in DygraphExecutionContext

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_black_list

* update mkldnn_op_list, add assert error op

* solve cudnn related op

* fix error

* add mkldnn fallback in phi_utils.cc

* remove mkldnn fallback in phi_utils.cc

* opt code implementation

* polish Copyright License

ef1c8759

12 10月, 2022 7 次提交
- Z
  Revert "remove comment (#46827)" (#46935) · 2ea3700a
  由 Zhang Ting 提交于 10月 12, 2022
```
This reverts commit 8a5f17e8.
```
  2ea3700a
- Z
  
  deliver indices_dict (#46919) · 4681f13b
  由 zhangkaihuo 提交于 10月 12, 2022
  
  4681f13b
- Z
  
  support generating code of opmaker for backward op invoke forward op (#46912) · 227ab74d
  由 zyfncg 提交于 10月 12, 2022
  
  227ab74d
- S
  Fix some operators when the tensor.numel() > INT32_MAX (#46767) · e896567e
  由 sneaxiy 提交于 10月 12, 2022
```
* fix some ops for int64 range

* update error message
```
  e896567e
- [Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
  由 zhouweiwei2014 提交于 10月 12, 2022
```
* [Zero-Dim] support input 0D Tensor for unary api

* fix CI
```
  05c2b9ba
- Z
  
  [Sparse] Rename and fix doc (#46853) · a9cc5482
  由 zhangkaihuo 提交于 10月 12, 2022
  
  a9cc5482
- S
  
  [CodeStyle][F401] remove unused imports in unittests/r_cmake_paddle_tools. (#46712) · 5f25183e
  由 Shuangchi He 提交于 10月 12, 2022
  
  5f25183e
11 10月, 2022 4 次提交
- F
  
  set_value_op: add support for complex types (#46884) · 34c7e3e3
  由 Feiyu Chan 提交于 10月 11, 2022
  
  34c7e3e3
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
- 傅
  Fix set_value failure when source tensor is fp16 Dtype (#46801) · 2341ed5e
  由傅剑寒提交于 10月 11, 2022
```
* add fp16 data type for set_value

* cancel flip modification

* add fp16 dtype support for set_value
```
  2341ed5e
- N
  
  Update layout autotune for module with no modified (#46541) · 3da3462f
  由 niuliling123 提交于 10月 11, 2022
  
  3da3462f
10 10月, 2022 5 次提交

[PHI]Add RNN yaml (#46812) · ab60fd8b

由 YuanRisheng 提交于 10月 10, 2022

* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta

* WIP: move rnn kernrl to phi

* Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.

* fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.

* fix ci bugs

* fix ci bugs

* fix ci bugs

* modify code according comment
Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>

ab60fd8b

R

remove comment (#46827) · 8a5f17e8
由 Rayman 提交于 10月 10, 2022

8a5f17e8

[PHI] transpose2_grad op migration (#46139) · e3407a80

由 Paulina Gacek 提交于 10月 10, 2022

* op migrated, Copy(OneDNNContext, ...) added

* mutable_data & op registration in fluid removed

* refactoring

* OneDNNGetDataType to uppercase

* missing cpu check added, handler moved to .h file

* name changed to transpose_grad

* Copy changed back to TensorCopy

* Resizing corrected, Copy(OneDNNContext) removed

e3407a80

R

【Hackathon No.36】优化 lerp_grad op 在 GPU 上的计算性能 (#45946) · ef61df30
由 Rayman 提交于 10月 10, 2022

ef61df30
R
【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111) · 5e0614a1
由 Rayman 提交于 10月 10, 2022
```
support fp16 for deformable conv
```
5e0614a1

09 10月, 2022 4 次提交
- Z
  
  add sync_batch_norm_kernel (#46430) · 5cd6a707
  由 zhangkaihuo 提交于 10月 09, 2022
  
  5cd6a707
- Z
  
  [Sparse] Add a batch_norm kernel (#46359) · 888223b7
  由 zhangkaihuo 提交于 10月 09, 2022
  
  888223b7
- S
  
  add seed check (#46747) · 97ec57fe
  由 Sławomir Siwek 提交于 10月 09, 2022
  
  97ec57fe
- S
  Enable hard_swish_grad unit test (#46621) · ff0171e4
  由 Sławomir Siwek 提交于 10月 09, 2022
```
* enable hard_swish_grad unit test

* remove unused argument
```
  ff0171e4
08 10月, 2022 1 次提交
- H
  
  fix typo (#46680) · 6e9bb9f9
  由 HongyuJia 提交于 10月 08, 2022
  
  6e9bb9f9
03 10月, 2022 1 次提交
- J
  Requantize to use Memory Desc in Tensors (#46608) · a579e523
  由 Jacek Czaja 提交于 10月 03, 2022
```
* - some more MD changes

* - lint

* - compilation fixes

* - compilation fixes

* - lint

* - fix
```
  a579e523
30 9月, 2022 1 次提交
- Fix undefined reference PD_IntArrayGetElementCount (#46662) · 2055a1d2
  由 engineer1109 提交于 9月 30, 2022
```
* Fix undefined reference PD_IntArrayGetElementCount

* Delete PD_IntArrayGetSize Unused
```
  2055a1d2

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功