提交 · 940d8f253239a74e0812faf9a80f60d5c39ebedd · PaddlePaddle / Paddle

11 10月, 2022 10 次提交
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
- Y
  Fix slice bugs in MKLDNN when input dims are zeros (#46671) · 6c2af7bb
  由 yeliang2258 提交于 10月 11, 2022
```
* fix slice bugs

* fix

* update code

* fix

* update code
```
  6c2af7bb
- H
  
  change mkldnn interp to normal GetExpectedKernelType (#46685) · c5173591
  由 HongyuJia 提交于 10月 11, 2022
  
  c5173591
- H
  [Opt Code] Opt GetExpectedKernelType code of conv_op (#46681) · b4d7ef9d
  由 HongyuJia 提交于 10月 11, 2022
```
* refine conv_op mkldnn code

* fix customized_type_value
```
  b4d7ef9d
- H
  [Opt Code] Opt GetExpectedKernelType code of sum (#46678) · d6c69d7c
  由 HongyuJia 提交于 10月 11, 2022
```
* refine sum_op mkldnn code

* refine sum_op mkldnn code
```
  d6c69d7c
- H
  
  refine mkldnn code (#46677) · ee1aec62
  由 HongyuJia 提交于 10月 11, 2022
  
  ee1aec62
- 傅
  Fix set_value failure when source tensor is fp16 Dtype (#46801) · 2341ed5e
  由傅剑寒提交于 10月 11, 2022
```
* add fp16 data type for set_value

* cancel flip modification

* add fp16 dtype support for set_value
```
  2341ed5e
- H
  [Opt transpose2] Opt GetExpectedKernelType code of transpose2 (#46692) · 98e00793
  由 HongyuJia 提交于 10月 11, 2022
```
* solve transpose2, follow #22402

* fix CI cmake

* update REGISTER_OP_KERNEL of transpose2
```
  98e00793
- H
  
  fix typo (#46814) · 46595d6b
  由 HongyuJia 提交于 10月 11, 2022
  
  46595d6b
- W
  
  [DOC] update docs of activation op (#46556) · 20eb6e00
  由 wuyefeilin 提交于 10月 11, 2022
  
  20eb6e00
10 10月, 2022 8 次提交

由 YuanRisheng 提交于 10月 10, 2022

* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta

* WIP: move rnn kernrl to phi

* Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.

* fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.

* fix ci bugs

* fix ci bugs

* fix ci bugs

* modify code according comment
Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>

ab60fd8b

Z

[inference] CPU-> GPU async io copy for TensorRT using ShareExternalData API (#46636) · c333af2f
由 Zhang Jun 提交于 10月 10, 2022

c333af2f

make fused_multi_transformer support dynamically set the cache_kvs' shape and... · 9ea279a4

由 carryyu 提交于 10月 10, 2022

make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches. (#46777)

* make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches.

9ea279a4

H

delete_activation_headerfile (#46690) · c4bbe5d9
由 HongyuJia 提交于 10月 10, 2022

c4bbe5d9
H

delete_multi_gru_headerfile (#46689) · 749da9a9
由 HongyuJia 提交于 10月 10, 2022

749da9a9
H
[MKLDNN] Delete mkldnn headerfile in quantize and requantize (#46676) · 8ec3b737
由 HongyuJia 提交于 10月 10, 2022
```
* delete_quantize_headerfile

* delete_requantize_headerfile
```
8ec3b737
H

delete_gaussian_random_mkldnn_headerfle (#46669) · 26d1d83e
由 HongyuJia 提交于 10月 10, 2022

26d1d83e

[PHI] transpose2_grad op migration (#46139) · e3407a80

由 Paulina Gacek 提交于 10月 10, 2022

* op migrated, Copy(OneDNNContext, ...) added

* mutable_data & op registration in fluid removed

* refactoring

* OneDNNGetDataType to uppercase

* missing cpu check added, handler moved to .h file

* name changed to transpose_grad

* Copy changed back to TensorCopy

* Resizing corrected, Copy(OneDNNContext) removed

e3407a80

09 10月, 2022 4 次提交
- Z
  
  add sync_batch_norm_kernel (#46430) · 5cd6a707
  由 zhangkaihuo 提交于 10月 09, 2022
  
  5cd6a707
- Z
  
  [Sparse] Add a batch_norm kernel (#46359) · 888223b7
  由 zhangkaihuo 提交于 10月 09, 2022
  
  888223b7
- H
  
  [Dygraph] Fix Perf of FusedFeedForward and FusedAttention with AllReduce (#46780) · 078e8c78
  由 Haohongxiang 提交于 10月 09, 2022
  
  078e8c78
- R
  
  [MLU] fix cmake error (#46772) · 4df12303
  由 ronnywang 提交于 10月 09, 2022
  
  4df12303
08 10月, 2022 2 次提交

C

[MLU] add fluid MLUOps prior_box (#46585) · ff37e48e
由 cifar10 提交于 10月 08, 2022

ff37e48e

fix some doc bug test=document_fix (#45488) · 04abcab8

由 mrcangye 提交于 10月 08, 2022

* fix some doc bug test=document_fix

* fix some docs issues, test=document_fix

* beta -> \beta in softplus

* threshold -> \varepsilon in softplus

* parameter name

* delta -> \delta in smooth_l1_loss

* fix some docs test=document_fix

* fix docs test=document_fix

* fix docs && 增加空行 test=document_fix

* Update python/paddle/nn/functional/activation.py, test=document_fix

* Update python/paddle/nn/layer/activation.py, test=document_fix
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

04abcab8

03 10月, 2022 2 次提交
- J
  OneDNN md-in-tensor refactoring: Added support for md in transpose (#46620) · 19746835
  由 jakpiase 提交于 10月 03, 2022
```
* added transpose

* CI fix

* fix for transpose

* fix after review
```
  19746835
- J
  Requantize to use Memory Desc in Tensors (#46608) · a579e523
  由 Jacek Czaja 提交于 10月 03, 2022
```
* - some more MD changes

* - lint

* - compilation fixes

* - compilation fixes

* - lint

* - fix
```
  a579e523
30 9月, 2022 8 次提交
- H
  
  opt GetExpectedKernelType code of fill_constant_op (#46667) · 136b1f42
  由 HongyuJia 提交于 9月 30, 2022
  
  136b1f42
- H
  
  remove MKLDNN hard code in addmm (#46660) · 9900ed52
  由 HongyuJia 提交于 9月 30, 2022
  
  9900ed52
- C
  
  [MLU] fix phi::Tensor compile error of mlu. (#46649) · 2e231402
  由 Chenxiao Niu 提交于 9月 30, 2022
  
  2e231402
- [MLU] add_fluid_mluop_yolo_box (#46573) · 832b0a15
  由光明和真理提交于 9月 30, 2022
  
  832b0a15
- H
  [Opt Code] Opt GetExpectedKernelType code of conv_transpose_op (#46666) · 8c067ec1
  由 HongyuJia 提交于 9月 30, 2022
```
* opt GetExpectedKernelType code of conv_transpose_op

* fix if error
```
  8c067ec1
- H
  
  remove_dequantize_mkldnn_headerfile (#46665) · 7a1e1f99
  由 HongyuJia 提交于 9月 30, 2022
  
  7a1e1f99
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46629) · abee2210
  由 HongyuJia 提交于 9月 30, 2022
  
  abee2210
- S
  support pure bfloat16 for more ops (#46364) · b7b231a6
  由 sneaxiy 提交于 9月 30, 2022
```
* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* add bfloat16 to selu_grad to pass CI

* fix selu grad compilation error
```
  b7b231a6
29 9月, 2022 2 次提交

fix P40 topk: Make the optimized topk compatible with P40. (#46547) · 667082c0

由 carryyu 提交于 9月 29, 2022

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

667082c0

[MLU] add mlu kernel for add_reduce_max_grad (#45651) · 1ef1cace
由光明和真理提交于 9月 29, 2022
```
Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com>
```
1ef1cace

28 9月, 2022 4 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

Replacing set_format with set_mem_desc in FC onednn kernel (#46372) · 844d9855

由 Jacek Czaja 提交于 9月 28, 2022

* added fc int8 tests

* CI fix

* added skipping UTs for GPUs

* fixes for CI

* added support for residual connections inside fc

* fix for quant int8 bias

* - lint
Co-authored-by: Njakpiase <jakpia21@gmail.com>

844d9855

L

first commit (#46525) · 806b252c
由 limingshu 提交于 9月 28, 2022

806b252c

[PHI] relu6_grad kernel (#46501) · cee2b12d

由 Sławomir Siwek 提交于 9月 28, 2022

* Relu6

* remove fluid handler

* add individual kernel signature

* coding style

* replace bounded_relu with clip

* whitespace

* code style

cee2b12d

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功