Commits · 6698e8d1011254d17aed7f57f3f2a778d2c7a661 · PaddlePaddle / Paddle

12 Dec, 2022 · 8 commits
- fix: Move the pass location to the appropriate location (#48951) · 6698e8d1
  Committed by feng_shuai on Dec 12, 2022
- forbid conv op whose weight is not a persistable weight into Paddle-TRT (#48763) · 60223894
  Committed by zhoutianzi666 on Dec 12, 2022
- [PHI decoupling] move norm_utils.cu.h from fluid to phi and remove norm_utils.h in fluid (#48930) · 3cb8db8f
  Committed by huangjiyi on Dec 12, 2022
```
* move norm_utils.cu.h from fluid to phi

* remove norm_utils.h in fluid

* fix bugs and replace mutable_data with Alloc

* replace mutable_data with Alloc
```
- add static_ops.yaml for static op (#48991) · 8f87f0c7
  Committed by zyfncg on Dec 12, 2022
- fix a bug in GetTrtWeight (#48993) · 93e36b06
  Committed by zhoutianzi666 on Dec 12, 2022
- Generate static graph code of some ops by yaml (#48771) · 4c0d46a8
  Committed by HappyHeavyRain on Dec 12, 2022
```
* generate static graph code of some ops by yaml, test = develop

* fix 'take_along_axis' yaml style

* reset scatter/scatter_nd_add

* delete the comments of put_along_axis
```
- Support cross-step stream synchronization for standalone executor (#48809) · 9455d146
  Committed by Ruibiao Chen on Dec 12, 2022
```
* Add UT

* Support cross-step stream synchronization for standalone executor

* Fix typos

* Fix typos

* Update UTs
```
- Add dynamic checks for collective communication on NCCL (#48915) · e7711592
  Committed by Wen Sun on Dec 12, 2022
```
* chore: unify `SingleTensor`

* feat: dynamic check
```
11 Dec, 2022 · 1 commit
- fix for mkldnn (#48852) · 96e58f87
  Committed by Wilber on Dec 11, 2022
10 Dec, 2022 · 1 commit
- [Paddle-TRT] add cast between int64 tensor and Paddle-TRT (#45547) · fd373579
  Committed by zhoutianzi666 on Dec 10, 2022
```
* Add cast between int64 tensor and Paddle-TRT
* Add Unit testing.
```
09 Dec, 2022 · 11 commits
- [PHI] Migrate reshape kernel (#48749) · 7b2b0c1b
  Committed by Sławomir Siwek on Dec 09, 2022
```
* reshape

* typo

* remove header
```
- [Inference] optimize some code and fix some bug (#48780) · c0034b5b
  Committed by Yuanle Liu on Dec 09, 2022
```
* clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass

* fix unitest timeout
```
- [Custom XPU Support] Custom extension support xpu backend (#48733) · 5ecd0ad5
  Committed by HongyuJia on Dec 09, 2022
```
* support custom_xpu

* update cmake to test xpu

* support custom_xpu, verify mechanism

* fix test_custom_relu_op_xpu_setup.py, test=kunlun

* fix FLAGS_init_allocated_mem

* cancel TIMEOUT property

* reset FLAGS_init_allocated_mem property
```
- [inference][trt] upgrade prelu op (#48528) · 98ab2433
  Committed by Zhang Jun on Dec 09, 2022
```
* add prelu
```
- fix scale type in alpha and beta (#48887) · c1cadcca
  Committed by MarDino on Dec 09, 2022
- move ops_extra_info_gen.py from phi to fluid (#48926) · c7d6d9f4
  Committed by huangjiyi on Dec 09, 2022
- mv fused_bias_dropout_residual_ln to fluid manual dir (#48824) · e0131224
  Committed by Weilong Wu on Dec 09, 2022
```
* mv fused_bias_dropout_residual_ln to fluid manual dir

* rm useless comments
```
- [Paddle Inference]add cutlass act set in conv_elementwise_add_act_fuse_pass (#48838) · 0f6c5459
  Committed by zhoutianzi666 on Dec 09, 2022
```
* add cutlass act set in conv_elementwise_add_act_fuse_pass
```
- Support static graph code-gen for scalar and int_array (#48792) · 58f08924
  Committed by zyfncg on Dec 09, 2022
```
* add suppport_tensor for code_gen to static graph

* support code-gen for int_array

* polish code

* fix bug of data_type
```
- move share_buffer kernel to phi (#48858) · c2e77ba3
  Committed by Leo Chen on Dec 09, 2022
```
* move share_buffer kernel to phi

* fix ut

* add source file

* fix window links
```
- [PHI decoupling] move "flags.h" from fluid to phi (#48696) · 39ffef0d
  Committed by PuQing on Dec 09, 2022
08 Dec, 2022 · 12 commits
- fix paddle2cinn float16 type support bug (#48249) · 73bff10f
  Committed by jiangcheng on Dec 08, 2022
- first commit (#38143) · 2e7c172c
  Committed by limingshu on Dec 08, 2022
- fix 'BlasAXPBY unimplemented' error with custom device (#48762) · 127da101
  Committed by Kai Song on Dec 08, 2022
```
* fix 'BlasAXPBY unimplemented' error with custom device

* fix utils CmakeLists bug
```
- rewrite delete_weight_dequant_linear_op_encoder/decoder pass (#48650) · 95332bef
  Committed by RichardWooSJTU on Dec 08, 2022
```
* rewrite delete_weight_deqquant_linear_op_encoder/decoder pass
```
- [Paddle Inference] General optimization for no_varlen embedding layernorm (#48580) · 22bfa579
  Committed by Wangzheee on Dec 08, 2022
```
* general optimization no_varlen embedding layernorm
```
- [PHI decoupling] move cuda_graph from fluid to phi (#48686) · a4d9851b
  Committed by huangjiyi on Dec 08, 2022
```
* move cuda_graph from fluid to phi

* move device_memory_aligment from fluid to phi

* Revert "move device_memory_aligment from fluid to phi"

This reverts commit b92fcd39a0a50fdac13278f49be0237a85f3a13f.

* update xpu cmake
```
- [Inference] Enable infer shape cache. (#48312) · f88713e1
  Committed by Wilber on Dec 08, 2022
- Set WaiterType of kGpuSync to kCPU (#48758) · a5999d83
  Committed by Ruibiao Chen on Dec 08, 2022
- rm kunlun xpu2_op_list (#48826) · 83c41459
  Committed by QingshuChen on Dec 08, 2022
```
*test=kunlun
```
- [Paddle Inference] Add add onehot trt converter (#48655) · 1adf5430
  Committed by 六个骨头 on Dec 08, 2022
```
* add onehot trt converter

* add unitest

* fix bug

* opt code

* fix bug

* fix depth_tensor

* fix unitest

* fix bug

* fix unitest

* fix bug

* fix bug

* fix bug

* fix bug
```
- [Inference] inference add cinn interface (#48741) · 3a387df6
  Committed by Wilber on Dec 08, 2022
- set free_when_no_cache_hit default value to true (#48815) · 592ed40b
  Committed by wanghuancoder on Dec 08, 2022
07 Dec, 2022 · 5 commits
- [PHI] Migrate squeeze and squeeze_grad kernels (#48634) · ad41fce8
  Committed by Sławomir Siwek on Dec 07, 2022
```
* squeeze kernel

* squeze fwd

* whitespace
```
- [phi::DenseTensor] Replace Tensor with phi::DenseTensor (#48682) · 65420271
  Committed by 张春乔 on Dec 07, 2022
- fix: oss just support sm>=75 (#48731) · 87fbc5e4
  Committed by feng_shuai on Dec 07, 2022
- [NPU] add FLAGS_npu_storage_format env to enable npu storage format, test=develop (#48774) · e5bc2eec
  Committed by Qi Li on Dec 07, 2022
- modify d2d copy to xpu::copy in xpu kernel, test=kunlun (#48710) · 0d8ddf9f
  Committed by zhangyikun02 on Dec 07, 2022
06 Dec, 2022 · 2 commits
- Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38
  Committed by zyfncg on Dec 06, 2022

* delete Bias and ResidualData in OpMaker of conv2d

* delete extra input of conv3d

* refactor pass of conv_bias_fusion

* fix mkldnn dependency

* fix mkldnn compile

* fix test_conv_bias_mkldnn_fuse_pass

* police some code

* remove useless log

* fix analyzer_vit_ocr_tester

* fix conv_activation_mkldnn_fuse_pass

* fix test_analyzer_ocr

* add fused_conv_sig

* fix performence regression

* fix performance regression

- add xpu_support op function (#48606) · 06b32b38
  Committed by QingshuChen on Dec 06, 2022
```
*test=kunlun
```

PaddlePaddle / Paddle: synced successfully about 1 year ago
