提交 · c789907430fe2014c3aafc4276f50829a4c867d4 · PaddlePaddle / Paddle

06 1月, 2023 1 次提交

[Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074

由 Yulong Ao 提交于 1月 06, 2023

* [Auto Parallel] Rename methods of ProcessMesh

* [Auto Parallel] Impl the python process_mesh by the c++ one

* [Auto Parallel] Add some minor modifications

* [Auto Parallel] Rename some methods

* [Auto Parallel] Remove unnecessary codes

* [Auto Parallel] Add back some removed files

* [Auto Parallel] Fix bugs

* [Auto Parallel] Fix a bug

* Update process_mesh.cc

* [Auto Parallel] Merge dist attrs of Python into C++

* [Auto Parallel] Add back deleted importing

* [Auto Parallel] Add back removed unittest

* [Auto Parallel] Remove type qualifiers of return types

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix a bug of the quant pass

* [Auto Parallel] Fix the code style

c7899074

05 1月, 2023 14 次提交
- F
  sequence_mask fix: when the input length is an empty tensor, the kernel tries... · 0f3ccd14
  由 Feiyu Chan 提交于 1月 05, 2023
```
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
```
  0f3ccd14
- X
  
  [Paddle Inference] Add ci flags for a persistent IBuilder. (#49538) · fcd6d675
  由 xiaoxiaohehe001 提交于 1月 05, 2023
  
  fcd6d675
- Generate the static graph code of ops (#49413) · 39f0eb2c
  由 HappyHeavyRain 提交于 1月 05, 2023
```
* generate the static graph code of ops

* modify the isclose comment

* modify the clip comment in nn.py

* reset nn.py
```
  39f0eb2c
- Z
  [inference][trt]Upgrade expand cast nearestinterp for sd (#48998) · 5defefd6
  由 Zhang Jun 提交于 1月 05, 2023
```
* update nearest_interp, expand_v2, cast for stable diffusion

* update nearest_interp, expand_v2, cast for stable diffusion

* correct shape rank

* Update expand_v2_op.cc
```
  5defefd6
- J
  CINN add fetch op for skip gc vars (#49553) · c1ce54bf
  由 jiangcheng 提交于 1月 05, 2023
```
* CINN add fetch op for skip gc vars

* perfect test annotation

* break if not is_only_used_internal

* move skip_gc_var_names get out of for loop
```
  c1ce54bf
- R
  
  Adjust OP scheduling order for standalone executor (#49561) · e8f4a327
  由 Ruibiao Chen 提交于 1月 05, 2023
  
  e8f4a327
- 姜
  Yj/rm core ops exp (#49490) · 70ea88bf
  由姜永久提交于 1月 05, 2023
```
* rm op_function_generator

* rm op_func_generator.h

* rm op_function

* modify cmake

* rm op_function.h

* rm check for op_function_generator.cc

* reset imperative

* rm python part

* fix imperative

* lint

* lint

* modify legacy_c

* review

* modify

* modify legacy

* rm gen op_functions code

* reset framework

* rm core.ops for test

* core.ops->core.eager.ops.legacy

* not raiseError for xpu
```
  70ea88bf
- Z
  
  support generate static graph code for imag and real op (#49523) · 192eb4d5
  由 zyfncg 提交于 1月 05, 2023
  
  192eb4d5
- W
  
  [Inference] inplace all reshape op (#49146) · 017af746
  由 Wilber 提交于 1月 05, 2023
  
  017af746
- Y
  
  [Paddle Inference] add unitest for zero_copy_tensor with bool type (#49495) · 8705a79d
  由 Yuanle Liu 提交于 1月 05, 2023
  
  8705a79d
- W
  Refactor `ProcessGroup` to support comm context migration & clang compilation (#49451) · 1be70bc5
  由 Wen Sun 提交于 1月 05, 2023
```
* refactor: use base class

* fix: incorrect deps

* fix: add missing header

* refactor: update class structures

* fix: bkcl typo

* fix: remove redundant def
```
  1be70bc5
- T
  delivery skip_gc_vars attr to cinn subgraph (#49471) · 1221307b
  由 TeFeng Chen 提交于 1月 05, 2023
```
* delivery skip_gc_vars from the main graph to each subgraph compiled by CINN

* rearrange format and annotation

* fix lacking namespace

* fix segmentation fault cinn subgraph doesn't own kSkipGcVarNames

* deliver all skip_gc_vars of main graph

* add vlog for skip_gc_vars
```
  1221307b
- Y
  
  Add transpose_qkv_wb flags to the fused_attention_op. (#49494) · ec857b85
  由 Yuang Liu 提交于 1月 05, 2023
  
  ec857b85
- G
  
  Add to_hash func and paddle2arg map for cinn (#49402) · 1168a178
  由 GaoYuYang 提交于 1月 05, 2023
  
  1168a178
04 1月, 2023 7 次提交
- A
  
  [D2SCinn]Add build_cinn_pass in BuildStrategy (#49496) · 343bff7b
  由 Aurelius84 提交于 1月 04, 2023
  
  343bff7b
- Y
  
  update vlog output (#49541) · bbc6dd94
  由 Yuanle Liu 提交于 1月 04, 2023
  
  bbc6dd94
- W
  
  [Inference] Add conv_fusion nhwc impl. (#49047) · 4a8708bb
  由 Wilber 提交于 1月 04, 2023
  
  4a8708bb
- Y
  
  [Paddle Inference] fix mixed precision diff (#49475) · ac75a9a6
  由 Yuanle Liu 提交于 1月 04, 2023
  
  ac75a9a6
- S
  Revert "Replace matmul with matmul_v2 during oneDNN fuse passes (#49108)" (#49524) · 338cbeaa
  由 Sławomir Siwek 提交于 1月 04, 2023
```
This reverts commit 2c444dfa.
```
  338cbeaa
- H
  [Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f
  由 HongyuJia 提交于 1月 04, 2023
```
* execute use kernel_key first

* change OpKernelType->KernelKey

* fix py3 compile error, remove redundant header files

* fix build_strategy_test

* fix DataType::RAW

* fix custom_type test: operator_test.cc

* fix transform place

* fix backends_are_same_class

* try fix place TransDataDevice

* support all KernelKey

* fix TransformData

* fix place_are_same_class

* fix merge

* fix test_params_no_grad

* fix specific place of GetExpectedKernelType

* fix specific place of GetExpectedKernelType

* fix GetKernelTypeForVar

* fix dtype error

* fix fetch_v2

* change GetKernelTypeForVar

* fix interpreter

* fix typo error

* polish codes

* polish codes

* polish codes

* fix conflict
```
  4383494f
- L
  
  add multi_devices_fused_multi_transformer_encoder_pass and cherry-pick from 48349 (#49383) · 29eec2dd
  由 lzy 提交于 1月 04, 2023
  
  29eec2dd
03 1月, 2023 10 次提交
- W
  
  [code_style fix] graph_brpc_client cpplint (#49457) · a2d7e1d7
  由 wangzhen38 提交于 1月 03, 2023
  
  a2d7e1d7
- W
  [Dy2St]Fix param and out grad names in dy2st for high order grad (#49461) · f484a61e
  由 WangZhen 提交于 1月 03, 2023
```
* Fix param and out grad names in dy2st for high order grad
```
  f484a61e
- Y
  
  [Paddle Inference] enhance paddle_infer::Tensor data type (#49388) · dc13f7c5
  由 Yuanle Liu 提交于 1月 03, 2023
  
  dc13f7c5
- S
  Replace matmul with matmul_v2 during oneDNN fuse passes (#49108) · 2c444dfa
  由 Sławomir Siwek 提交于 1月 03, 2023
```
* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces
```
  2c444dfa
- K
  
  set Flag_control_flow_use_new_executor=true by default (#49447) · 0f9e2b17
  由 kangguangli 提交于 1月 03, 2023
  
  0f9e2b17
- Z
  [Paddle Inference] Implement conv2d_fusion NHWC format using cutlass (#47989) · c123dd1e
  由 zhoutianzi666 提交于 1月 03, 2023
```
* Implement conv2d_fusion NHWC format using CUTLASS
* Add unit testing for CUTLASS Conv in inference
* Add experimental API for CUTLASS.
```
  c123dd1e
- A
  [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op (#49472) · 5ac96468
  由 Aurelius84 提交于 1月 03, 2023
```
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op

* add GetExpectedKernelType
```
  5ac96468
- Z
  [Zero-Dim] reshape/reshape_/reverse 0D support (#49357) · 347d2123
  由 zhaoyingli 提交于 1月 03, 2023
```
* [Zero-Dim] reshape/reshape_/reverse 0D support

* rm comment

* change paddle.to_tensor to paddle.full

* fix docs

* update paddle.full
```
  347d2123
- Z
  
  forbid ops who have 1D intermediate tensor entering Paddle-TRT (#49378) · 021085e3
  由 zhoutianzi666 提交于 1月 03, 2023
  
  021085e3
- S
  
  Add not_equal trt converter (#49393) · 822ea0f9
  由 Sanbu 提交于 1月 03, 2023
  
  822ea0f9
02 1月, 2023 1 次提交
- H
  
  Scale Matmul Fuse pass rewritten (#49105) · 18c0a002
  由 Hulek 提交于 1月 02, 2023
  
  18c0a002
01 1月, 2023 1 次提交
- G
  
  memorty_optimize remove inplace op (#49431) · aa96ddc3
  由 gem5 提交于 1月 01, 2023
  
  aa96ddc3
30 12月, 2022 6 次提交

Z
Fix test_conv_bn_fuse_pass_cc on Windows System (#49446) · a4b4343f
由 zyfncg 提交于 12月 30, 2022
```
* fix test_conv_bn_fuse_pass_cc

* remove comment
```
a4b4343f
Z
[inference][trt] update Convolution to ConvolutionNd (#47653) · 6e5917e4
由 Zhang Jun 提交于 12月 30, 2022
```
* update conv to convNd

* trigger ci
```
6e5917e4

Support static graph code-gen for squeeze and unsqueeze op (#49430) · 23c1ac2c

由 zyfncg 提交于 12月 30, 2022

* support static graph code-gen for squeeze op

* generate static graph code of unsqueeze

* refine op name

* add extra output in op_compat

* remove debug log

23c1ac2c

H

fix possible bug (#49367) · 18f0ab86
由 HongyuJia 提交于 12月 30, 2022

18f0ab86

在文档中统一静态图模式与动态图模式的英文翻译 (#49170) · a186e60d

由 Sanbu 提交于 12月 30, 2022

* 1219

* temporarily change the num_diff_files limit, test=document_fix

* Revert "temporarily change the num_diff_files limit, test=document_fix"

This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20.

* for codestyle

* remove duplicate license

* `static mode` -> `static graph mode`

* Update hybrid_parallel_inference.py

* Update layer_function_generator.py

* Update manipulation.py

* reset
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

a186e60d

W
Fix default GetExpectedKernelType for ops supported tensor attrs (#49414) · 8a859554
由 WangZhen 提交于 12月 30, 2022
```
* Fix default GetExpectedKernelType for ops supported tensor attrs
```
8a859554

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功