Commits · e0ee7403e77c5ee0a50359b12ef989a1ed8a2e7d · PaddlePaddle / Paddle

06 Jan, 2023 · 4 commits

fix bug (#49546) · e0ee7403
Committed by Thomas Young on Jan 06, 2023

Expansions of some unmaintained pr (#49551) · 419c2d14
Committed by 张春乔 on Jan 06, 2023

Fix inaccurate return of low precision op list (#49391) · a214e5dc
Committed by niuliling123 on Jan 06, 2023

[Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074

Committed by Yulong Ao on Jan 06, 2023

* [Auto Parallel] Rename methods of ProcessMesh

* [Auto Parallel] Impl the python process_mesh by the c++ one

* [Auto Parallel] Add some minor modifications

* [Auto Parallel] Rename some methods

* [Auto Parallel] Remove unnecessary codes

* [Auto Parallel] Add back some removed files

* [Auto Parallel] Fix bugs

* [Auto Parallel] Fix a bug

* Update process_mesh.cc

* [Auto Parallel] Merge dist attrs of Python into C++

* [Auto Parallel] Add back deleted importing

* [Auto Parallel] Add back removed unittest

* [Auto Parallel] Remove type qualifiers of return types

* [Auto Parallel] Fix some bugs

* [Auto Parallel] Fix a bug of the quant pass

* [Auto Parallel] Fix the code style

05 Jan, 2023 · 18 commits
- sequence_mask fix: when the input length is an empty tensor, the kernel tries... · 0f3ccd14
  Committed by Feiyu Chan on Jan 05, 2023
```
sequence_mask fix: when the input length is an empty tensor, the kernel tries to dereference illegal sentinel iterator (#49525)
```
- Support 0D for paddle.sort/argsort (#49501) · 032da731
  Committed by Siming Dai on Jan 05, 2023
```
* support 0D for paddle.sort/argsort

* support 0D tensor for paddle.sort/argsort in xpu

* fix bug

* fix grad and add value assertion
```
- [Paddle Inference] Add ci flags for a persistent IBuilder. (#49538) · fcd6d675
  Committed by xiaoxiaohehe001 on Jan 05, 2023
- Generate the static graph code of ops (#49413) · 39f0eb2c
  Committed by HappyHeavyRain on Jan 05, 2023
```
* generate the static graph code of ops

* modify the isclose comment

* modify the clip comment in nn.py

* reset nn.py
```
- [inference][trt]Upgrade expand cast nearestinterp for sd (#48998) · 5defefd6
  Committed by Zhang Jun on Jan 05, 2023
```
* update nearest_interp, expand_v2, cast for stable diffusion

* update nearest_interp, expand_v2, cast for stable diffusion

* correct shape rank

* Update expand_v2_op.cc
```
- CINN add fetch op for skip gc vars (#49553) · c1ce54bf
  Committed by jiangcheng on Jan 05, 2023
```
* CINN add fetch op for skip gc vars

* perfect test annotation

* break if not is_only_used_internal

* move skip_gc_var_names get out of for loop
```
- [BugFix] Fix illegal memory overflow for p_norm op (#49537) · ba1dce0a
  Committed by Zhong Hui on Jan 05, 2023
- Adjust OP scheduling order for standalone executor (#49561) · e8f4a327
  Committed by Ruibiao Chen on Jan 05, 2023
- Yj/rm core ops exp (#49490) · 70ea88bf
  Committed by 姜永久 on Jan 05, 2023
```
* rm op_function_generator

* rm op_func_generator.h

* rm op_function

* modify cmake

* rm op_function.h

* rm check for op_function_generator.cc

* reset imperative

* rm python part

* fix imperative

* lint

* lint

* modify legacy_c

* review

* modify

* modify legacy

* rm gen op_functions code

* reset framework

* rm core.ops for test

* core.ops->core.eager.ops.legacy

* not raiseError for xpu
```
- support generate static graph code for imag and real op (#49523) · 192eb4d5
  Committed by zyfncg on Jan 05, 2023
- [Inference] inplace all reshape op (#49146) · 017af746
  Committed by Wilber on Jan 05, 2023
- [Paddle Inference] add unitest for zero_copy_tensor with bool type (#49495) · 8705a79d
  Committed by Yuanle Liu on Jan 05, 2023
- Refactor `ProcessGroup` to support comm context migration & clang compilation (#49451) · 1be70bc5
  Committed by Wen Sun on Jan 05, 2023
```
* refactor: use base class

* fix: incorrect deps

* fix: add missing header

* refactor: update class structures

* fix: bkcl typo

* fix: remove redundant def
```
- delivery skip_gc_vars attr to cinn subgraph (#49471) · 1221307b
  Committed by TeFeng Chen on Jan 05, 2023
```
* delivery skip_gc_vars from the main graph to each subgraph compiled by CINN

* rearrange format and annotation

* fix lacking namespace

* fix segmentation fault cinn subgraph doesn't own kSkipGcVarNames

* deliver all skip_gc_vars of main graph

* add vlog for skip_gc_vars
```
- fix trace heap overflow (#49548) · 5feadc0b
  Committed by XiangGao on Jan 05, 2023
- Add transpose_qkv_wb flags to the fused_attention_op. (#49494) · ec857b85
  Committed by Yuang Liu on Jan 05, 2023
- Add to_hash func and paddle2arg map for cinn (#49402) · 1168a178
  Committed by GaoYuYang on Jan 05, 2023
- keep run_setup and cmake_gen_and_build same (#46957) · 1228bad0
  Committed by risemeup1 on Jan 05, 2023
```
* modify setup.py and paddle_build.sh

* modify setup.py and paddle_build.sh

* modify setup.py and paddle_build.sh

* modify setup.py

* modify run_setup

* modify setup.py

* fix make_clean

* modify setup.py

* modify setup.py

* delete setting python_libary

* debug

* debug

* debug

* debug
```
04 Jan, 2023 · 9 commits
- [D2SCinn]Add build_cinn_pass in BuildStrategy (#49496) · 343bff7b
  Committed by Aurelius84 on Jan 04, 2023
- update vlog output (#49541) · bbc6dd94
  Committed by Yuanle Liu on Jan 04, 2023
- Add the input check for softmax_with_cross_entropy (#49333) · f17b2de8
  Committed by Guanghua Yu on Jan 04, 2023
- [Inference] Add conv_fusion nhwc impl. (#49047) · 4a8708bb
  Committed by Wilber on Jan 04, 2023
- refine diagonal infermeta (#49520) · 852c8db3
  Committed by zhangbo9674 on Jan 04, 2023
- [Paddle Inference] fix mixed precision diff (#49475) · ac75a9a6
  Committed by Yuanle Liu on Jan 04, 2023
- Revert "Replace matmul with matmul_v2 during oneDNN fuse passes (#49108)" (#49524) · 338cbeaa
  Committed by Sławomir Siwek on Jan 04, 2023
```
This reverts commit 2c444dfa.
```
- [Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f
  Committed by HongyuJia on Jan 04, 2023
```
* execute use kernel_key first

* change OpKernelType->KernelKey

* fix py3 compile error, remove redundant header files

* fix build_strategy_test

* fix DataType::RAW

* fix custom_type test: operator_test.cc

* fix transform place

* fix backends_are_same_class

* try fix place TransDataDevice

* support all KernelKey

* fix TransformData

* fix place_are_same_class

* fix merge

* fix test_params_no_grad

* fix specific place of GetExpectedKernelType

* fix specific place of GetExpectedKernelType

* fix GetKernelTypeForVar

* fix dtype error

* fix fetch_v2

* change GetKernelTypeForVar

* fix interpreter

* fix typo error

* polish codes

* polish codes

* polish codes

* fix conflict
```
- add multi_devices_fused_multi_transformer_encoder_pass and cherry-pick from 48349 (#49383) · 29eec2dd
  Committed by lzy on Jan 04, 2023
03 Jan, 2023 · 9 commits
- [code_style fix] graph_brpc_client cpplint (#49457) · a2d7e1d7
  Committed by wangzhen38 on Jan 03, 2023
- H2D data transfer optimization for concat kernel (#49040) · 0de94cd9
  Committed by limingshu on Jan 03, 2023
- [Dy2St]Fix param and out grad names in dy2st for high order grad (#49461) · f484a61e
  Committed by WangZhen on Jan 03, 2023
```
* Fix param and out grad names in dy2st for high order grad
```
- [Paddle Inference] enhance paddle_infer::Tensor data type (#49388) · dc13f7c5
  Committed by Yuanle Liu on Jan 03, 2023
- Replace matmul with matmul_v2 during oneDNN fuse passes (#49108) · 2c444dfa
  Committed by Sławomir Siwek on Jan 03, 2023
```
* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces
```
- set Flag_control_flow_use_new_executor=true by default (#49447) · 0f9e2b17
  Committed by kangguangli on Jan 03, 2023
- [Paddle Inference] Implement conv2d_fusion NHWC format using cutlass (#47989) · c123dd1e
  Committed by zhoutianzi666 on Jan 03, 2023
```
* Implement conv2d_fusion NHWC format using CUTLASS
* Add unit testing for CUTLASS Conv in inference
* Add experimental API for CUTLASS.
```
- [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op (#49472) · 5ac96468
  Committed by Aurelius84 on Jan 03, 2023
```
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op

* add GetExpectedKernelType
```
- Use BroadcastKernel and ReduceKernel to optimize expand and expand_grad. (#49419) · c4604025
  Committed by Yiqun Liu on Jan 03, 2023
```
* Use BroadcastKernel and ReduceKernel to optimize expand and expand_grad.

* Correct the axis when there is only 1 input in BroadcastKernel.

* Add the calculate of output's shape.
```

PaddlePaddle / Paddle: last synced successfully over 1 year ago
