提交 · ec77defccfcf0799e4f5da195add18d321d06bcd · PaddlePaddle / Paddle

25 4月, 2023 1 次提交
- N
  [Cherry-pick] Add enable_tensor_checker and disable_tensor_checker to api list (#52936) (#53287) · ec77defc
  由 niuliling123 提交于 4月 25, 2023
```
新增enable_tensor_checker, disable_tensor_checker API (#52936)
```
  ec77defc
24 4月, 2023 2 次提交
- J
  Revert "Cherry pick getitem/setitem 0d (#53125)" (#53265) · 50f61213
  由 JYChen 提交于 4月 24, 2023
```
This reverts commit a79c04f3.
```
  50f61213
- N
  [cherry-pick] Add debugging api and python stack (#53217) · 1e7efd81
  由 niuliling123 提交于 4月 24, 2023
```
Print the forward's stack when backward op has nan/inf and FLAGS_check_nan_inf_level = 0
Delete temp param in eager_gen
```
  1e7efd81
23 4月, 2023 2 次提交

Cherry pick getitem/setitem 0d (#53125) · a79c04f3

由 JYChen 提交于 4月 23, 2023

* support 0-D output and 0-D as indice in __getitem__

* fix tests

* fix inference and UT

* add unittest for setitem

* fix xpu test

* fix xpu 0-d

a79c04f3

Fix bug of block desc. (#53163) (#53176) · 7adecf40

由 Ghost Screaming 提交于 4月 23, 2023

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Remove climits.

* Fix bug of BlockDesc::MoveFrom(). It's used to rebuild main_program_desc from ProgramDesc modified by Fusion Pass. As some fused operators need to create new Variables in modified ProgramDesc, MoveFrom function uses std::move() function to move these VarDesc to main_program_desc. As a result, their pointers holded by modified ProgramDesc become nullptr. When call block()->Program()->proto() function, it will call ProgramDesc::Flush() function at first, which may cause a segmentation fault.

7adecf40

20 4月, 2023 3 次提交
- C
  
  Fix open missing mode on jetson (#53069) · 02f44fcc
  由 chalsliu 提交于 4月 20, 2023
  
  02f44fcc
- Y
  [cherry-pick] remove c++14 assert and remove include tensor.h in phi (#53071) · 356ba7e3
  由 Yuanle Liu 提交于 4月 20, 2023
```
* remove c++14 assert and remove include tensor.h in phi

* update

* remove delete_cast_op_pass
```
  356ba7e3
- R
  [CustomDevice] add c_identity op (#52982) (#53013) · d131e679
  由 ronnywang 提交于 4月 20, 2023
```
* [CustomDevice] add c_identity op

* fix use calc stream
```
  d131e679
17 4月, 2023 10 次提交

Y
[PHI]Unify fluid kernel (Part4) (#52626) · 1b5eba8a
由 YuanRisheng 提交于 4月 17, 2023
```
* unify kernel

* fix ci bugs

* fix py3 bugs

* fix py3 bugs

* perfect code
```
1b5eba8a
L
【fix bug】Fix bug in parse args with '{,}' (#52968) · be04f258
由 lzydev 提交于 4月 17, 2023
```
* fix bug in parse args

* fix bug

* recover legacy_*.yaml

* change 'Out' to Output
```
be04f258
L

add autogen code support for uniform_inplace (#52955) · b9830634
由 LoneRanger 提交于 4月 17, 2023

b9830634
G

remove some [-Wunused-paramter] warning (#52924) · 337cc2ca
由 Galaxy1458 提交于 4月 17, 2023

337cc2ca

[CINN] fix concat (#52341) · 31fc763a

由 wangzhen38 提交于 4月 17, 2023

* [CINN] fix concat&pow

* update concat

* composite_backward_api

* for ci

* for ci

* update test & fix opmaker

31fc763a

J

Support trt engine auto build in runtime for dynamic shape (#52162) · ebc58548
由 JingZhuangzhuang 提交于 4月 17, 2023

ebc58548
张

remove hccl in some .cc files (#52942) · 514d83de
由张春乔提交于 4月 17, 2023

514d83de

Add output defs for some kernelsPhi register (#52941) · 23f87442

由 Sonder 提交于 4月 17, 2023

* add register info for eigh and eig_gard

* add sync_batch_norm_op.cu register info

* add lamb output register info

* add unique register info

* change type name

* change type name

* add output register info for check_finite_and_unscale

* update cmake and config file

* add register info for adagrad

* fix build error

* add sync to run_unittests.sh

* add register info for unique_consecutive

* fix build error

* add eigh to STATIC_BUILD_TESTS

* update eig_kernel.cc

* update eig_kernel.cc

* fix infer mate error

* fix unique register error

* fix lamb register info error

* fix lamb register info

* update lamb register info

* fix lamb

* remove one Output Register

* update static build file

* add eigh op to disable_wingpu_test

* update run_unittests

23f87442

Z
[AMP OP&Test] Sync_batch_norm support bfloat16 (#52921) · 1080d4fc
由 Zhang Zheng 提交于 4月 17, 2023
```
* [AMP OP&Test] Sync_batch_norm support bfloat16

* fix

* fix
```
1080d4fc
H

[Dygraph] Support delaying div loss by accumulate_steps in PipelineLayer (#52848) · 0abdcff6
由 Haohongxiang 提交于 4月 17, 2023

0abdcff6

15 4月, 2023 1 次提交
- H
  
  [Opt CustomOP] Optimize the perf and impl of custom grad operator (#52915) · 0afef498
  由 HongyuJia 提交于 4月 15, 2023
  
  0afef498
14 4月, 2023 14 次提交

J
delete SupportNPU(), SupportMLU() (#52911) · 8601859e
由 jjyaoao 提交于 4月 14, 2023
```
* delete SupportNPU(), SupportMLU()

* delete npu branch
```
8601859e
U

[Dcu]: Add rocsparse_spmm for dcu. (#52200) · 281ea2f4
由 umiswing 提交于 4月 14, 2023

281ea2f4

[Zero-Dim] support 0-D tensor for... · 6f41e177

由 YangQun 提交于 4月 14, 2023

[Zero-Dim] support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion onednn kernels (#52185)

* support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion ops

* fix gaussian random mkldnn op ut

6f41e177

[Decouple enforce.h] Move LOG from enforce.h to enforce.cc (#52883) · b33f95b0

由 HongyuJia 提交于 4月 14, 2023

* [Decouple enforce.h] Move LOG from enforce.h to enforce.cc

* update cmake of device_context.cc, solve cuda_device_context_allocator.h compile error

* add namespace inside macro

b33f95b0

1. modify set_value op, use Scalars to represent attr `values`, instead of a... · dd2a749a

由 Feiyu Chan 提交于 4月 14, 2023

1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributs of various types; (#52408)

2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats(the differences are mainly some operators with incompatible updates in the definition);
3. program version and operator version map now are always saved when serializing `paddle::framework::ProgramDesc` to identify the version;
3. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, it decided whether to convert ProgramDesc back to a legacy format, which is compatible for paddle 2.4.2 or earlier versions to load and execute;
4. deserialization of `paddle::framework::ProgramDesc` is now automatically detecting whether the bytes it receives is in legacy format(contains any of the operators that has been incompatibly updated and have any attribute of type `Scalar`) and convert it to new format. But if you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though it is not recommended, it can be used for the purpose of testing.

dd2a749a

[phi] move sequence_pool to phi - Step 2 : sequence_pool_op (#52750) · b281b221

由 gouzil 提交于 4月 14, 2023

* [phi] move sequence_pool kernel to phi

* [phi] mv sequence_pooling to phi funcs

* [phi] mv sequence_pooling_test

* [phi] RollBACK `paddle/fluid/operators/sequence_ops/sequence_pool_op.cc`

* [phi][funcs] fix mutable_data

* [phi][funcs] fix mutable_data

b281b221

Move fused_attention op to phi [迁移反向 GPU OpKernel] (#51909) · 3bac6264

由 Sonder 提交于 4月 14, 2023

* add kernel functions

* update kernel functions

* update func parameters' name

* create codes for gpu device

* 调整文件位置

* fix include error

* remove dependent files to phi/

* restore fused_attention_op.cu

* fix dependence errors

* fix dependence errors

* fix include error

* fix all depandence errors[build success]

* remove useless include

* recover useless include

* use phi::ToNCCLDataType

* fix namespace

* update new register code

* fix error in fused_gemm_epilogue_utils

* fix error in FusedAttentionKernel parm

* finish fused_attention registe code[build success]

* add paddle::optional

* add sig file

* fix build error

* fix a include error

* 恢复正向代码

* update CMkaeList

* trans Compute function to phi [build success]

* add register code and fix include error [build success]

* fix parameter sequence

* add include file

* update #if before include

* update #if before include

* fix grammly error

* update codes for DropoutParam

* remove const cast

* trans some fluid api to phi api

* remove const cast

* trans some fluid api to phi api

* add #if

* update test code

* update test codes

* recover test codes

* fix namespace and remove fluid include

* recover random seed

* remove fluid quant_helper

* fix include error

* include utils in funcs

* change include file

* move grad codes back to fluid floder

* move grad codes back to fluid floder

* fix sig file error

* update include

* recover codes to develop

* update register codes

* fix build error

* recover fluid include

* remove some fluid include

* remove some fluid include

* Update fused_attention_op.cu

* remove fluid include

* add some fluid include

* Update fused_attention_op.cu

* Update fused_attention_op.cu

* Update fused_attention_op.cu

* Update fused_attention_op.cu

* remote useless include

3bac6264

【Prim】Add more infer var type (#52818) · 630d14f5

由 Jiabin Yang 提交于 4月 14, 2023

* add more infer var type

* fix split error

* fix ut

* fix top_k infer vartype

* fix top_k infer vartype

630d14f5

Z

delete cast if lookup_table_v2 support fp16; delete repeated ops (#52888) · 7aafeb45
由 zhupengyang 提交于 4月 14, 2023

7aafeb45
K

rem cncl (#52434) · 25bd5ed8
由 Kim Yann 提交于 4月 14, 2023

25bd5ed8
R

[CustomDevice] add model parallel support for custom device (#52872) · f8d09011
由 ronnywang 提交于 4月 14, 2023

f8d09011
H

update (#52880) · 2f499713
由 huangjiyi 提交于 4月 14, 2023

2f499713
H

update (#52879) · b1bb7484
由 huangjiyi 提交于 4月 14, 2023

b1bb7484
H

update (#52878) · e93e8a3f
由 huangjiyi 提交于 4月 14, 2023

e93e8a3f

13 4月, 2023 7 次提交
- Y
  
  remove need cpp14 support (#52867) · d9c3abe6
  由 Yuanle Liu 提交于 4月 13, 2023
  
  d9c3abe6
- W
  [Paddle-Trt] Replace fc mul matmul matmul_v2 with matrix_multiply (#52222) · ef734e84
  由 Wangzheee 提交于 4月 13, 2023
```
* Paddle-Trt: Replace fc mul matmul matmul_v2 with matrix_multiply
```
  ef734e84
- J
  delete WITH_ASCEND_CL (#52825) · 4a374c60
  由 jjyaoao 提交于 4月 13, 2023
```
* delete WITH_ASCEND_CL

* delete NPU/ and WITH_MLU
```
  4a374c60
- Z
  [Paddle-TRT]fix bilinear_interp_v2 && some other bugs in trt 7011 (#52753) · dc8d6a1a
  由 zhoutianzi666 提交于 4月 13, 2023
```
* fix bilinear_interp_v2 && some other bugs in trt 7011

* add version check in test_trt_convert_bilinear_interp_v2.py
```
  dc8d6a1a
- G
  Fix ignore index of c_softmax_with_cross_entropy_op. (#52835) · 4341ebd9
  由 Ghost Screaming 提交于 4月 13, 2023
```
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Remove climits.

* Fix bug of c_softmax_with_cross_entropy_op. Support ignore_index is
negative number.
```
  4341ebd9
- C
  
  Fix delete_isolated_node_pass problem (#52856) · 0f2dc4ca
  由 csy0225 提交于 4月 13, 2023
  
  0f2dc4ca
- W
  
  refine force syncbn (#52860) · ea1c9b89
  由 wanghuancoder 提交于 4月 13, 2023
  
  ea1c9b89

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功