提交 · 3bedec8a8ead5bc3c193650969ac180153802c95 · PaddlePaddle / Paddle

28 7月, 2023 6 次提交

【Complex op】add complex support for sin, cos, tan, tanh (#55380) · 3bedec8a

由 Scotty 提交于 7月 28, 2023

* add complex dtype for tanh

* add test case

* support complex for sin, cos and tan

* support gpu

* fix error in cpu

* fix gpu error

* set check_prim to False only for complex type

3bedec8a

J
remove old dataloader & generator from quantilization (#55754) · e2e0d296
由 JYChen 提交于 7月 28, 2023
```
* remove old dataloader & generator from quantilization

* fix ut test_post_training_quantization_mnist
```
e2e0d296
Y

[bug fix] fix scatter 0d index grad error (#55738) · 3e2c6a56
由 Yuang Liu 提交于 7月 28, 2023

3e2c6a56

fix fused multihead matmul unitest (#55755) · 09a60477

由 caizejun 提交于 7月 28, 2023

* bugfix fused_multihead_matmul

* fix test case of fused multihead matmul

---------
Co-authored-by: Nbukejiyu <395822456@qq.com>

09a60477

New ir support fluid op (#55693) · b76c2f94

由 hong 提交于 7月 28, 2023

* new ir support save combine

* update

* polish code

* update

* new ir support fluid op

* remove depulicate op

* fix ir exe test compile error

* fix compile bug

* update

* code format

* update

* update

* polish code

b76c2f94

W
Fix test_resnet and test_resnet_v2 ut (#55723) · 9556beaf
由 WangZhen 提交于 7月 28, 2023
```
* Fix test_resnet and test_resnet_v2 ut

* Remove ut
```
9556beaf

27 7月, 2023 8 次提交

M
[Paddle-TRT] add flip op (#55688) · d608170a
由 ming1753 提交于 7月 27, 2023
```
* [Paddle-TRT] add flip op
```
d608170a
A
[NewIR]Split NewIRCompiler with .h/.cc and decoupling compilation with cinncore (#55733) · 4191f2c6
由 Aurelius84 提交于 7月 27, 2023
```
* [NewIR]Split NewIRCompiler with .h/.cc and decoupling compilatiom with cinncore

* fix cmake

* fix CINN_ONLY
```
4191f2c6
Z
add int32/int64 for outer/matmul Kernel. (#55584) · ff2142f2
由 zxcd 提交于 7月 27, 2023
```
* add int32/int64 for outer/matmul Kernel.

* fix by comment.

* fix by comment
```
ff2142f2
M
paddle-TRT support float64 (#55520) · 8b063030
由 ming1753 提交于 7月 27, 2023
```
* Paddle-TRT support float64  in/out type, support fill_any_like_op in int64
```
8b063030

[NewIR]Fix new ir dygraph 2 static concat grad bug (#55634) · 51ebcf68

由 hong 提交于 7月 27, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case`

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attrite print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish  code  and add python test

* add test

* fix test error

* relax constraint when inserting get_parameter

* add env flag

* fix bug

* dygraph2static support new ir

* fix bug

* revert test env

* change cc_test_old to cc_test

* update

* fix build_static bug

* update test

* fix type test error

* udpate cmake

* disable test in windows

* fix inference compile

* fix program translator error

* only run on cpu, not support gpu yet

* fix conflict

* polish code

* fix bug

* add feed with place op

* update

* remove useless unitest

* udpate mkldnn

* update

* update

* align mkldnn version

* new ir support builtin slice op

* fix bug

* fix phi kernel adaptor bug

* add enable static

* add enable_static

* remove useless test case

* change feed list to single variable

* update

* add feed with place and shaddow output op

* fix bug

* remove usless code

* support gpu

* fix bug

* fix bug

* remove template

* add more data type

* fix cimpile bug

* udpate

* remove useless code

* revert dygraph2st test

* remove usless code

* revert op

* fix bug

* remove instance norm

* fix concat grad bug

* revert code

---------
Co-authored-by: Nkangguangli <kangguangli@hotmail.com>

51ebcf68

H

skip inplace check for new ir (#55702) · b9ee7105
由 hong 提交于 7月 27, 2023

b9ee7105

【inplace api】batch add inplace api paddle.log_, paddle.i0_,... · 58a03d41

由 GGBond8488 提交于 7月 27, 2023

【inplace api】batch add inplace api paddle.log_, paddle.i0_, paddle.nn.functional.leaky_relu_... (#55576)

* batch add inplace api

* add inplace test

* add activation inplace

* fix test

* remove atan2 ge, gt, le, lt, nq

* remove atan2 ge, gt, le, lt, nq

* fix windows ci error

* rerun ci

* fix typro

* fix bugs

---------
Co-authored-by: Nzhangrui34 <v_zhangrui34@baidu.com>

58a03d41

A

[NewIR]Remove compatible logic of ProgramTranslator (#55453) · cbbd940e
由 Asthestarsfalll 提交于 7月 27, 2023

cbbd940e

26 7月, 2023 16 次提交
- H
  【CINN】Remove Remaining Old Schedule, Now We Completely Remove it. (#55566) · 011f97bc
  由 Huihuang Zheng 提交于 7月 26, 2023
```
Remove the remaining old schedules.
```
  011f97bc
- T
  
  add sin and cos optional parameters to fused_rope op (#55415) · 581d05bb
  由 tianhaodongbd 提交于 7月 26, 2023
  
  581d05bb
- H
  
  [0D-Tensor] Fix test_elementwise_max_op unittest of FP16 (#55683) · 18de0c94
  由 HongyuJia 提交于 7月 26, 2023
  
  18de0c94
- H
  
  [0D-Tensor] Fix test_elementwise_pow_op unittest (#55684) · aa1f4a44
  由 HongyuJia 提交于 7月 26, 2023
  
  aa1f4a44
- K
  
  add test case (#55664) · 563e873e
  由 kangguangli 提交于 7月 26, 2023
  
  563e873e
- J
  [Fluid Clean] remove module fluid.layers.control_flow (#55661) · a646e75f
  由 JYChen 提交于 7月 26, 2023
```
* remove api staticrnn

* move select_input/output to static/controw flow

* delete some func, only remain Switch

* clean fluid.layers.controw_flow

* remove fluid.layers.controlflow

* fix conditional_block ut
```
  a646e75f
- K
  [NewIR]support attribute hook and fix_reduce_all (#55553) · 7470889f
  由 kangguangli 提交于 7月 26, 2023
```
* support attribute hook and fix_reduce_all

* resolve merge conflicts

* fix coverage ci

* trigger CI

* trigger CI

* fix coverage ci
```
  7470889f
- D
  
  Add FP16 & BF16 for lamb (#55641) · 84a56b4a
  由 Difer 提交于 7月 26, 2023
  
  84a56b4a
- [BUG] fix bug of float/int/long/index Tensor (#55568) · a4644c50
  由 zhouweiwei2014 提交于 7月 26, 2023
  
  a4644c50
- Y
  [New IR]Bind core structrure (#55665) · ee506c2f
  由 YuanRisheng 提交于 7月 26, 2023
```
* bind ir core

* perfect code

* deal with conflict
```
  ee506c2f
- A
  [NewIR]Add ConvertIRType and fix some TODO for IR+CINN (#55691) · 2ade1f92
  由 Aurelius84 提交于 7月 26, 2023
```
* [NewIR]Add ConvertIRType and fix some TODO for IR+CINN

* modify into GPUPlace
```
  2ade1f92
- L
  [Reshard] Implement replicated to split with same placement (#55552) · 9f3b5f15
  由 LiYuRio 提交于 7月 26, 2023
```
* Implement replicated to split reshard function

* fix link error in clang

* refine split functor

* simplify reshard code
```
  9f3b5f15
- H
  [0D-Tensor] CINN supports `fill_constant`, fix infershape and pass (#55563) · f5830c05
  由 HongyuJia 提交于 7月 26, 2023
```
* [0D-Tensor] CINN supports fill_constant, fix infershape and pass

* fix infershape of fill_constant

* add back fill_constant to zero_tensor_trick_pass
```
  f5830c05
- H
  New ir support save combine (#55538) · a88d36aa
  由 hong 提交于 7月 26, 2023
```
* new ir support save combine

* update

* polish code
```
  a88d36aa
- G
  
  add modernize-redundant-void-arg check (#55652) · 12fb18dd
  由 gouzil 提交于 7月 26, 2023
  
  12fb18dd
- R
  
  [CustomDevice] fix SplitDenseTensor (#55615) · 6c675ed9
  由 ronnywang 提交于 7月 26, 2023
  
  6c675ed9
25 7月, 2023 7 次提交

Bugfix, fast layer norm, OOB (#55639) · 017a6164

由 Jeng Bai-Cheng 提交于 7月 25, 2023

* Fix LayerNormForward perf issue

* Bugfix, fast_layer_norm OOB

* apply pre-commit

---------
Co-authored-by: NShijie Wang <jaywan@nvidia.com>

017a6164

A

[NewIR]Support Instruction.Run in CINN for Runtime::Program (#55680) · f9e1b2d2
由 Aurelius84 提交于 7月 25, 2023

f9e1b2d2
傅

add all false bool indices support for index_put (#55655) · c737f0ae
由傅剑寒提交于 7月 25, 2023

c737f0ae

[NewIR]new ir dygraph to static supoort gpu (#55620) · fb9bec5d

由 hong 提交于 7月 25, 2023

* add kernel dialect

* change DenseTensorTypeStorage to DenseTensorType

* add test case`

* add first pd_op to kernel dialect

* lower pd op to kernel dialect

* update

* update

* remove useless code

* add attrite print test

* fix bug

* update

* update

* update

* update

* polish code

* fix bug

* polish  code  and add python test

* add test

* fix test error

* relax constraint when inserting get_parameter

* add env flag

* fix bug

* dygraph2static support new ir

* fix bug

* revert test env

* change cc_test_old to cc_test

* update

* fix build_static bug

* update test

* fix type test error

* udpate cmake

* disable test in windows

* fix inference compile

* fix program translator error

* only run on cpu, not support gpu yet

* fix conflict

* polish code

* fix bug

* add feed with place op

* update

* remove useless unitest

* udpate mkldnn

* update

* update

* align mkldnn version

* new ir support builtin slice op

* fix bug

* fix phi kernel adaptor bug

* add enable static

* add enable_static

* remove useless test case

* change feed list to single variable

* update

* add feed with place and shaddow output op

* fix bug

* remove usless code

* support gpu

* fix bug

* fix bug

* remove template

* add more data type

* fix cimpile bug

* udpate

* remove useless code

* revert dygraph2st test

* remove usless code

* revert op

* fix bug

* new ir dygraph2static support gpu

* remove usless code

* code polish

* add const

* revert code and remove useless code

* revert code

* revert legacy op yaml

* remove useless code

* delete std::move

---------
Co-authored-by: Nkangguangli <kangguangli@hotmail.com>

fb9bec5d

K
[BugFix] fix random fail of test_bilinear_interp_v2_op (#55643) · 98c7a3e0
由 kangguangli 提交于 7月 25, 2023
```
* fix random fail of test_bilinear_interp_v2_op

* reset if compiledProgram
```
98c7a3e0

解决 grad_fn next_functions api 接口导致内存异常的问题 - (#55627) · 03a2f187

由 qiuwenbo 提交于 7月 25, 2023

* [尝试] 给tensor增加一个属性, 这个属性是一个定值 1

* 暴露gradnode 并构建gradnode新的方法(用来测试)进行暴露给python python端可以访问

* 开发grad_fn、next_functions两个API 并暴露到python端- 做一些规范化处理

* 增加一个单元测试

* 优化 code-style

* 将单侧文件迁到正确的位置

* 优化 code-style

* 删除无用注释

* 解决 __main__ has no attribute

* 修改单侧文件

* 修改单侧脚本-temp

* 解决 grad_fn next_functions api 接口导致内存异常的问题

* 修改单测内容

* 解决 code-style 问题

03a2f187

H

[0D-Tensor] Fix test_elementwise_max_op unittest (#55674) · 05a40691
由 HongyuJia 提交于 7月 25, 2023

05a40691

24 7月, 2023 3 次提交
- J
  add IndexPutGradInfermeta to fix backward error in static-mode (#55602) · 76530a2a
  由 JYChen 提交于 7月 24, 2023
```
* add IndexPutGradInfermeta to fix backward error in static-mode

* codestyle
```
  76530a2a
- Y
  [Semi-Auto] Add transpose spmd rule (#55350) · f6161d1e
  由 Yichen Zhang 提交于 7月 24, 2023
```
* [Semi-Auto] Add transpose spmd rule

* add unit test in cmake file

* log perm info
```
  f6161d1e
- Y
  
  [sharding stage 1 optim] Sharding comm overlap with backward (#55598) · a9f877ff
  由 Yuang Liu 提交于 7月 24, 2023
  
  a9f877ff

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功