Commits · f4290a92653f8b8685958a90043585e59b83bf70 · PaddlePaddle / Paddle

11 Jul, 2023 1 commit
- Linear compress (#55128) · f4290a92
  Committed by FormlessUnit on Jul 11, 2023
```
* rename weight_only/llm.int8
```
03 Jul, 2023 1 commit
- add linear_compress API (#54140) · c4d5ec66
  Committed by FormlessUnit on Jul 03, 2023
```
* add linear_compress API
```
29 Jun, 2023 1 commit

Add fused_rope forward op (#54351) · a215c46a

Committed by niuliling123 on Jun 29, 2023

* style

* more

* update ctest

* Update legacy_backward.yaml

* Update legacy_ops.yaml

* Update legacy_ops.yaml

* update

* update

* update for move

28 Jun, 2023 1 commit
- [BugFix] Fix bug for binary_cross_entropy_with_logits loss (#54869) · bb42d870
  Committed by Siming Dai on Jun 28, 2023
```
* add pos_weight in kernel

* fix unittest

* fix xpu

* fix bce unittest, change infermeta order
```
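For readers unfamiliar with the `pos_weight` term named in this commit, the following is a minimal sketch of the textbook binary cross-entropy on raw logits with a positive-class weight. It is not the Paddle kernel; the function name `bce_with_logits_sketch` is hypothetical and the code handles a single scalar sample only.

```python
import math


def bce_with_logits_sketch(logit, label, pos_weight=1.0):
    """Numerically stable BCE on one raw logit (illustrative only).

    Mathematically equal to
        -(pos_weight * label * log(sigmoid(logit))
          + (1 - label) * log(1 - sigmoid(logit)))
    """
    # log(1 + exp(-logit)), computed without overflow for large |logit|
    softplus_neg = max(-logit, 0.0) + math.log1p(math.exp(-abs(logit)))
    # Weight on the log-sigmoid term: 1 for negatives, pos_weight for positives
    log_weight = 1.0 + (pos_weight - 1.0) * label
    return (1.0 - label) * logit + log_weight * softplus_neg


# A positive sample at logit 0 costs log(2); pos_weight scales that cost.
print(bce_with_logits_sketch(0.0, 1.0))
print(bce_with_logits_sketch(0.0, 1.0, pos_weight=3.0))
```

With `pos_weight` greater than 1, errors on positive samples are penalised more heavily, which is the usual reason for exposing the parameter on imbalanced data.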
26 Jun, 2023 1 commit

remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85

Committed by Sonder on Jun 26, 2023

* remove ops from OpsWithFluidKernelNeedMoveToPhi set

* open static build flag

* OpsWithFluidKernelNeedMoveToPhi

* open new_executor_static_build

* add infermate for cudnn_lstm

* fix

* update

* fix

* update

* update

* update

* fix pow2 decay

* fix pow2 decay

* recover analysis_predictor.cc

* fix pow2 decay

* fix cudnn lstm

* add output register info for svd

* fix pow2_decay_with_linear_warmup_kernel

* recover test lstm cudnn

* recover svg register codes

* fix register info

* fix reduce sum register info

* add output info for adadelta

* add output info for adadelta

* add output info for adamax

* fix complex abs register info

* add register info for cudnn_lstm_grad

* recover

* fix lstm cudnn

* fix

* fix xpu output registe info

* remove std::cout

* add backend

* remove output info in pow2_decay_with_linear_warmup_kernel

* add judgment in TensorShouldBeFakeInitialized

* recover power_

* close new_executor_static_build

* fix set_value_xpu

16 Jun, 2023 1 commit
- fix lamb optimizer always_adapt (#54654) · 2a56f4b3
  Committed by zhiboniu on Jun 16, 2023
```
* fix lamb always_adapt

* fix optest

* fix all optests
```
01 Jun, 2023 1 commit

Support static graph code generation for conv2d, conv3d, depthwise_conv2d (#54201) · f3eccb3f

Committed by huangjiyi on Jun 01, 2023

* update

* update cmake

* update

* update

* update

* update

* Revert "update cmake"

This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4.

* update

* update

* update

* update

23 May, 2023 1 commit
- move fusion_group infershape to phi (#53934) · 3dc99088
  Committed by huangjiyi on May 23, 2023
```
* update

* update

* update

* set out dtype
```
10 May, 2023 1 commit

add index_put api (#52886) · f3393f49

Committed by 傅剑寒 on May 10, 2023

* add index_put api

* fix value broadcast in backward and add test case in static

* add timeout=120s for index_put

* add op_compat for index_put

* add inplace index_put test

* add test case when index tensor in indices is int32 when indices.size less than x.dims

* add index_put api backward in cpu place

* add backward test case

* refactor code to delete some duplicated code

* replace reshape with resize for decrease extra memcpy

* add datatype flag in backward yaml

* fix bug in documentation

* Update python/paddle/tensor/manipulation.py

---------
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>

24 Apr, 2023 1 commit
- Add weighted sample (#52013) · 6a8d98e0
  Committed by Siming Dai on Apr 24, 2023
```
Add paddle.geometric.weighted_sample_neighbors API
```
19 Apr, 2023 1 commit
- fix graph_reindex (#52930) · e5506be6
  Committed by zhangyuqin1998 on Apr 19, 2023
```
* fix graph_reindex

* fix

* Update op_compat.yaml
```
11 Apr, 2023 2 commits
- delete remote_prefetch (#52748) · 3951c40d
  Committed by zhangyuqin1998 on Apr 11, 2023
- [BUG Fixs] adadelta lr support (#49732) · 23032590
  Committed by wangzhen38 on Apr 11, 2023
04 Apr, 2023 1 commit
- rename_bilinear_tensor_product (#52375) · 34069c46
  Committed by zhangyuqin1998 on Apr 04, 2023
```
* rename_bilinear_tensor_product

* fix
```
27 Mar, 2023 1 commit
- edit formate of mea (#52147) · 13baef48
  Committed by ZhangDY-6483 on Mar 27, 2023
24 Mar, 2023 1 commit

Memory Efficient Attention (#51867) · e5ad3859

Committed by ZhangDY-6483 on Mar 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: liuyuang <liuyuang@baidu.com>

22 Mar, 2023 1 commit

Add fused_linear_param_grad_add_kernel (#51805) · f59c5d8b

Committed by sneaxiy on Mar 22, 2023

* add fused_linear_param_grad_add_kernel

* fix compile error

* remove flag

* fix ci compile error

* fix ci compile error

* revert pylayer revision

* fix ci ut

* improve performance

08 Mar, 2023 1 commit
- Add mult_precision param for adamax op (#49705) · 151ec311
  Committed by niuliling123 on Mar 08, 2023
06 Mar, 2023 1 commit
- Add multiprecision for adadelta op (#50131) · a8a2b7f4
  Committed by niuliling123 on Mar 06, 2023
03 Mar, 2023 1 commit
- Add multi_precision for adagrad op (#50078) · 4779c2c1
  Committed by niuliling123 on Mar 03, 2023
01 Mar, 2023 1 commit
- Add multiprecision for rms op (#50132) · 48060b2e
  Committed by niuliling123 on Mar 01, 2023
17 Feb, 2023 1 commit

Rename MultiTensorAdam To FusedAdam (#50449) · e6af9bd2

Committed by yuehuayingxueluo on Feb 17, 2023

* rename multi_tensor_adam to fused_adam

* fix some bugs

* fix CI coverage

* rename test_fused_adam.py

* fix some bug

* add test_fused_adam_op.py

* fix some bugs

* fix fused_adam_op.cc

* fix CI bugs

* fix CI bug

* fix CI bug

16 Feb, 2023 1 commit
- Add logspace yaml (#49194) · c284d42a
  Committed by Chen Weihang on Feb 16, 2023
```
* add logspace yaml

* update by comments

* resolve test framework conflicct
```
09 Feb, 2023 1 commit

Add MultiTenosrAdam OP (#49220) · 10654c77

Committed by yuehuayingxueluo on Feb 09, 2023

* add multi_tenosr_adam

* update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py

* fix adam.py optimizer.py

* fix adamw.py

* fix test_multi_tensor_adam.py

* fix CI bug

* fix CI coverage

* fix ci bug

* fix betapow

* fix some bugs

* fix test_adamw_op.py

* fix CI coverage

* fix multi_tensor_adam_kernel.cc

* fix CI bug

* fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py

* fix code style

* update C++ parts

* remove python parts modification temporarily

* add C++ ut

* update betapow copy code logic

* fix ci ut

* fix windows ci

* fix coverage ci

* improve coverage rate

---------
Co-authored-by: sneaxiy <sneaxiy@126.com>

23 Dec, 2022 1 commit
- add rnn-t loss and api (#49199) · c088f9ec
  Committed by Hui Zhang on Dec 23, 2022
```
* add warp transducer code
```
22 Dec, 2022 1 commit
- [Paddle Inference] Add moe phi kernel (#48703) · def2a87f
  Committed by xiaoxiaohehe001 on Dec 22, 2022
09 Dec, 2022 1 commit
- move share_buffer kernel to phi (#48858) · c2e77ba3
  Committed by Leo Chen on Dec 09, 2022
```
* move share_buffer kernel to phi

* fix ut

* add source file

* fix window links
```
17 Nov, 2022 1 commit
- [PHI]Standardise some C++ API (Part5) (#47860) · f3650201
  Committed by YuanRisheng on Nov 17, 2022
```
* standard api

* fix xpu bugs
```
02 Nov, 2022 1 commit
- [PHI]Standardise some C++ API (Part3) (#47532) · fe8c6796
  Committed by YuanRisheng on Nov 02, 2022
```
* Standardise batch norm

* standardize conv3d and depwise_conv2d

* fix ci bugs
```
01 Nov, 2022 1 commit
- [PHI]Standardise some C++ API (Part2) (#47510) · 399047d7
  Committed by YuanRisheng on Nov 01, 2022
```
* standard_api

* add hardtanh
```
31 Oct, 2022 1 commit
- [PHI]Standardise some C++ API (#47385) · 60e0c506
  Committed by YuanRisheng on Oct 31, 2022
```
* standard api

* fix ci bugs

* fix ci bugs

* fix ce bugs
```
12 Oct, 2022 1 commit
- [Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba
  Committed by zhouweiwei2014 on Oct 12, 2022
```
* [Zero-Dim] support input 0D Tensor for unary api

* fix CI
```
19 Sep, 2022 1 commit

[PHI]Move sum op to PHI (#45860) · 4b3f2af1

Committed by YuanRisheng on Sep 19, 2022

* move sum

* fix ci bugs

* fix ci bugs

* fix set_lod bugs

* fix infershape bugs

* fix ci bugs

* fix ci unittest bug

* fix ci bugs

* perfect code

* update code according comment

* add unittest

* fix ci bugs

09 Sep, 2022 1 commit

[new-exe] convert fused_all_reduce_op_handle to program (#45774) · e755c07e

Committed by Leo Chen on Sep 09, 2022

* add operator<< for BuildStrategy

* add fake_coalesce

* fit allreduce mode for new_exe

* remove dubeg code

* follow comments

07 Sep, 2022 1 commit

[InferMeta] add compile-time infermeta logic for stack infermeta. (#45528) · 5a4ceb32

Committed by xiongkun on Sep 07, 2022

* add compile-time infermeta logic for stack infermeta.

* add unittest for stack infermeta where -1 exists in shapes.

* remove backward changes.

30 Aug, 2022 1 commit

[phi] Transfer coalesce_tensor to phi (#45478) · cf9d651b

Committed by HongyuJia on Aug 30, 2022

* add coalesce_tensor kernel

* polist coalesce_tensor kernel

* add sig and InferMeta

* add testcase

* add legacy_api.yaml

* fix infermeta

* fix yaml

* fix kernel implementation

* add compile dependency of phi/kernels

* fix MetaConfig

* add python api

* add and fix testcase

* rnn.py add import

* change _C_ops.coalesce_tensor

* remove useless comments

* add SetBackend

* restore XPU kernel temporarily

* fix code according to PR comments

16 Aug, 2022 2 commits

[Phi] Move amp ops into phi (#45079) · b4f67757

Committed by Chen Weihang on Aug 16, 2022

* move check finite and unscale kernel into phi

* move infershape into phi

* move update_loss_scaling kernel into phi

* remove original kernels

* move update loss scaling infershape into phi

* add header for xpu and npu

* solve coverage failed

* fix npu test failed

* remove mutable data in cu file

* fix new executor failed

* add valid check for meta tensor output

[geometric]Add paddle.geometric.send_uv API (#44848) · 88724a53

Committed by Siming Dai on Aug 16, 2022

* initial commit

* fix op maker bug

* fix mul grad bug

* add unittest

* fix add grad bug, add cpu kernel

* add paddle.geometric.message_passing

* add paddle.geometric.send_uv api, add unittest

* add fp16 judgement

* fix file typo, move compute_type to message_op

* add impl file

* fix unittest timeout time

* add review revise

12 Aug, 2022 1 commit

[geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3

Committed by Siming Dai on Aug 12, 2022

* add init file

* add op definition and infermeta

* add kernel definition funcs

* add broadcast infer shape

* add gpu forward kernel

* delete SUB and DIV

* add x_grad

* add template

* add e_grad for min and max

* fix small bug

* temp commit

* temp commit

* add e_grad for sum and mean

* fix some compile bug

* fix compile bugs

* fix compile problem

* add sum forward unittest

* fix broadcast error, add kernel sig, register e_grad, change unit test

* fix grad

* add temp grad fix

* temp commit

* add min max unittest

* add max, min unittest, fix mul bug

* add cpu forward sum and mean

* add forward min max, fix mean unittest

* add cpu backward min max

* fix code-style

* add backward sum mean

* fix rocm ci

* set uniitest timeout

* fix bug of x broadcast to e, gpu grad

* fix bug of x broadcast to e, cpu grad

* rename BOOST_GET_CONST macro

* fix rocm ci

* mv graph_send_e_recv to graph_send_ue_recv

* move out_size to IntArray

* add eager op test

* fix max pool type bug, add unittest for api

* revise api doc

* add fp16 for atomic min and max, add unittest

* add unittest

* add fp16 support for graph_send_recv

* fix unittest fp16 bug

* change OutSizeTensor to Out_size

* move E to Y

* add copyright, fix comment

* review code

* fix thread block size

* fix thread block size

* change api attribute name: pool_type to reduce_op, compute_type to message_op

* change api attribute name, move pool_type to reduce_op, move compute_type to message_op
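
The `message_op` and `reduce_op` attributes named in the last two items can be illustrated with a small pure-Python sketch of graph message passing: each edge combines its source node feature with an edge feature, and each destination node reduces the messages it receives. This is an illustration of the general idea only, not the Paddle kernel; the function name `send_ue_recv_sketch`, the scalar features, and the choice to return 0 for nodes that receive no message are assumptions made for this example.

```python
def send_ue_recv_sketch(x, y, src_index, dst_index,
                        message_op="add", reduce_op="sum"):
    """Illustrative semantics on plain lists: one scalar per node and per edge."""
    combine = {
        "add": lambda a, b: a + b,
        "mul": lambda a, b: a * b,
    }[message_op]
    reducers = {
        "sum": sum,
        "mean": lambda values: sum(values) / len(values),
        "min": min,
        "max": max,
    }
    reduce_fn = reducers[reduce_op]

    # One message per edge: source-node feature combined with the edge feature.
    messages = [combine(x[s], e) for s, e in zip(src_index, y)]

    # Group the messages by destination node.
    inbox = [[] for _ in x]
    for d, m in zip(dst_index, messages):
        inbox[d].append(m)

    # Assumption of this sketch: nodes with an empty inbox get 0.
    return [reduce_fn(received) if received else 0 for received in inbox]


x = [1.0, 2.0, 3.0]          # node features
y = [1.0, 1.0, 1.0, 1.0]     # edge features
src = [0, 1, 2, 0]
dst = [1, 2, 1, 0]
print(send_ue_recv_sketch(x, y, src, dst, "add", "sum"))
```

Splitting the operation into a per-edge `message_op` and a per-node `reduce_op` keeps the two choices independent, which matches the renaming from `compute_type` and `pool_type` described in the items above.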

08 Aug, 2022 1 commit
- move lamb_op to phi (#44899) · 4a7aa7c3
  Committed by Thomas Young on Aug 08, 2022
