提交 · 50df0170c6f3500edcbe703f0bb32beeb9d267c8 · PaddlePaddle / Paddle

15 3月, 2023 5 次提交
- R
  
  support auto generate for nonzero (#51600) · 3734e89a
  由 RedContritio 提交于 3月 15, 2023
  
  3734e89a
- Move the "GetExpectedKernelType" into "get_expected_kernel_func.cc" (#51453) · f0db1f7e
  由 HappyHeavyRain 提交于 3月 15, 2023
```
* test_get_kernel

* add invoke signature

* change reduce_max

* change frobenius_norm

* reset reduce_max according to composite and change reduce_all

* fix the bug when Scalar(*)

* fix 'scalar when support_tensor'

* change code according to review

* change 'keep_signature' to 'manual_signature' and add some erro info
```
  f0db1f7e
- G
  
  add inplace sigmoid_ and multiply_ (#50267) · b3caa233
  由 Guoxia Wang 提交于 3月 15, 2023
  
  b3caa233
- Z
  Delete hardswish_raw op (#51634) · 3e636ec9
  由 zhangyuqin1998 提交于 3月 15, 2023
```
* Delete hardswish_raw op

* fix ut
```
  3e636ec9
- X
  【prim】 modify_yaml (#51436) · 870c0837
  由 xiaoguoguo626807 提交于 3月 15, 2023
```
* modify_yaml

* delete default param

* add output for matmul_double_grad
```
  870c0837
14 3月, 2023 2 次提交
- C
  add split and split_with_num composite rule (#51341) · bb9eb20f
  由 ccrrong 提交于 3月 14, 2023
```
* add split_with_num composite rule

* add split_with_num composite rule

* add split composite rule

* update

* update test

* update test

* delete split_with_num_grad
```
  bb9eb20f
- H
  
  [Tensor Operants & Prim-Relevant] Multiply operants replace by scale (#51469) · 2d0e8c3b
  由 HongyuJia 提交于 3月 14, 2023
  
  2d0e8c3b
13 3月, 2023 4 次提交

Add phi operator all_gather (#51420) · afa26a59
由 TaoTao Li 提交于 3月 13, 2023
```
* add all_gather and fix conflicts

* fix code format

* fix ut

* fix broadcast ut
```
afa26a59

由 heyanru 提交于 3月 13, 2023

* refresh

* compat

* register

* testop

* fix

* fix

* fox

* cast

* cast

* fix

* type

* fix

* out

* cast

* fix

* fix

* fix

* broad

* broad

* broad

* fix

* fix

* fix

* fix

* fix

* broad

* broad

* numel

* fix

* fix

* fix

* fix

* cinn

* fix

* fix

* fix

* fix

4a484973

Add from_blob api for constructing tensor from data pointer (#51085) · 74442f5e

由 Huang Jiyi 提交于 3月 13, 2023

* add from_blob

* fix test

* fix test

* fix codestyle

* add gpu test

* fix test

* update

* add comment

* fix comment

* update comment

* fix CI bug

* add thread_local

* update

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix cmake

* fix CI-Py3 make

* update

* use api_reg

* fix include

* update

* update

* update

* fix bug

* fix bug

* fix bug

* fix bug

74442f5e

Fused softplus (#51087) · fdcfa04f

由 Sławomir Siwek 提交于 3月 13, 2023

* mkldnn->onednn

* fused softplus op + kernel

* remove extra attributes

* add missing handler

* change var name

fdcfa04f

10 3月, 2023 2 次提交

[New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a

由 HappyHeavyRain 提交于 3月 10, 2023

* Add function node in phi_kernel for MKLDNN

* fix the bug in 'BuildInferVarKernelContext'

* add infer_varkernel_utils.cc

* fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable

* change the code according to first review

* change the code according to first review

* change the mode of paddle_build.sh

* change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'

* add the error information

* fix NotFound infomation warning

* fix NotFound infomation warning

* fix NotFound infomation warning

a0a6dc6a

C

add flashattn raw kernel (#51383) · f951832d
由 Chitsing KUI 提交于 3月 10, 2023

f951832d

09 3月, 2023 7 次提交

add prim erf grad (#50436) · b7e4d974

由 GGBond8488 提交于 3月 09, 2023

* add prim erf grad

* add yaml config for prim erf grad

* add math.h

* add cmath

* add math  defines

* use define math

* use define math

* define M_2_SQRTPI

* M_2_SQRTPI math

* try math.h

* fix typro

* remove pow in erf grad

* use new optest

* add fp16 fp32 test

* remove fp16 test

b7e4d974

W
Add softplus double grad (#50261) · 542844b4
由 will-jl944 提交于 3月 09, 2023
```
* add softplus double grad

* use constant method
```
542844b4

[PHI] Register custom kernel for all type of custom device (#51262) · 782454bd

由 zyfncg 提交于 3月 09, 2023

* register custom kernel for all type of custom device

* fix bug

* fix GetKernelInputArgDef

* fix amp bug

* fix TransToPhiPlace

* adapt interpreter_util

782454bd

add abs composite backward op (#50963) · d0d739ca

由 SylarTiaNII 提交于 3月 09, 2023

* add abs composite backward op

* add missing changes during merge

* modify according to new rules

* local UT OK

* fix typo

* codestyle

* register composite operator

* add fp16 test for abs

* replace experimenta::tensor

d0d739ca

Add comm context manager, add phi broadcast op (#51072) · c191b707

由 TaoTao Li 提交于 3月 09, 2023

* * add comm context for device context

* add broadcast phi operator kernel and api

* add broadcast support dtype, update ut

* fix broadcast bfloat16 type

* fix ut

* update test_collective_broadcast_api timeout to 300

c191b707

Z

delete axis of fmax (#51264) · 7d138402
由 zhangyuqin1998 提交于 3月 09, 2023

7d138402

[prim] add elementwise_pow backward (#51230) · d9de3ef6

由 wangzhen38 提交于 3月 09, 2023

* [cinn] add elementwise_pow backward

* [cinn] update unnitest

* [cinn] update by comments

* [cinn] for ci

* [cinn] for ci

* [cinn] for ci

* [cinn] for ci

* [cinn] for ci

d9de3ef6

08 3月, 2023 2 次提交
- M
  
  DLTP-66486:implement log_grad by primitive logic (#51296) · 30e0409c
  由 Meteor Liu 提交于 3月 08, 2023
  
  30e0409c
- N
  
  Add mult_precision param for adamax op (#49705) · 151ec311
  由 niuliling123 提交于 3月 08, 2023
  
  151ec311
07 3月, 2023 1 次提交
- C
  
  remove experimental namespace of Tensor (#51155) · 50ad760c
  由 Chen Weihang 提交于 3月 07, 2023
  
  50ad760c
06 3月, 2023 3 次提交

implement floor_grad by primitive logic (#51059) · 769e24ce

由 Meteor Liu 提交于 3月 06, 2023

* implement floor_grad by primitive logic

* implement floor_grad by primitive logic

* Merge branch 'develop' into floor_grad

769e24ce

N

Add multiprecision for adadelta op (#50131) · a8a2b7f4
由 niuliling123 提交于 3月 06, 2023

a8a2b7f4

[phi decoupling] decouple dependency to device_context in phi (Part 1) (#50865) · a1006b2b

由 Huang Jiyi 提交于 3月 06, 2023

* move DeviceContextPool to phi

* add EmplaceExternalContextFunc

* update namespace

* update cmake

* fix bugs and create context_pool_impl.h

* replace platform::is_xxx_place

* fix bugs

* update generator

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix enforce usage

* Revert "fix enforce usage"

This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27.

* fix bugs

* rm XPUDeviceContext and CustomDeviceContext

* fix bugs

* fix fix context init bug

* fix bugs after merge

* fix bugs

* fix name

* fix mutable_data

* update and fix bugs

* fix bugs

* update

* fix bugs

* fix name

* fix bugs

* merge

* fix bugs

* create context_pool in phi/backends

* create context_pool in phi/backends

* fix bugs

* fix xpu bugs

* fix rocm bugs

* fix bugs

* fix bugs

* fix bugs

* fix xpu bugs

* update

* update

* fix bugs

* fix bugs

a1006b2b

03 3月, 2023 2 次提交
- W
  add gather_nd_comp_grad composite rule (#50966) · 625e30b7
  由 wangxiaoning 提交于 3月 03, 2023
```
* comp gather_nd_grad

* fix

* test no cinn

* fix

* fix cinn
```
  625e30b7
- N
  
  Add multi_precision for adagrad op (#50078) · 4779c2c1
  由 niuliling123 提交于 3月 03, 2023
  
  4779c2c1
02 3月, 2023 1 次提交

Add concat grad cinn (#50972) · a4689c90

由 wangzhen38 提交于 3月 02, 2023

* [cinn] concat_grad

* [cinn] concat_grad

* [cinn] concat_grad build success

* [Add PGLBOX] fix unnitest

* [Add PGLBOX] fix unnitest

* [Add PGLBOX] fix codestyle

* [cinn] update by comments

* [cinn] update by comment

* [cinn] add axis check

a4689c90

01 3月, 2023 5 次提交

Integration flash attention (#49869) · 61611786

由 Chitsing KUI 提交于 3月 01, 2023

* flash attn

* seed

* almost

* softmax

* fix workspace

* add unitest; linux only

* fix setup

* fix datatype include

* fix setup typo

* fix def scope

* new error api

* use paddle fork

* fix attr bug; complete ut

* update flash hash

* fix rng reset

* fix offset

* fix comments

61611786

[Tensor Operants & Prim-Relevant] Tensor supports logical operants (#50983) · 1794927b

由 HongyuJia 提交于 3月 01, 2023

* Add comments for #50886

* [Tensor Operants & Prim-Relevant] Tensor supports logical operants

* add prim dynamic unit test

* add prim static unit test

1794927b

add topk prim backward (#50679) · 296b3ff0

由 zqw_1997 提交于 3月 01, 2023

* tmp gather vjp

* support gather

* remove useless code

* fix compiling error

* fix ut

* add eager test

* add eager test

* add seed

* small change

* fix cpu error

* fix transpose op compat

* remove tensor index case

* fix prim_cinn

* small commit

* add cumsum prim backward

* small commit

* skip aixs=None test case

* fix op generante eror

* fix static test error

* remove unused code

* fix static test error

* small commit

* skip cpu float16 test case

* skip eager cpu cumsum float16 test case

* add eager and static UT

* fix ut

* add composite backward rule

* fix error

* fix type error and format error

* add try cpu+float16 test

* fix test bugs

* remove test for cpu+float16 and make y[0] be the grad arg

* add cinn test

* fix UT

* fix the wrong dim of v in test cases

* change y[0] to y[1] for grad in UT

* reshape flatten out

* Disable cinn single test

* use scatter_nd_add

* modify the reshape part of topk_grad

* delete useless build file

* to make the syntax right

* modify bug

* try use of put_along_axis

* remove cinn test

* reformat todo

* add silu composite rule

* fix code style.

* add cinn test

* fix composite grad maker code gen

* add prim in cumsum op test

* remove old test

* fix typro

* pass the static test

* fix typro

* modify optest and delete old test files

* remove normal test_top_k_op test

* fix typro

* pass axis=None test case

* buffer comment

* for debug

* add silu fp16 unit test.

* add static guard

* remove forward prim test

* remove same name axis

* modify the test_top_v2_op.py to pass all local tests

* delete the useless testcase

* fix mistake

* add more testcases to test dtype16 and dtype32

---------
Co-authored-by: NJiabinYang <360788950@qq.com>
Co-authored-by: NGGBond8488 <857631483@qq.com>
Co-authored-by: Nzxcd <228587199@qq.com>
Co-authored-by: NCharles-hit <wanghao107@baidu.com>

296b3ff0

C

add op map (#51026) · 83f61bd5
由 cyber-pioneer 提交于 3月 01, 2023

83f61bd5
N

Add multiprecision for rms op (#50132) · 48060b2e
由 niuliling123 提交于 3月 01, 2023

48060b2e

28 2月, 2023 3 次提交

【prim】Matmul double grad composite api (#50452) · a0c473f4

由 xiaoguoguo626807 提交于 2月 28, 2023

* modify name

* merge develop

* original code

* build modify

* success 2*2

* fused dim=1 failed

* success

* modify static

* success for static except dim=1

* delete log

* tmp modify

* success

* success

* add fp1664

* delete fp16 cpu test

* stop windows test

* review modify

* modify tanh test

* modify tanh

* fix_conflixt

* modift static prim

* fix_conflict

* Update test_static_prim.cc

* update

* bug fix

a0c473f4

add cumsum prim backward (#50565) · ca2b6095

由 GGBond8488 提交于 2月 28, 2023

* add cumsum prim backward

* skip aixs=None test case

* fix op generante eror

* fix static test error

* remove unused code

* fix static test error

* skip cpu float16 test case

* skip eager cpu cumsum float16 test case

* add cinn test

* reshape flatten out

* Disable cinn single test

* remove cinn test

* reformat todo

* add prim in cumsum op test

* remove old test

* fix typro

* fix typro

* fix typro

* pass axis=None test case

* remove forward prim test

* remove same name axis

ca2b6095

【Prim】Reshape, transpose, cast vjp (#50778) · ab1b6303

由 Jiabin Yang 提交于 2月 28, 2023

* support transpose and reshape

* support reshpe, transpose, cast vjp

* merge develop

* recover unused file

* remove prim base

* support problem

* remove additional status settting

* remove additional status settting

* fix ut

* fix ut

* fix ut

* fix no grad branch

* add more test

* disable fp16 in cpu

* fix test

ab1b6303

27 2月, 2023 1 次提交
- H
  [Tensor Operants & Prim] Tensor pow API uses elementwise_pow (#50886) · 8a097399
  由 HongyuJia 提交于 2月 27, 2023
```
* [Tensor Operants & Prim] Tensor pow API uses elementwise_pow

* unittest change to fill_constant+elementwise_pow
```
  8a097399
25 2月, 2023 1 次提交
- Z
  Rename elementwise_heaviside to heaviside (#50821) · 8129c22e
  由 zyfncg 提交于 2月 25, 2023
```
* rename elementwise_heaviside to heaviside

* delete __init__.py

* fix bug
```
  8129c22e
24 2月, 2023 1 次提交

support 'backend' in static ops (#50671) · 363825df

由 HappyHeavyRain 提交于 2月 24, 2023

* support 'backend' in static ops

* change bitwise_xx comment in python

* change bitwise_xxx comment in python

* change 'backend' and 'data_type' in GetExpectedKernelType

363825df

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功