提交 · 5a39365a259eb4c3fbeb889c9e41603d7e311b9f · PaddlePaddle / Paddle

13 3月, 2023 15 次提交

M
Add Unsqueeze op composite rule (#51527) · ed19d37f
由 mengziheng 提交于 3月 13, 2023
```
* first test

* add unsqueeze_op
```
ed19d37f
W
[AMP OP&Test]add fp16 and bf16 OpTest for index_select (#51159) · d3ebf1e6
由 wangxiaoning 提交于 3月 13, 2023
```
* add fp16/bf16

* add grad bf16

* test name
```
d3ebf1e6
J

just a patch for bool tensor indexing with shape -1 (#51046) · 059699a2
由 JYChen 提交于 3月 13, 2023

059699a2

由 Sławomir Siwek 提交于 3月 13, 2023

* mkldnn->onednn

* fused softplus op + kernel

* remove extra attributes

* add missing handler

* change var name

fdcfa04f

W
squeeze2_op (#51146) · f9a4f007
由 wenbin 提交于 3月 13, 2023
```
* squeeze2_op

* add ut

* fix ut

* fix static

* modity ut
```
f9a4f007

[with_data_parallel][part6] remove with_data_parallel in distributed optimizer (#50719) · 1404f732

由 kangguangli 提交于 3月 13, 2023

* find relevant testcase

* remove with_data_parallel

* trigger CI

* do not apply ParameterServerGraphOptimizer

* remove useless optimizer

* remove with_data_parallel in test_dist_base

* fix test_fleet_base_3

* only reserve changes for GraphExecutionOptimizer

* fix bug

* fix test_minst_dgc_nccl

* fix typo

* fix test_dist_mnist_gradient_merge

* rm TestDistMnistNCCL2DGCMultiCards

* fix optimizer conflicts

* fix dist_mnist

* fix test_dist_hapi

* delete test_fleet_graph_execution_meta_optimizer & test_fleet_graph_executor

* temporally not delete unittest

* fix unittests

* fix ci

* recover prune in python/paddle/hapi/model.py

1404f732

K

remove flags_enable_parallel_graph (#51375) · 38865fcd
由 kangguangli 提交于 3月 13, 2023

38865fcd

Add expand composite rule (#50810) · 559de39a

由 xysheng-baidu 提交于 3月 13, 2023

* Add expand composite rule

* reshape x when dim_in less than dim_out

* add tile op for expand

* remove rensor shape case when comp prim

* enable cinn case

* dim_out can't be 0

* update test case for prim type

559de39a

[Paddle Inference ]use python to generate cutlass code (#50603) · 4e9e23cb

由 zhoutianzi666 提交于 3月 13, 2023

* use python to generate cutlass code

* refine CommonConvKernelPart1, CommonConvKernelPart2

* remove useless code in generate_cutlass_code.sh

* add more config in conv2d_residual

* CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2

* add group conv support in util.cu

* remove .sh

* refine name

* make name goodgit status!

* add fuse_alpha

* make code easy to understand

* mot fopen generate in py

* use python script to generate conv2d,group=1 cutlass code

* use const &

* use const & && use python script to generate conv2d/group=1 code

4e9e23cb

K
[with_data_parallel][part12] remove with_data_parallel in test_sync_batch_norm_op (#51382) · 37662dd1
由 kangguangli 提交于 3月 13, 2023
```
* remove with_data_parallel in test_sync_batch_norm_op

* fix debug code

* polish code

* polish code

* polish code
```
37662dd1
J

[CINN] reopen elementwise_div/cumsum prim+cinn unittest (#51465) · 0530358f
由 jiangcheng 提交于 3月 13, 2023

0530358f

张

【Hackathon No.89】 Remove circle import Part2 (#51199) · 34358de5

由张春乔提交于 3月 13, 2023

* fix the only one circle import in call_transformer.py

* move define of CONVERSION_OPTIONS from convert_call_func.py to program_translator.py

* delete the self import of program_translator.py

* fix import failed problem

* define variable in utils.py

* move is_builtin to utils.py

* move is_builtin to utils.py

* fix import errors

* fix import errors

* fix something

* Update python/paddle/jit/dy2static/call_transformer.py
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>

* Update python/paddle/jit/dy2static/call_transformer.py

---------
Co-authored-by: NAurelius84 <zhangliujie@baidu.com>

34358de5

H
[XPU] add increment op. (#51487) · ab17f988
由 houj04 提交于 3月 13, 2023
```
* [XPU] add increment op.

* fix ci
```
ab17f988
R
[PHI] Add reduce_min_grad xpu op and the corresponding unittest (#51431) · 88d42398
由 RuohengMa 提交于 3月 13, 2023
```
* [XPU] add reduce_min_grad XPU kernel

* add unittest for reduce_min xpu op
```
88d42398
W
Del old dygraph optest2 (#51458) · e6ca78c2
由 wanghuancoder 提交于 3月 13, 2023
```
* delete old dygraph op test
```
e6ca78c2

12 3月, 2023 1 次提交
- W
  Del old dygraph optest1 (#51417) · 11d7dae9
  由 wanghuancoder 提交于 3月 12, 2023
```
* delete old dygraph op test
```
  11d7dae9
10 3月, 2023 18 次提交
- N
  
  [Dy2St] allow write to container in control flow (#51248) · 30865110
  由 Nyakku Shigure 提交于 3月 10, 2023
  
  30865110
- P
  Align auc with distps (#51434) · d69bb1da
  由 pangengzheng 提交于 3月 10, 2023
```
* support run haokanctr model in heterps-models

* polish setup.py

* polish JVM_LIB in evn_dict

* align infer auc with DistPsArch pre-stable
```
  d69bb1da
- I
  
  【Hackathon No58】-fix set_value (#51197) · bfb79ee2
  由 Infinity_lee 提交于 3月 10, 2023
  
  bfb79ee2
- T
  fix-debug-mode-exception (#51368) · 2404847d
  由 tifa 提交于 3月 10, 2023
```
* fix-debug-mode-exception

* fix code style

* use try-except solve
```
  2404847d
- Z
  
  should import flatten from paddle.utils (#51481) · fe264cfc
  由 zqw_1997 提交于 3月 10, 2023
  
  fe264cfc
- K
  [with_data_parallel][part10] remove with_data_parallel in parallel_executor.py (#51369) · 19001c6c
  由 kangguangli 提交于 3月 10, 2023
```
* remove with_data_parallel

* remove multidevice-optimizer-in-controlflow checks and fix ci
```
  19001c6c
- K
  
  remove with_data_parallel and return_merged (#51374) · 7ee3eba9
  由 kangguangli 提交于 3月 10, 2023
  
  7ee3eba9
- C
  
  add reduce_mean and gelu test (#51447) · ac495981
  由 Charles-hit 提交于 3月 10, 2023
  
  ac495981
- S
  Add attn_bias.py of xformers (#51387) · 54331f1a
  由 sneaxiy 提交于 3月 10, 2023
```
* add attn_bias.py

* add Python interface

* add license

* add test_attn_bias.py

* fix CPU test error

* fix ci error
```
  54331f1a
- 陈
  
  No.54：为 Paddle allclose、isclose 算子实现 float16 数据类型支持 (#51168) · 24258c27
  由陈沧夜提交于 3月 10, 2023
  
  24258c27
- C
  [static code gen]add error msg in composite maker code gen (#51211) · 07d8770f
  由 Charles-hit 提交于 3月 10, 2023
```
* support variable parameter in optest

* add error msg for use tensor attr in static code gen

* fix static code gen

* fix prim op test

* modify comment

* fix op test

* fix ci

* remove code
```
  07d8770f
- W
  
  update some log. (#51274) · 0d096b3b
  由 wuhuachaocoding 提交于 3月 10, 2023
  
  0d096b3b
- A
  [Framework]Support deliver stop_gradient in static mode (#49013) · 38d233d9
  由 Aurelius84 提交于 3月 10, 2023
```
* [framework]support pass stop_gradient in static mode

* fix control_flow op stop_gradient
```
  38d233d9
- K
  [with_data_parallel][part7] remove with_data_parallel in custom op test (#51164) · 1cffb1ff
  由 kangguangli 提交于 3月 10, 2023
```
* remove with_data_parallel in custom op test

* finish TestCustomOpReluModelStaticMultiDevice

* fix typo

* add checks for relu output

* fix ci

* fix ci

* fix compile checks

* fix coverage ci
```
  1cffb1ff
- N
  
  Delete duplicate code in optimizer.py and support master_param for bf16 in optimzer (#51367) · af2c31a6
  由 niuliling123 提交于 3月 10, 2023
  
  af2c31a6
- L
  
  Address bug of open amp after dynamic to static, when control op in program. (#50799) · 3c7cde95
  由 liuruyan 提交于 3月 10, 2023
  
  3c7cde95
- Z
  [AMP OP&Test] Modify the logic of comparing grad in bfloat16 (#51345) · 6f30b14f
  由 Zhang Zheng 提交于 3月 10, 2023
```
* [AMP OP&Test] Modify the logic of comparing grad in bfloat16
```
  6f30b14f
- C
  
  add flashattn raw kernel (#51383) · f951832d
  由 Chitsing KUI 提交于 3月 10, 2023
  
  f951832d
09 3月, 2023 6 次提交

[AMP OP&Test] arange op support fp16/bf16 (#51106) · f3448977

由 yangjianfengo1 提交于 3月 09, 2023

* AMP arange & Test

* fix arange bfloat16 dtype

* update for review

* update for review2

* fix tile

* update

* fix ci

* r

* f

* fix windows ci

* update bfloat data

* fix bloat16 input

* add print

* Update test_where_op.py

* update kernel

* del repeat

* update review

f3448977

[AMP OP&Test] where support bf16/fp16 (#51137) · 2727dddb

由 yangjianfengo1 提交于 3月 09, 2023

* where op test

* update bfloat16

* fix

* fix windows ci

* update bfloat16 data

* fix bloat16 x

* reset

* fix randint

* add print

* add delta

* cancel print

* code style

* update revirew

2727dddb

Remove paddle.fluid.layers.utils.* (#51033) · 86e990d4

由 zqw_1997 提交于 3月 09, 2023

* move fluid.utils to paddle.utils.layers_utils

* fix error

* delete original fluid layers utils

* remove import and old utils

* remove more old utils import

* change import path of fill_constant in the layers_utils.py

* fix mistake

* fix error

* expose in __init__.py

* for comment

* when change the ref of func is_sequence, it should change to the root of is_sequence instead

* for codecheck

86e990d4

add prim erf grad (#50436) · b7e4d974

由 GGBond8488 提交于 3月 09, 2023

* add prim erf grad

* add yaml config for prim erf grad

* add math.h

* add cmath

* add math  defines

* use define math

* use define math

* define M_2_SQRTPI

* M_2_SQRTPI math

* try math.h

* fix typro

* remove pow in erf grad

* use new optest

* add fp16 fp32 test

* remove fp16 test

b7e4d974

Enable gpups run on rec model (#51115) · 32f369a8

由 pangengzheng 提交于 3月 09, 2023

* support run haokanctr model in heterps-models

* polish setup.py

* polish JVM_LIB in evn_dict

32f369a8

W
Add softplus double grad (#50261) · 542844b4
由 will-jl944 提交于 3月 09, 2023
```
* add softplus double grad

* use constant method
```
542844b4

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功