提交 · 8baf33a4baeedf0274a581cdf9d412edc4f3c82d · PaddlePaddle / Paddle

30 3月, 2023 3 次提交

[CINN] pass global seed to CINN (#52078) · 94aea284

由 jiangcheng 提交于 3月 30, 2023

* [CINN] pass global seed to CINN

* fix cu not include cinn/runtime/flags.h bug

* fix DefaultCUDAGenerator should has device id bug

94aea284

[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call... · 929892c3

由 cyberslack_lee 提交于 3月 30, 2023

[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140)

* codestyle c416 c417

* fix error

* fix inc

* unify all C4 rules into one

* fix inc

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

929892c3

Add Gloo SendRecv Function (#52221) · b8850521

由 yuehuayingxueluo 提交于 3月 30, 2023

* add gloo  send_recv

* fix code_stype

* fix CI bug

* fix send_recv.cc

* add send_recv without sync_op

* fix send_recv test

* fix gather.cc

b8850521

29 3月, 2023 26 次提交
- Z
  [AMP OP&Test] pad3d add unittests of fp16 and bf16 (#51015) · f86d0be7
  由 zengshao0622 提交于 3月 29, 2023
```
* pad3d add unittests of fp16 and bf16

* pad3d add unittests of fp16 and bf16

* fix cuda place

* fix random to uniform

* fix class name

* fix fp16 max relative error to 1.5e-3

* add dytpe register for onednn

* add pad uint16 check of common.py

* remove check_eager

* test_check_grad --> test_check_grad_normal
```
  f86d0be7
- J
  Clear the infrt-related code (#52273) · da5a2584
  由 jjyaoao 提交于 3月 29, 2023
```
* Clear the infrt-related code

* remove tools/infrt
```
  da5a2584
- Y
  Add group_norm composite rule (#51874) · cabf3921
  由 Yichen Zhang 提交于 3月 29, 2023
```
* add group_norm composite rule

* add test for scale_grad and bias_grad

* resolve conflicts

* remove amp in composite_rule.py

* add float16 test

* deal with NHWC format

* keep the composite rule in float16 identical as original kernel

* resolve conflicts
```
  cabf3921
- W
  Del old dygraph optest8 (#52094) · d612faf5
  由 wanghuancoder 提交于 3月 29, 2023
```
* delete old dygraph op test
```
  d612faf5
- H
  Add output defines for graph_sample_neighbors and group_norm (#51503) · 37bd7e78
  由 hjyp 提交于 3月 29, 2023
```
* regist output type for GraphSampleNeighbors and GroupNorm

* Update return type

* fix return type

* update

* fix detail
```
  37bd7e78
- C
  
  Fix the type conflicts against the openblas (#52187) · a5ca2672
  由 chenxujun 提交于 3月 29, 2023
  
  a5ca2672
- H
  [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output (#52214) · fc02b1e6
  由 HongyuJia 提交于 3月 29, 2023
```
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output

* delete dtype,shape func of multi_inplace op

* [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output

* [CustomOP Inplace] Auto-generate python API for inplace vector<Tensor> output
```
  fc02b1e6
- G
  
  Fix_Linux_[-Wterminate]warning (#52186) · 225f1af2
  由 Galaxy1458 提交于 3月 29, 2023
  
  225f1af2
- 张
  [CodeStyle][UP034] remove (()) cases (#52060) · c0697296
  由张春乔提交于 3月 29, 2023
```
* add up34

* modify var name in loop

* revert changes in test_slice

* Revert "modify var name in loop"

This reverts commit 6d748e371afb417054ed0c6b36fd11e87959a90d.

* temporarily ignore test_slice.py

* add comment

* empty commit, re-trigger all ci

* fix inc

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
```
  c0697296
- S
  [BugFix] fix compute error in fused_dropout_add (#52261) · 8082ba8a
  由 ShenLiang 提交于 3月 29, 2023
```
* fix bg

* add utest

* add utest
```
  8082ba8a
- [Zero-Dim] change Tensor.numpy() usage to other equivalent usage, avoid hack (#52197) · 73df2b1e
  由 zhouweiwei2014 提交于 3月 29, 2023
  
  73df2b1e
- X
  tanh_double_grad_rules (#52192) · d966301e
  由 xiaoguoguo626807 提交于 3月 29, 2023
```
* tanh_double_grad_rules

* delete log got api_base

* modify composite yaml

* optimize rules
```
  d966301e
- Z
  
  [XPU] optimize pass (#52099) · 599388e3
  由 zhupengyang 提交于 3月 29, 2023
  
  599388e3
- Q
  
  fix bkcl_all_gather and c_embedding_grad bug for xpu (#51785) · ad76d37e
  由 QingshuChen 提交于 3月 29, 2023
  
  ad76d37e
- Z
  [Move Test] Move imperative (#52170) · 8cc48b2e
  由 Zheng-Bicheng 提交于 3月 29, 2023
```
* update

* update
```
  8cc48b2e
- J
  
  [Test Mv] remove infrt tests (#52063) · c4abe26c
  由 jjyaoao 提交于 3月 29, 2023
  
  c4abe26c
- Y
  Add Fuse Adamw Pass (#50484) · 66098bff
  由 yuehuayingxueluo 提交于 3月 29, 2023
```
* add fuse adamw pass

* fix some bugs

* fix CIbug

* change chunk_size

* fix CI bug

* rm test_fused_adam_op.py

* fix CI bugs

* fix fuse_adamw_op_pass.cc

* change code style

* fix CI bug

* fix ut bug and use_adamw_op_pass.cc

* fix test_fuse_adamw_pass.py

* fix CI bug

* remove fluid

* fix ci bug

* fix CI bug
```
  66098bff
- G
  Support ignore_index for c_softmax_with_cross_entropy_op. (#52157) · 5c76b38b
  由 Ghost Screaming 提交于 3月 29, 2023
```
* Support ignore_index for c_softmax_with_cross_entropy_op.

* Polish code. Remove useless comments and add Testcase.

* Polish code for TestCase.

* Polish code.

* Polish code style.

* Polish code.

* Change loss calculation formula and ignore_index dtype.

* Polish TestCase.
```
  5c76b38b
- N
  
  Support op check list and op skip in check_nan_inf_tools (#51998) · 7067763e
  由 niuliling123 提交于 3月 29, 2023
  
  7067763e
- Z
  
  move clip_by_norm kernel to phi for xpu (#52183) · bf61a0d9
  由 zhangyikun02 提交于 3月 29, 2023
  
  bf61a0d9
- J
  [kunlun] support min/max in dygraph mode (#52228) · 5e9a2038
  由 jameszhang 提交于 3月 29, 2023
```
* [kunlun] support min/max in dygraph mode

* update xccl to 1.0.13
```
  5e9a2038
- R
  
  auto generate a phi config header (#52224) · 5a9d59c5
  由 ronnywang 提交于 3月 29, 2023
  
  5a9d59c5
- H
  fix Kunlun-KP-Build (#52188) · 4f74656d
  由 huangjiyi 提交于 3月 29, 2023
```
* fix kp compile

* test

* Revert "test"

This reverts commit 3a1cbfaa0f23e6e06d3dcd8d0b0c28aa63a98e70.

* update copyright

* update cmake

* update cmake

* update cmake

* update cmake
```
  4f74656d
- 傅
  
  [AMP OP&Test] add bf16 fp16 test case for interpolate (#51160) · df423557
  由傅剑寒提交于 3月 29, 2023
  
  df423557
- Y
  
  [AMP OP&Test]label_smooth op fp/bf16 support (#52193) · c4b6d1ae
  由 YuhangLi 提交于 3月 29, 2023
  
  c4b6d1ae
- S
  Fix generate_kernels.py in CUDA 12.0 (#52232) · f2c96bc2
  由 sneaxiy 提交于 3月 29, 2023
```
* fix generate_kernels.py in CUDA 12.0

* fix attrs bug
```
  f2c96bc2
28 3月, 2023 11 次提交

Add basic functionalities to support Scalar & Scalars in op attr (#51984) · 2e9fd5e4

由 Feiyu Chan 提交于 3月 28, 2023

Add basic functionalities to support Scalar & Scalars in operator attribute.

1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types;
2. Scalar enhancement, add formatting, equality;
3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc;
4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`;
5. add functionality to canonicalize attribute map according to OpProto(if the op the attribute map used for has an OpProto);
6. add code to manipulate Scalar proto message via protobuffer python API;

Add unittests.

1. add test cases for formatting, equality for Scalars, and WrapAsScalars;
2. add test cases for 'casting' between different morphs of attributes;
3. add test cases for extracting scalar & scalars from attribute;
4. add test cases for CanonicalizeScalarAttrs(and fix a bug in type index offset);
5. fix gmock's library filename on windows platform.
6. clean code: use canonicalize_attrs instead of inlining the function;
7. add test cases for libpaddle.Scalar in python code.
8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.

2e9fd5e4

Z
[inference] Remove log about fluid and fix uninitialization warning (#51558) · e91a7896
由 Zhang Jun 提交于 3月 28, 2023
```
* Remove log about fluid
* Remove useless forward declarations
* Fix uninitialization warning (trt onehot)
```
e91a7896
C

support auto generate for kldiv_loss (#51886) · cdba7e36
由 cyberslack_lee 提交于 3月 28, 2023

cdba7e36
张
support auto generate for cumprod (#52047) · a2d3c335
由张春乔提交于 3月 28, 2023
```
* mv cumprod

* add attrs

* Update backward.yaml

* Update backward.yaml
```
a2d3c335
W
Del old dygraph optest7 (#51999) · 6d0fa6f2
由 wanghuancoder 提交于 3月 28, 2023
```
* delete old dygraph op test
```
6d0fa6f2

【prim】change layernorm_grad rules (#51879) · 789aac8a

由 xiaoguoguo626807 提交于 3月 28, 2023

* support layer_norm prim and cinn test

* enable cinn test

* fix merge conflict

* polish input for check_output_with_place

* fix merge conflict

* add more test case

* fix merge conflict

* polish test case

* polish op_test

* change ln_g rules

* modify scale is none case

* modify scale is none case

* add public_python_api for check prim

* modify setoutputgrad and fp64bug

* add todo & delete log

* delete Single***varname

* delete get varname

* modify FP64 bug

* delete op test

* recover

* fix conflict

---------
Co-authored-by: NWeilong Wu <veyron_wu@163.com>

789aac8a

L
add support to set chunk size of auto_growth_allocator (#52204) · b3efc923
由 Leo Chen 提交于 3月 28, 2023
```
* add flag to set chunk size

* use the flag

* add vlog

* add ut

* rename ut
```
b3efc923
S
Add overflow check in memory efficient attention implementation (#52191) · ecff3864
由 sneaxiy 提交于 3月 28, 2023
```
* add overflow check in memory efficient attention

* fix ci compile error

* fix ci compile error
```
ecff3864
H
fix int8 support for full kernel (#52194) · c145fd1e
由 houj04 提交于 3月 28, 2023
```
* fix int8 support for full kernel

* fix ut.
```
c145fd1e
C
support auto generate for huber_loss (#51951) · 2ba4515e
由 cyberslack_lee 提交于 3月 28, 2023
```
* fix huber_loss

* fix

* fix ops.yaml add intermediate

* fix

* fix test
```
2ba4515e
R
support auto generate static for one_hot_v2 (#52134) · b6af72eb
由 RedContritio 提交于 3月 28, 2023
```
* support auto generate static for one_hot_v2

* format
```
b6af72eb

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功