Commits · a6cf6cddd323436b0e441aeb6f67a9a5da6c2172 · 机器未来 / Paddle

Jan 13, 2022 · 4 commits

roi_align aligned supported (#38905) · 08dcea18
Committed by wenbin on Jan 13, 2022
```
roi_align aligned supported
```
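
For readers unfamiliar with the `aligned` flag, the following is a minimal sketch of the coordinate convention it usually denotes in RoI Align implementations: box coordinates are shifted by half a pixel after scaling. It is an illustration only, not the kernel from this commit; the helper name `roi_to_feature_coords` and its signature are hypothetical.

```python
def roi_to_feature_coords(box, spatial_scale, aligned):
    """Map an (x1, y1, x2, y2) box to feature-map coordinates.

    Hypothetical helper for illustration; not part of Paddle's API.
    """
    # With aligned=True the scaled coordinates are shifted by -0.5 so that
    # they line up with pixel centers instead of pixel corners.
    offset = 0.5 if aligned else 0.0
    x1, y1, x2, y2 = (c * spatial_scale - offset for c in box)
    width, height = x2 - x1, y2 - y1
    if not aligned:
        # The legacy (unaligned) convention forces a minimum RoI size of 1.
        width, height = max(width, 1.0), max(height, 1.0)
    return x1, y1, width, height


aligned_roi = roi_to_feature_coords((4, 4, 12, 12), 0.25, aligned=True)
legacy_roi = roi_to_feature_coords((4, 4, 6, 6), 0.25, aligned=False)
```

In this sketch the aligned variant keeps sub-pixel box sizes as they are, while the legacy variant clamps small boxes to at least one feature-map cell.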

Added mul BF16/FP32 FWD/BWD oneDNN kernel (#38552) · fc6eed5b

Committed by jakpiase on Jan 13, 2022

* base changes for mul reimplementation

* empty commit

* tmp save

* full implementation of mul bf16/fp32 fwd bwd

* CI fix

* CI rerun

* changed unity build cmake to avoid gpu issues

* removed mul mkldnn from unity build

* added skipping tests if not cpu_bf16

* CI fix

* CI fix

* CI fix

Fix mkldnn invalid infershape impl (#38837) · 281644cd
Committed by Chen Weihang on Jan 13, 2022
```
* fix mkldnn invalid infershape

* add unittest for mkldnn in new executor

* add import os
```

Support test_imperative using_non_zero_gpu with _test_eager_guard() (#38881) · 5e515781

Committed by Weilong Wu on Jan 13, 2022

* Support test_imperative using_non_zero_gpu and Add a TODO comment

* Change GPU number to 0

* Modify the cuda device selection method

Jan 12, 2022 · 7 commits

the_one_ps dirs reconstruct (#38804) · 50609214

Committed by ziyoujiyi on Jan 12, 2022

* delete gloo connect retry

* the_one_ps dirs reconstruct

* .

* .

* create the_one_ps dirs

* create the_one_ps dirs

* create the_one_ps dirs

* create the_one_ps dirs

* create the_one_ps dirs

* create the_one_ps dirs

* the one ps dirs modify

* the one ps dirs modify

* the one ps dirs modify

* the one ps dirs modify

Fix conv act int8 scale (#38331) · 4825addd
Committed by Sylwester Fraczek on Jan 12, 2022
```
* fix conv act int8 scale

* add unit test for conv+hard_swish
```

support 5d for nearest interp (#38868) · d296456c

Committed by xiaoting on Jan 12, 2022

* support 5d for nearest

* update nearest3d unittest, test=develop

* fix approve ci, test=develop

* fix approve ci, test=develop

[Dist Pass] Amp Pass (#38764) · cc24427e

Committed by JZ-LIANG on Jan 12, 2022

* auto parallel sharding base

* chmod

* add unitest

* set unitest cmake dist label

* revise code according to rewiew

* chmod

* bugfix for grad_clip and param broadcast

* chmod

* update unitest

* chmod

* add clip

* chmod

* add amp pass

* chmod

* add unitest

* remove grad update

* fixed bug

* fixed bug

* fixed typose

* fixed typoes

support test_auto_prune_partial (#38871) · 4640955c
Committed by Jiabin Yang on Jan 12, 2022

Fix api docs (#38882) · 572ba24e

Committed by Chen Long on Jan 12, 2022

* update readme test=document_fix

* update conll05 docs

* update conll05 docs test=document_fix

add args check and comment for exp,polynomy decay (#38782) · b7bae939
Committed by Sing_chan on Jan 12, 2022
```
* add args check and comment for exp,polynomy decay

* modify according to zhouwei's comment
```

Jan 11, 2022 · 5 commits

Support test_numpy_bridge and thread_local_has_grad (#38835) · 29c211ee
Committed by Weilong Wu on Jan 11, 2022

【Auto Parallel】New local tensor (#38747) · d3ba1895

Committed by caozhou on Jan 11, 2022

* update dist tensor

* add unitest

* update unitest

* refactor dist tensor

* update dist tensor and unitest

[AMP] Check call order of paddle.amp.decorate and paddle.DataParallel (#38785) · fbb40281
Committed by zhangbo9674 on Jan 11, 2022
```
* check amp.decorate and DataParallel

* refine coverage

* fix layer dtype

* refine code
```
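
The kind of call-order check named in this commit title can be sketched in plain Python. The classes and the `decorate` function below are stand-ins written for illustration, not Paddle code, and the sketch assumes the intended order is to decorate the original model first and wrap it for data parallelism afterwards.

```python
class Layer:
    """Stand-in for a model; not a Paddle class."""


class DataParallel(Layer):
    """Stand-in wrapper marking a model as already distributed."""

    def __init__(self, layers):
        self.layers = layers


def decorate(model):
    """Hypothetical AMP decorate step that rejects already-wrapped models."""
    if isinstance(model, DataParallel):
        raise RuntimeError(
            "call decorate() on the original model first, "
            "then wrap the result with DataParallel"
        )
    model.amp_decorated = True
    return model


# Accepted order in this sketch: decorate, then wrap.
model = DataParallel(decorate(Layer()))

# Rejected order: wrap, then decorate.
try:
    decorate(DataParallel(Layer()))
    order_error = None
except RuntimeError as exc:
    order_error = str(exc)
```

Failing early with a clear message is the design point here: a wrongly ordered call is reported at setup time rather than surfacing later as a dtype mismatch during training.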

Jit pre save hook (#38186) · e91f7c02

Committed by Ming-Xu Huang on Jan 11, 2022

* Pre-save hooks of jit.save

1. Added pre_save_hooks features to jit.save.
2. Added related unittests

* Added jit pre_save_hooks functions's alias to paddle.jit and copyright.

* Make jit.save_pre_hook style be consisent with Paddle's rule.

* Fixed arguments passing bug in run_save_pre_hooks

* Added API Documents

* Move clear and run_pre_save_hooks as internal methonds only.

* Made register_save_pre_hook as an internal function.

[Eager] fix some eager logic (#38576) · d3686471

Committed by wanghuancoder on Jan 11, 2022

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* support more eager tensor api

* fix merge compile error

* fix compile error and fit develop code

* support pure CPU

* fix some logic error in eager_mode

* support _varbase_creator in eager mode

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

* for eager mode

* refine

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* eager logic

* refine test in pure cpu

* eager logic

* eager logic

* eager logic, test=develop

* skip core.eager when in inference, test=develop

* refine, test=develop

* refine, test=develop

* call RetainGrad after run forward kernel, test=develop

* refine, test=develop

* support dygraph util, meta, guard test

* eager test case

* support inference test

* refine test and fix initializer failed

* modify eagertensor patch method

* add eagertensor.clear_grandint, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* support create varbase and fix retain grad error

* call monkey_patch_varbase in _test_eager_guard, test=develop

* fix windows error

* split clear_gradient to clear_gradient and zero_grads, test=develop

* refine, test=develop

* refine, test=develop

* support test_imperative_basic test in eager mode

* remove additional log in variable.h

* remove additional log in variable.h

* remove additional code create in merge

* eager

* fix some eager logic, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop
Co-authored-by: jim19930609 <jim19930609@gmail.com>
Co-authored-by: JiabinYang <360788950@qq.com>

Jan 10, 2022 · 24 commits
- update mul_gru_fuse_pass ut timeout setting (#38763) · 1f8fe035
  Committed by baoachun on Jan 10, 2022
- Add gpu kernel for new api : linalg.lstsq (#38621) · 405103d8
  Committed by Haohongxiang on Jan 10, 2022
```
* add lstsq gpu kernel

* update

* add docs_en

* modify ut

* fix bugs

* modify example in docs_en

* remove lstsq_op.cu from ROCM cmake

* modify docs_en

* modify docs_en

* modify docs_en

* remove unneccessary TensorCopy
```
- [Fleet Executor] Modified python cache strategy to support multi carriers (#38839) · c50c22b0
  Committed by LiYuRio on Jan 10, 2022
- fix bug of fp16 (#38838) · 7d4ce5b3
  Committed by ShenLiang on Jan 10, 2022
- Add the backward support for QR (#38824) · 657b6742
  Committed by Yulong Ao on Jan 10, 2022
```
* Add the backward support for QR

* Remove unnecessary comments
```
- replace where with min and max · e30150dd
  Committed by HydrogenSulfate on Jan 10, 2022
- update code · 3ab9ace5
  Committed by HydrogenSulfate on Dec 28, 2021
- add static label check · 09d4a3a4
  Committed by HydrogenSulfate on Dec 28, 2021
- replace .where to '==' · b4eec5d5
  Committed by HydrogenSulfate on Dec 28, 2021
- Update test_cross_entropy_loss.py · 9765be09
  Committed by HydrogenSulfate on Dec 28, 2021
- remove hard labels check · 51398ab9
  Committed by HydrogenSulfate on Dec 27, 2021
- change to IndexError · 7ddfec00
  Committed by HydrogenSulfate on Dec 27, 2021
- change to IndexError · 739cff2d
  Committed by HydrogenSulfate on Dec 27, 2021
- change to ValueError · 3997f99a
  Committed by HydrogenSulfate on Dec 26, 2021
- change error to IndexError · 30213703
  Committed by HydrogenSulfate on Dec 26, 2021
- change error to IndexError · 04cd0aef
  Committed by HydrogenSulfate on Dec 26, 2021
- restore test for min,max labels · d49daff0
  Committed by HydrogenSulfate on Dec 26, 2021
- Remove the labels range check under the dynamic graph · 1e3e17df
  Committed by HydrogenSulfate on Dec 26, 2021
- Remove the labels range check under the dynamic graph · 87d9fdae
  Committed by HydrogenSulfate on Dec 26, 2021
- Remove the labels range check under the dynamic graph · 46e856c7
  Committed by HydrogenSulfate on Dec 26, 2021
- Revert "Reupload: Added numpy bf16 datatype support via custom pip package (#38703)" (#38777) · b4dd7828
  Committed by Aganlengzi on Jan 10, 2022
```
This reverts commit ee813e34.
```
- [new-exec] refine ut (#38798) · 897f63b4
  Committed by Leo Chen on Jan 10, 2022
- Support setting infershape function for custom grad op (#38776) · 046553c7
  Committed by Chen Weihang on Jan 10, 2022
```
* unify infer_shape func calling

* support set grad infer shape fn for custom op

* unify infershape in new executor and eager

* remove todo comment

* revert infershape in operator
```
- modify comment of mish (#38805) · 492e6dd0
  Committed by wangxinxin08 on Jan 10, 2022