提交 · ba71fbea5c73040639f5c8074c426e2603067a65 · BaiXuePrincess / Paddle

11 5月, 2022 11 次提交

[Phi] Change the output format of C++ backward api (Part1) (#42677) · ba71fbea

由 zyfncg 提交于 5月 11, 2022

* change the output format of C++ backward api

* fix merge conflict

* fix sparse api code auto-gen

* fix eager_gen bug

* fix bug of output is null

* fix bug of conv2d_grad_impl

* fix optional grad

* fix bug of eager-gen double_grad

* fix bug

* fix multiply_double_grad bug

* remove node pruning

ba71fbea

Move weights and biases scale computing into pass (#42241) · c0652972

由 Zuza Gawrysiak 提交于 5月 11, 2022

* Add int8 scales gathering pass for convolution

* Fix typo

* Add unittest

* Add corrected unit test

* Change test name

* Remove enabling mkldnn in test

* Speed up test

* Change max examples

* Add functional test

* Change test name

* Add new test case

* Rename pass

c0652972

[Yaml]add Double grad (#42638) · c16345cb

由 chentianyu03 提交于 5月 11, 2022

* add conv2d_transpose_double_grad yaml

* add test_conv_transpose double_grad test case

c16345cb

[New API] add API paddle.vision.transforms.RandomPerspective and... · 754820fe

由 Feng Ni 提交于 5月 11, 2022

[New API] add API paddle.vision.transforms.RandomPerspective and paddle.vision.transforms.perspective (#42390)

* add RandomPerspective and perspective

* fix UT, clean codes

* fix UT

* add batch transform

* remove batch in tensor func

* fix typos and coments, test=develop

754820fe

[New API] add API paddle.vision.transforms.RandomAffine and... · cbb8df78

由 Feng Ni 提交于 5月 11, 2022

[New API] add API paddle.vision.transforms.RandomAffine and paddle.vision.transforms.affine (#42278)

* add affine codes

* adjustment codes

* fix test case

* fix F_cv2.affine

* clean codes, add UT

* fix UT

* fix UT

* fix UT shear

* add functional test_errors

* fix typos and coments, test=develop

cbb8df78

J

support custom operator run in double grad mode (#42653) · 00ecb98f
由 Jiabin Yang 提交于 5月 11, 2022

00ecb98f
W

[Eager] Enable_legacy_for_mkldnn_ops_on_off (#42648) · 6c696db1
由 Weilong Wu 提交于 5月 11, 2022

6c696db1
A
[IPU] update to popart v2.5.0 (#42552) · 27acc6c3
由 Allen Guo 提交于 5月 11, 2022
```
* update to popart v2.5.0

* use a specific version of sdk2.5.0
```
27acc6c3
F

stride_slice don't support trt6 (#42639) · c4bed7e4
由 feng_shuai 提交于 5月 11, 2022

c4bed7e4
H
[Dygraph] Support diff batch for sparse of EagerReducer (#42646) · c5232b4b
由 Haohongxiang 提交于 5月 11, 2022
```
* support diff batch for sparse of eagerreducer

* fix
```
c5232b4b
T

remove old XDNN implementation test=kunlun (#42404) · 7b828f71
由 taixiurong 提交于 5月 11, 2022

7b828f71

10 5月, 2022 14 次提交

improve introduction of bfgs args (#42191) · 000edfd2

由 Sing_chan 提交于 5月 10, 2022

* improve introduction of bfgs args; test=document_fix

* modify according to zhouwei's comment; test=document_fix

000edfd2

X
[EinsumOp] Polish forward logic and backward logic for optimize (#42603) · cf198dc9
由 xiongkun 提交于 5月 10, 2022
```
* change logic for optimize

* modifty
```
cf198dc9

[Auto Parallel] Refactor the engine api and parallelizer (#42576) · 83a4b26a

由 Yulong Ao 提交于 5月 10, 2022

* [Auto Parallel] Refactor the engine api and parallelizer

* [Auto Parallel] Fix the default dist op for the slice op

* [Auto Parallel] Fix the format of planer.py

* [Auto Parallel] Fix a bug

83a4b26a

Rea-dd conv_affine_channel fuse pass as oneDNN only pass (#41998) · 3540d33b

由 piotrekobi 提交于 5月 10, 2022

* Readd conv_affine_channel fuse pass as mkldnn pass

* Fix formatting

* Add new test to parallel_UT_rule.py

* Fix Coverage and Windows CI issues

* Revert "Fix Coverage and Windows CI issues"

This reverts commit f33459846385c9fd51c07f9f44e7ff283a652637.

* Fix CI errors

* Remove unnecessary conv_eltwise_add_affine_channel fuse pass

* Remove test from parallel_UT_rule.py

3540d33b

[Eager] Refactor several sharding test (#42608) · 668a0a41

由 Weilong Wu 提交于 5月 10, 2022

* [Eager] fix sharding under eager mode

* [Eager] fix several sharding test under eager mode

* Recover using _test_eager_guard

* Ensured fleet.init under legacy

* Ensured fleet.init under legacy

* Fix CI issue, re-definition strategy and call fleet.init() in stage2_offload

* Modified dygraph_group_sharded_api.py, move fleet.init to a better line

668a0a41

J

update RandomErasing document (#42627) · 08158e93
由 JYChen 提交于 5月 10, 2022

08158e93

[Video detection] Added fill_constant FP32 FWD oneDNN kernel (#37216) · 66a10f36

由 jakpiase 提交于 5月 10, 2022

* added fill_constant kernel

* CI fix

* ci fix

* switched from nan to zero memory

* CI FIX

* ci fixes

* CI rerun

* ci fix

* minor change

* CI rerun

66a10f36

【PaddlePaddle Hackathon 2】18、为 Paddle 新增 paddle.heaviside 和 paddle.Tensor.heaviside API (#41872) · 4892d592

由 BrilliantYuKaimin 提交于 5月 10, 2022

* Create elementwise_heaviside_op.cc

* add ElementwiseHeavisideFunctor

* Create test_elementwise_heaviside_op.py

* 增加heaviside的python接口

* add heaviside in white list

* 增加heaviside的签名

* 增加heaviside的核函数

* 增加heaviside梯度的核函数

* 增加heaviside梯度的注册

* 调整代码格式

* Update elementwise_sig.cc

* add heaviside in __all__

* Update heaviside docs

* Update math.py

* Update math.py

* Update math.py

4892d592

C

update base of cost model (#42601) · 6ac08db5
由 caozhou 提交于 5月 10, 2022

6ac08db5
Q

[MLU]add adam, adamw op of mlu device (#42557) · cc077693
由 qipengh 提交于 5月 10, 2022

cc077693
F

[MLU] add layernorm mlu kernel (#42356) · ecd6db43
由 fwenguang 提交于 5月 10, 2022

ecd6db43
Q

[MLU]add assign op of mlu device (#42591) · 4e5fb733
由 qipengh 提交于 5月 10, 2022

4e5fb733
Z
fix adamw unittest (#42593) · c6f49f0b
由 zhaoyingli 提交于 5月 10, 2022
```
* fix adamw unittest

* tiny fix

* fix param name
```
c6f49f0b
L

fix bug for heter (#42590) · 21b35167
由 lilong12 提交于 5月 10, 2022

21b35167

09 5月, 2022 6 次提交

A
[Eager]Fix tensor.name is empty behavior (#42587) · 81078a88
由 Aurelius84 提交于 5月 09, 2022
```
* [Eager]Fix tensor.name is empty behavior

* fix unittest
```
81078a88
L
fix docs of auto_cast, cuda_places, static.save (#42107) · c3b7bc61
由 Liyulingyue 提交于 5月 09, 2022
```
* auto_cast; test=document_fix

* static.save; test=document_fix

* cuda_places; test=document_fix
```
c3b7bc61

[Ready to merge] oneDNN NHWC matmul & elementwise kernels fixes (#42506) · bf481550

由 Jacek Czaja 提交于 5月 09, 2022

* - fix to crash

- more fixes

- added diagnostic

- matmul output fixes.

- compilation fix

- stop rotating too small shapes

* - Added enabling of matmul_V2 onednn test

bf481550

double grad yaml and test case (#42553) · 8b546f1c

由 chentianyu03 提交于 5月 09, 2022

* add abs double grad yaml and test case

* add pool2d double grad yaml

* add pool2d dygraph double grad test case

8b546f1c

W
[Eager] Fix several sharding test under eager mode (#42573) · 13bcb7cd
由 Weilong Wu 提交于 5月 09, 2022
```
* [Eager] fix sharding under eager mode

* [Eager] fix several sharding test under eager mode
```
13bcb7cd

【PaddlePaddle Hackathon 2】3、为 Paddle 新增 corrcoef(皮尔逊积矩相关系数) API (#40690) · 95a502a2

由 liqitong-a 提交于 5月 09, 2022

* corrcoef commit

* corrcoef commit

* Update test_corr.py

* Update linalg.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update linalg.py

* Update linalg.py

* Update linalg.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

* Update test_corr.py

95a502a2

07 5月, 2022 5 次提交
- C
  put_record_event_in_python_on_timeline_python (#42555) · 80015c06
  由 chenjian 提交于 5月 07, 2022
```
* put_record_event_in_python_on_timeline_python

* fix
```
  80015c06
- Z
  
  fix the problem of slice infer shape (#42568) · c1e45a11
  由 zyfncg 提交于 5月 07, 2022
  
  c1e45a11
- W
  
  add some no need buff (#42556) · 3f372814
  由 wanghuancoder 提交于 5月 07, 2022
  
  3f372814
- Y
  [Auto Parallel] Improve the codes of the completion and distributed context (#40671) · bed9aaea
  由 Yulong Ao 提交于 5月 07, 2022
```
* [Auto Parallel] Replace the old planner by the new partition tuner

* [Auto Parallel] Improve the completion and distributed context

* [Auto Parallel] Fix some bugs of the compatible check of some dist ops

* [Auto Parallel] Fix some bugs
```
  bed9aaea
- A
  
  sync misc changes (#42534) · 37580838
  由 Allen Guo 提交于 5月 07, 2022
  
  37580838
06 5月, 2022 4 次提交

bind elementwise_mod_op_xpu (#42175) · 6ea2f049

由 enzodechine 提交于 5月 06, 2022

* bind elementwise_mod_op_xpu *test=kunlun

* add more supported dtypes and UTs *test=kunlun

* fix datatype error

* add op to in xpu1_op_list

* Update Mac cmake version >=3.15 (#41456)

* Update Mac cmake version >=3.15

* notest;read test1

notest;read test2

notest;read test3

* fix inference link error

* fix inference link error

* fix windows link error

* fix cmake_policy

* fix build big size

* Add paddle::variant and replace paddle::any (#42139)

* add variant and replace any

* split attribute

* disable unittest failed in eager CI in temporary (#42101)

* test=py3-eager

* test=py3-eager

* test=py3-eager

* combine graph_table and feature_table in graph_engine (#42134)

* extract sub-graph

* graph-engine merging

* fix

* fix

* fix heter-ps config

* test performance

* test performance

* test performance

* test

* test

* update bfs

* change cmake

* test

* test gpu speed

* gpu_graph_engine optimization

* add dsm sample method

* add graph_neighbor_sample_v2

* Add graph_neighbor_sample_v2

* fix for loop

* add cpu sample interface

* fix kernel judgement

* add ssd layer to graph_engine

* fix allocation

* fix syntax error

* fix syntax error

* fix pscore class

* fix

* change index settings

* recover test

* recover test

* fix spelling

* recover

* fix

* move cudamemcpy after cuda stream sync

* fix linking problem

* remove comment

* add cpu test

* test

* add cpu test

* change comment

* combine feature table and graph table

* test

* test

* pybind

* test

* test

* test

* test

* pybind

* pybind

* fix cmake

* pybind

* fix

* fix

* add pybind

* add pybind
Co-authored-by: NDesmonDay <908660116@qq.com>

* [CustomDevice] add eager mode support (#42034)

* fix FlattenContiguousRangeOpConverter out dim error (#42087)

* fix FlattenContiguousRangeOpConverter out dim error

* update code

* fix python3.10 compile bug on windows (#42140)

* Optimize dygraph GetExpectedKernelType perf (#42154)

* opt dygraph scheduling

* revert part impl

* fix incorrect usages of std::move and other compile errors (#41045)

* fix bug of std::move and others

* fix an compile error in debug mode

* fix wrong copy assignment operator
Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com>

* reformat
Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com>

* reformat
Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com>

* fix ArrayRef constructor following llvm

* fix format

* fix conflict with master

* fix variant compile error (#42203)

* [Eager] Support numpy.ndarry in CastNumpy2Scalar (#42136)

* [Eager] Remove redundancy code, fix fp16 case (#42169)

* [Eager] Support div(scalar) in eager mode (#42148)

* [Eager] Support div scalar in eager mode

* Updated and remove debug logs

* Remove list, use 'or' directly

* Remove useless statement

* fix recompute (#42128)

* fix recompute

* modify return

* add LICENSE in wheel dist-info package (#42187)

* replace any by variant in infermeta (#42181)

* 【PaddlePaddle Hackathon 2】24、为 Paddle 新增 nn.ChannelShuffle 组网 API (#40743)

* Add infermeta for ChannelShuffle

* Create channel_shuffle_grad_kernel.h

* Create channel_shuffle_kernel.h

* Create channel_shuffle_sig.cc

* Create channel_shuffle_op.cc

ChannelShuffle算子的描述

* Create channel_shuffle_kernel_impl.h

ChannelShuffle核函数的实现

* Create channel_shuffle_grad_kernel_impl.h

ChannelShuffle反向核函数的实现

* Add kernel register of channel shuffle and grad

注册ChannelShuffle及其反向的核函数

* add nn.functional.channel_shuffle

* add nn.ChannelShuffle

* Create test_channel_shuffle.py

* Update example of ChannelShuffle in vision.py

* Update test_channel_shuffle.py

* 修改channel_shuffle核函数的实现位置

* 修正代码格式

* 删除多余空格

* 完善channel_shuffle的错误检查

* Update unary.cc

* Update channel_shuffle_op.cc

* Update test_channel_shuffle.py

* Update unary.cc

* add channel_shuffle

* Update test_channel_shuffle.py

* Update vision.py

* 调整代码格式

* Update channel_shuffle_sig.cc

* 更新ChannelShuffle的文档

* 更新channel_shuffle的文档

* remove ChannelShuffleOpArgumentMapping

* add ChannelShuffleGradInferMeta

* Update channel_shuffle_op.cc

* 调整channel_shuffle及其梯度的核函数的位置

* Do not reset default stream for StreamSafeCUDAAllocator (#42149)

* remove redundant computation in Categorical.probs (#42114)

* Downloading data for test_analyzer_vit_ocr (#42041)

* Change server URL

* update config

* add test to parallel UT rule

* add checksum to ensure files are downloaded

* change downloading target

* reuse existing variable

* change target directory

* fix en docs of some Apis (gradients, scope_guard, cuda_places, name_scope, device_guard, load_program_state, scale, ParamAttr and WeightNormParamAttr) (#41604)

* Update scope_guard; test=document_fix

* gradients; test=document_fix

* gradients; test=document_fix

* name_scope; test=document_fix

* cpu_places; test=document_fix

* WeightNormParamAttr; test=document_fix

* cuda_places; test=document_fix

* load_program_state; test=document_fix

* device_guard; test=document_fix

* device_guard; test=document_fix

* ParamAttr; test=document_fix

* scale; test=document_fix

* scale; test=document_fix

* update code example；test=document_fix
Co-authored-by: NChen Long <1300851984@qq.com>

* fix datatype error

add op to in xpu1_op_list

*test=kunlun

* fix elementwise_mod op path error  *test=kunlun

* fix elementwise_mod UT error  *test=kunlun

* fix datatype error

add op to in xpu1_op_list

*test=kunlun

add op to in xpu1_op_list

fix elementwise_mod op path error  *test=kunlun

fix elementwise_mod UT error  *test=kunlun
Co-authored-by: Ntianshuo78520a <707759223@qq.com>
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: Npangyoki <pangyoki@126.com>
Co-authored-by: Nseemingwang <seemingwang@users.noreply.github.com>
Co-authored-by: NDesmonDay <908660116@qq.com>
Co-authored-by: Nronnywang <524019753@qq.com>
Co-authored-by: Nbaoachun <962571062@qq.com>
Co-authored-by: Zhou Wei <1183042833@qq.com>
Co-authored-by: Ntiancaishaonvjituizi <452565578@qq.com>
Co-authored-by: NWeilong Wu <veyron_wu@163.com>
Co-authored-by: NRoc <30228238+sljlp@users.noreply.github.com>
Co-authored-by: NBrilliantYuKaimin <91609464+BrilliantYuKaimin@users.noreply.github.com>
Co-authored-by: NRuibiao Chen <chenruibiao@baidu.com>
Co-authored-by: NFeiyu Chan <chenfeiyu@baidu.com>
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
Co-authored-by: NYilingyelu <103369238+Yilingyelu@users.noreply.github.com>
Co-authored-by: NChen Long <1300851984@qq.com>

6ea2f049

A
[NPU] add clip_by_norm op (#42411) · 1588e7e7
由 Aganlengzi 提交于 5月 06, 2022
```
* [NPU] add clip_by_norm op

* fix

* update
```
1588e7e7
L

add send/recv for ProcessGroupHeter (#42318) · df113208
由 lilong12 提交于 5月 06, 2022

df113208
A

update UTs 3 (#42519) · 94acf7c8
由 Allen Guo 提交于 5月 06, 2022

94acf7c8

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致