Commits · 178d7e5ed1d3a44231f5c73a1f7f68602d6fcd26 · BaiXuePrincess / Paddle

18 Oct, 2022 · 9 commits

add strategy group (#47021) · 178d7e5e
Committed by LiYuRio on Oct 18, 2022

178d7e5e

Add value check & error message for gather_tree (#47051) · e5e3d5cf
Committed by liu zhengxi on Oct 18, 2022

e5e3d5cf

Merge layernorm trt fuse (#46320) · 5e9f491e

Committed by Wang Bojun on Oct 18, 2022

* first version, accuracy corrected

* disable debug print

* use blockReduceSum in phi

* add UT

* add opCompat

* code style

* code refine

* bug fix

* code refine

* test fix

* bugfix

* codestyle fix

* code style

* code-style

* code-style

* code-style

5e9f491e

FC + activation fuse passes (#45183) · b7a23adb

Committed by Sławomir Siwek on Oct 18, 2022

* git

* style

* leave default relu in kernel

* style

* cleanup FCMKLDNN pattern

* merge conflicts

* update develop

* update develop

* add const

* rename to oneDNN and adjust attributes

* whitespace

b7a23adb

[Auto Parallel] Add cost interface (#47043) · da051350

Committed by caozhou on Oct 18, 2022

* add cost interface

* update interface and add unittest

* update unittest

* update interface

da051350

[Paddle Inference] Add_expand_v2_trt_layer (#47002) · a21a2b5b
Committed by xiaoxiaohehe001 on Oct 18, 2022

a21a2b5b

[CodeStyle][py2] remove `compat` module (to_text) (#47036) · ad4c773b

Committed by Nyakku Shigure on Oct 18, 2022

* [CodeStyle][py2] remove `compat` module (to_text)

* remove some unnecessary decode

* remove to_text definition and unittest

* Revert "remove to_text definition and unittest"

This reverts commit a6b69cb8dca8b9b031ce10ea32d1040e7e0dd267.

* remove an assertion

* empty commit

ad4c773b

[Eager, Performance optimization] support pow( ** operator) to sink to Cpp layer (#47077) · 62c0abac
Committed by Weilong Wu on Oct 18, 2022

62c0abac

[AutoParallel] add callbacks (#47014) · 7c92177c

Committed by zhaoyingli on Oct 18, 2022

* [AutoParallel] add callbacks

* fix unittest

* fix dist_context

* fix engine

* fix cmakelist

* fix unittest's returns

* fix cmakelist

7c92177c

17 Oct, 2022 · 10 commits

[hidden trouble] Update test_sparse_transpose_op.py to get rid of a hidden trouble. (#47017) · d43c972c

Committed by OccupyMars2025 on Oct 17, 2022

* Update test_sparse_transpose_op.py

* Update test_sparse_transpose_op.py

d43c972c

【Hackathon No.8】 add gumbel distribution api (#46255) · f1a9f877

Committed by YuRonan on Oct 17, 2022

* init gumbel api

* commit: update test file

* fix：bug

* update Gumbel API

* upgrade distribution/gumbel.py

* add tests/test_distribution_gumbel.py

* fix：code style

* fix：code style

* fix：code style

* fix：code style

* fix bug

* fix：code style

* fix：code style

* fix：rollback uniform

* fix：delete invalid code

* fix：bug and add static test

* fix：code style

* fix：code style

* fix：delete init transforms

* fix：bug

* fix：bug

* fix：code style

* fix：code style

* fix：add transforms

* fix：code style

* fix：code style

* fix：bug

* fix：bug

* fix：code style

* fix：code style

* fix：bug

* fix：code style

* fix：code style

* fix：bug for gumbel.py / add：judge transforms'len for transformed_distribution.py

* update gumbel.py

* fix：bug for test_distribution_gumbel.py

* fix：bug for test_distribution_gumbel_static.py

* fix：code style

* fix：code style

* fix：coverage

* fix：bug

* fix：bug

* fix：code style

* fix：bug

* delete：no use package for gumbel.py

* add：coverage transforms'len judge for test_distribution_gumbel.py

* fix：code style for test_distribution_gumbel.py

* fix：coverage

* fix：code style

* fix：code style

* fix：code style

* fix：code style

* fix：code style

* fix：en doc

* fix：param

* fix：copyright

* fixSample; test=document_fix
Co-authored-by: Ndasen <sen15530876201@163.com>

f1a9f877
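
For context on the distribution this commit adds: a Gumbel(loc, scale) variate can be drawn by inverse-transform sampling, x = loc - scale * ln(-ln(u)) with u uniform on (0, 1), and its mean is loc + γ * scale, where γ ≈ 0.5772 is the Euler-Mascheroni constant. The sketch below uses only the Python standard library and is not the code from `distribution/gumbel.py`; the function name `sample_gumbel` and the chosen parameters are hypothetical.

```python
import math
import random

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def sample_gumbel(loc, scale, rng):
    """Draw one Gumbel(loc, scale) sample by inverting the CDF.

    Hypothetical helper for illustration; not part of the Paddle API.
    """
    u = rng.random()
    # random() returns values in [0, 1); log(0) is undefined, so redraw.
    while u == 0.0:
        u = rng.random()
    return loc - scale * math.log(-math.log(u))


rng = random.Random(0)  # fixed seed so the run is repeatable
samples = [sample_gumbel(1.0, 2.0, rng) for _ in range(200_000)]

empirical_mean = sum(samples) / len(samples)
theoretical_mean = 1.0 + EULER_GAMMA * 2.0  # loc + gamma * scale
```

With this many samples the empirical mean should land close to the theoretical one, which is a quick sanity check for any Gumbel sampler.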

[Hackathon 3rd No.22 ] add paddle.incubate.sparse.reshape (#46694) · abb38136

Committed by OccupyMars2025 on Oct 17, 2022

* add sparse reshape

* change the dtype in all test cases to int64

* just one test case

* modify comments

* Update test_sparse_reshape_op.py

* change the type of "shape" from vector<int64_t> to IntArray

* check whether sp_out.to_dense() is the cause of error

* print sp_out

* Update reshape_kernel.cc

* use numpy to generate the equal paddle tensor

* just check dense_tensor.numpy()

* check cpu and cuda versions

* Update test_sparse_reshape_op.py

* supply all test cases for cpu forward coo kernel

* test forward coo cuda kernel

* change configuration of cuda kernel

* keep only one test case

* test coo cpu kernel (forward and backward)

* row major or column major ???

* test cuda coo forward kernel

* complete declaration and registration

* Update __init__.py

* rebuild

* retrigger CI

* add cudaMalloc and cudaMemcpy in ReshapeCooKernel and change back to row major order in a cuda dense tensor

* modify minor error

* test only cpu coo forward kernel

* add all test cases for coo forward kernel (both cpu and gpu)

* test all forward kernels (coo, csr; cpu, gpu)

* add all test cases for all kinds of kernels

* just retrigger CI

* Update sparse_ops.yaml

* Update sparse_ops.yaml

* Update sparse_ops.yaml

* resolve conflicts

* Update sparse_ops.yaml

* don't specify tensor place

* new shape has -1 or 0 in it

* Update unary_grad_kernel.h

* correct lvalue error

* code style

* Update sparse_backward.yaml

* Update sparse_ops.yaml

* Update unary_kernel.h

* Update unary.py

* Update sparse_backward.yaml

* Update unary.py

* code style

* code style

* code style

* Update unary.py

* specify tensor place explicitly

* do not use numpy array

* use numpy array in unit test again

* modify example code in docstring

abb38136

support __floordiv__ (#47060) · 64307903
Committed by Weilong Wu on Oct 17, 2022

64307903

Layernorm shift partition enhance (#46816) · 9e08633c

Committed by Wang Bojun on Oct 17, 2022

* first version of ln_s_p with s>0

* refine and UT

* pass opt draft

* pass opt

* code refine

* code-style

* bug fix

* fix ci test

* code style

9e08633c

skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr (#46911) · 2e7dc666

Committed by pangyoki on Oct 17, 2022

* skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr

* update ut

* test_dist_allreduce_op failed

* fix test_dist_allreduce_op

* add ut

* fix nccl cpu compile

* fix

2e7dc666

[CodeStyle][py2] remove `compat` module (to_bytes) (#47035) · 198c7993

Committed by Nyakku Shigure on Oct 17, 2022

* [CodeStyle][py2] remove `compat` module (to_bytes)

* remove some unused imports

* clean up to_bytes definition and unittests

* Revert "clean up to_bytes definition and unittests"

This reverts commit e726539e1768172a411ff60e63fab82f164343cf.

* use `b` prefix instead of `encode()`

198c7993
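
The last step of this commit, using a `b` prefix instead of `encode()`, relies on the two forms producing equal `bytes` objects for ASCII text. A minimal sketch in plain Python, not taken from the Paddle sources; the variable names and the sample strings are illustrative only:

```python
# Illustrative only: these names do not come from the Paddle codebase.
encoded_at_runtime = "sendrecv".encode()  # str -> bytes, UTF-8 by default
bytes_literal = b"sendrecv"               # bytes literal, no method call

# For ASCII text the two spellings give equal bytes objects.
same = encoded_at_runtime == bytes_literal

# A bytes literal may only contain ASCII characters, so non-ASCII text
# still needs an explicit encode().
non_ascii = "飞桨".encode("utf-8")
```

The literal form avoids a method call and makes the type visible at the point of use, which is presumably why it is preferred once Python 2 compatibility is no longer needed.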

fix dygraph new format problem export in QAT (#47023) · 6566b8f5
Committed by Guanghua Yu on Oct 17, 2022

6566b8f5

fix unittest test_post_training_quantization_lstm_model problem (#47024) · f4ea771d
Committed by Guanghua Yu on Oct 17, 2022

f4ea771d

[Custom Device] Add singleton to custom device (#46963) · 73196e5a

Committed by duanyanhui on Oct 17, 2022

* add singleton to custom device

* Update custom_device.cc

Init device_init_flag_ in default

73196e5a

14 Oct, 2022 · 4 commits

Fix collective APIs cannot be recognized when building docs (#46962) · 2010bdc3
Committed by Wen Sun on Oct 14, 2022

2010bdc3

[AutoParallel] adapt for gpt-gen (#46771) · 31a437b1

Committed by zhaoyingli on Oct 14, 2022

* for gpt-gen

* fix reshard

* adapt assign and shape op

* add dist_assign & unittest

* add conditional block unittest

* rename unittest

31a437b1

remove BackendType in inference api. (#46942) · eb429936
Committed by Wilber on Oct 14, 2022

eb429936

[inference][trt] fix reshape2 opteller and elementwise min/max trt registration (#46861) · 2f9de5f3
Committed by Zhang Jun on Oct 14, 2022

2f9de5f3

13 Oct, 2022 · 14 commits

[geometric] Add unittest for send_uv (#46948) · f6ae9fb9
Committed by Siming Dai on Oct 13, 2022

f6ae9fb9

Fix quantize model deploy bugs when using MKLDNN (#45920) · 561fd8c8

Committed by yeliang2258 on Oct 13, 2022

* fix immutable op quantize bugs

* fix

* fix build bug

* fix test

* notest,test=inference

* fix ppyoloe acc drop bugs

* fix test

* fix test

* add test

* fix

* fix

* fix test

* fix refined name bug

* fix test

* bias fix

* fix matmul weight dequant bug

* re-ci

* fix tester

* fix test

* fix tester

* update weight dequantize func

* update code

* update test for coverage

* update test

* update cmake

* update cmakelist

* update code

* rerun ci

* remove useless code

561fd8c8

logsumexp support fp16 (#45817) · 910e1b6a
Committed by xiaohemaikoo on Oct 13, 2022

910e1b6a

[Paddle Inference] Add bmm trt convert layer. (#46877) · e86dbd62
Committed by xiaoxiaohehe001 on Oct 13, 2022

e86dbd62

add thread name for dataloader (#46990) · 770501b8
Committed by Leo Chen on Oct 13, 2022

770501b8

Add symbolic shape deduction function for unfold, scatter_nd_add, p_norm, grid_sampler, pad3d, etc (#46291) · 46f8e882
Committed by weishengying on Oct 13, 2022

46f8e882

Tests for other dtypes corrected (#46836) · fa2f67a5
Committed by Paulina Gacek on Oct 13, 2022

fa2f67a5

[Zero-Dim] support 0D for paddle.transpose/reshape/stack/tile/unsqueeze (#46555) · 78add057
Committed by zhouweiwei2014 on Oct 13, 2022

78add057

[BUG]Fix expand_as_v2 bug while X and Y with different dtype (#46950) · 97a68ad2

Committed by Aurelius84 on Oct 13, 2022

* [BUG]Fix expand_as_v2 bug while X and Y with different dtype

* fix commit

97a68ad2

Revert #46111 (#46961) · cf9ca61d

Committed by Zhang Ting on Oct 13, 2022

* Revert "[Hackathon No.56&38] Implement float16 data type support and forward-pass speedup for the deformable_conv_v1 operator (#46111)"

cf9ca61d

[WIP] PaddlePaddle distributed reinforcement learning feature development (#45998) · f0afcabc

Committed by Xinger on Oct 13, 2022

* add rpc module in cpp side

* add rpc module in python side

* support win32 and mac for rpc

* code optimization

* optimize code

* update rpc

* update rpc launch

* rpc remove rank and world_size api

* fix logger import bug

* remove support for win and mac

* remove support for xpu, npu, cinn and rocm

* remove support for xpu, npu, cinn and rocm

* fix shutdown barrier timeout bug

* update:python_rpc_handler to shared ptr

* fix master shutdown first bug

* tests support for cpu

* update log to vlog

* update get service info api

* add single process test case

* remove process group

* remove some useless dependencies

* update rpc api comments

* update rpc comments: Example to Examples

* update rpc api comments

* update rpc api comments

* update launch api comments

* update init_rpc comments

* update rpc sync and async comments

* fix bug: init_rpc can't be called repeatedly in a process

* update rpc api comment: make master endpoint unique

* update rpc api:service to worker, timeout_ms to timeout

* rename ServiceInfo to WorkerInfo

* refactor: rename server to worker, log to vlog

* add launch test

* remove unused codes

* refine

f0afcabc

【Paddle Hackathon No.11】 (#45595) · 8474392d

Committed by yangguohao on Oct 13, 2022

* 2022-08-30_update nn.layer.loss nn.functional.loss, test_file

* 2022-08-30_update nn.layer.loss nn.functional.loss, test_file

* fix: test_file

* fix: test_file, docs, multi_margin_loss

* fix: doc weight function

* fix: test_multi_margin_loss

* fix: weight np.testing.assert_allclose

* fix: test_file

* fix: en_doc

* 2022-10-10

8474392d

remove compat.round (#46923) · f246ebba
Committed by Nyakku Shigure on Oct 13, 2022

f246ebba

[CodeStyle][F401] fix incremental flake8 F401 and F541 issues (#46926) · f4a5fe95
Committed by Nyakku Shigure on Oct 13, 2022

f4a5fe95

12 Oct, 2022 · 3 commits

[Auto Parallel] Improve the fine-grained APIs (#46552) · 686fa07a

Committed by Yulong Ao on Oct 12, 2022

* [Auto Parallel] Support different dataloaders

* [Auto Parallel] Add num_shards config for dataset

* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py

* [Auto Parallel] Add the prepare API and replace __call__ with run

* [Auto Parallel] Improve the private implementations of Engine

* [Auto Parallel] Set capacity of dataloader for opt tuning

* [Auto Parallel] [WIP] Change the fine-grained API

* [Auto Parallel] Improve APIs to support different use cases

* [Auto Parallel] Add removed config

* [Auto Parallel] Add imports

* [Auto Parallel] Fix bugs for to_static

* [Auto Parallel] Remove unnecessary imports

686fa07a

[Zero-Dim] support input 0D Tensor for some unary api (#45992) · 05c2b9ba

Committed by zhouweiwei2014 on Oct 12, 2022

* [Zero-Dim] support input 0D Tensor for unary api

* fix CI

05c2b9ba

[CodeStyle][F401] remove unused imports in unittests/dygraph_to_static,ir (#46787) · ea0d84bb

Committed by Nyakku Shigure on Oct 12, 2022

* [CodeStyle][F401] remove unused imports in unittests/dygraph_to_static

* [CodeStyle][F401] remove unused imports in unittests/ir

* add noqa after required imports

ea0d84bb

BaiXuePrincess / Paddle · In sync with the fork's source project