提交 · 7c92177ce4260ed99fbc18da0be154804bdf22ff · BaiXuePrincess / Paddle

17 10月, 2022 11 次提交
- G
  Add enable_partial_send_recv switch in pipeline_configs (#46992) · b9a2f29c
  由 Ghost Screaming 提交于 10月 17, 2022
```
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* Support allow_partial switch, which can be configure in
pipeline_configs. If sent tensor are not the same from
different hosts, they shouldn't been sent partially and
then concated as a whole tensor.

* Change name allow_partial to enable_partial_send_recv.

* Add global variable _enable_partial_send_recv
```
  b9a2f29c
- G
  Support BF16 training for sharding (#46846) · 0b39b244
  由 Ghost Screaming 提交于 10月 17, 2022
```
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* Support bfloat16 type for reducer and sharding.

* Fix some bug.

* Polish code.

* Polise code.

* Add bfloat16 datatype in fill_grad kernels.
Co-authored-by: Nsneaxiy <sneaxiy@126.com>
```
  0b39b244
- H
  Revert "add common subexpression elimination (#44386)" (#47062) · 7c6835ca
  由 hong 提交于 10月 17, 2022
```
This reverts commit 166ff39a.
```
  7c6835ca
- Y
  [PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
  由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
  ec749398
- W
  
  support __floordiv__ (#47060) · 64307903
  由 Weilong Wu 提交于 10月 17, 2022
  
  64307903
- W
  Layernorm shift partition enhance (#46816) · 9e08633c
  由 Wang Bojun 提交于 10月 17, 2022
```
* first version of ln_s_p with s>0

* refine and UT

* pass opt draft

* pass opt

* code refine

* code-style

* bug fix

* fix ci test

* code style
```
  9e08633c
- J
  
  fix for conv_bias_mkldnn_pass (#47037) · acbda3e4
  由 jakpiase 提交于 10月 17, 2022
  
  acbda3e4
- P
  skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr (#46911) · 2e7dc666
  由 pangyoki 提交于 10月 17, 2022
```
* skip ReplaceAllReduceOp in GraphtoBlock when nccl_ctxs_ is nullptr

* update ut

* test_dist_allreduce_op failed

* fix test_dist_allreduce_op

* add ut

* fix nccl cpu compile

* fix
```
  2e7dc666
- J
  
  add shape info into eager log (#46934) · 74938395
  由 Jiabin Yang 提交于 10月 17, 2022
  
  74938395
- H
  
  fix typo error in operator.cc (#46995) · 328236d2
  由 HongyuJia 提交于 10月 17, 2022
  
  328236d2
- W
  
  [Eager] use CastPyArg2Double to parse python float obj (#47029) · b4a1f43f
  由 Weilong Wu 提交于 10月 17, 2022
  
  b4a1f43f
16 10月, 2022 1 次提交
- Z
  
  add common subexpression elimination (#44386) · 166ff39a
  由 ZeKai Zhou 提交于 10月 16, 2022
  
  166ff39a
15 10月, 2022 1 次提交
- H
  
  delete GetExpectedKernelType mkldnn of transpose2 (#46977) · 64b61fc4
  由 HongyuJia 提交于 10月 15, 2022
  
  64b61fc4
14 10月, 2022 6 次提交
- C
  Simplify conv_mkldnn op registration (#46907) · eded6013
  由 Chen Weihang 提交于 10月 14, 2022
```
* simplify conv_mkldnn op registration

* remove custom type value in conv grad op
```
  eded6013
- W
  TRT pool2d adaptive mode bugfix (#46802) · eb32746a
  由 Wang Bojun 提交于 10月 14, 2022
```
* draft with debug print
```
  eb32746a
- W
  
  remove BackendType in inference api. (#46942) · eb429936
  由 Wilber 提交于 10月 14, 2022
  
  eb429936
- Z
  
  [inference][trt] fix reshape2 opteller and elementwise min/max trt registration (#46861) · 2f9de5f3
  由 Zhang Jun 提交于 10月 14, 2022
  
  2f9de5f3
- W
  Add more record event in run program op (#46949) · 48bb2c0a
  由 WangZhen 提交于 10月 14, 2022
```
* Add more record event in run program op

* Refine code

* Restore code

* Rename event
```
  48bb2c0a
- S
  
  Update distributed_strategy.proto (#46531) · fcdc6777
  由 Shijie 提交于 10月 14, 2022
  
  fcdc6777
13 10月, 2022 13 次提交

Fix quantize model deploy bugs when using MKLDNN (#45920) · 561fd8c8

由 yeliang2258 提交于 10月 13, 2022

* fix immutable op quantize bugs

* fix

* fix build bug

* fix test

* notest,test=inference

* fix ppyoloe acc drop bugs

* fix test

* fix test

* add test

* fix

* fix

* fix test

* fix refined name bug

* fix test

* bias fix

* fix matmul weight dequant bug

* re-ci

* fix tester

* fix test

* fix tester

* update weight dequantize func

* update code

* update test for converage

* update test

* update cmake

* update cmakelist

* update code

* rerun ci

* remove useless code

561fd8c8

X

[Paddle Inference] Add bmm trt convert layer. (#46877) · e86dbd62
由 xiaoxiaohehe001 提交于 10月 13, 2022

e86dbd62
L

add thread name for dataloader (#46990) · 770501b8
由 Leo Chen 提交于 10月 13, 2022

770501b8
W

test=infer-coverage (#46983) · f856fc8d
由 Wangzheee 提交于 10月 13, 2022

f856fc8d
W
Add symbolic shape deduction function for unfold, scatter_nd_add, p_norm,... · 46f8e882
由 weishengying 提交于 10月 13, 2022
```
Add symbolic shape deduction function for unfold, scatter_nd_add, p_norm, grid_sampler, pad3d, etc (#46291)
```
46f8e882
[Zero-Dim] support 0D for paddle.transpose/reshape/stack/tile/unsqueeze (#46555) · 78add057
由 zhouweiwei2014 提交于 10月 13, 2022

78add057
W
[Paddle Inference]test=infer-coverage (#46955) · 19438131
由 Wangzheee 提交于 10月 13, 2022
```
* test=infer-coverage
```
19438131
Y

fix bugs (#46951) · 20335b7c
由 YuanRisheng 提交于 10月 13, 2022

20335b7c
A
[BUG]Fix expand_as_v2 bug while X and Y with different dtype (#46950) · 97a68ad2
由 Aurelius84 提交于 10月 13, 2022
```
* [BUG]Fix expand_as_v2 bug while X and Y with different dtype

* fix commit
```
97a68ad2

[WIP]飞桨PaddlePaddle 分布式强化学习功能研发 (#45998) · f0afcabc

由 Xinger 提交于 10月 13, 2022

* add rpc module in cpp side

* add rpc module in python side

* support win32 and mac for rpc

* 代码优化

* 优化代码

* update rpc

* update rpc launch

* rpc remove rank and world_size api

* fix logger import bug

* remove support for win and mac

* remove support for xpu, npu, cinn and rocm

* remove support for xpu, npu, cinn and rocm

* fix shutdown barrier timeout bug

* update:python_rpc_handler to shared ptr

* fix master shutodwn first bug

* tests support for cpu

* update log to vlog

* update get service info api

* add single process test case

* remove process group

* remove some useless dependencies

* update rpc api comments

* update rpc comments: Example to Examples

* update rpc api comments

* update rpc api comments

* update launch api comments

* update init_rpc comments

* update rpc sync and async comments

* fix bug: init_rpc cant be called repeatly in a process

* update rpc api comment: make master endpoint unique

* update rpc api:service to worker, timeout_ms to timeout

* rename ServiceInfo to WorkerInfo

* refactor: rename server to worker, log to vlog

* add launch test

* remove unused codes

* refine

f0afcabc

L
[new-exec] remove variable scope, stage2 (#43936) · 1230a3f4
由 Leo Chen 提交于 10月 13, 2022
```
* remove class ScopeBase

* reopen test
```
1230a3f4

[Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (#46606) · ef1c8759

由 HongyuJia 提交于 10月 13, 2022

* remove PADDLE_WITH_MKLDNN, test white_list=abs

* fix unique_ptr

* fix op.Type()

* remove TODO in kernel_dispatch.h

* remove IndicateVarDataType function, update white_list

* remove mkldnn hard code

* add comments

* fix ==

* update mkldnn_op_list

* delete hard code of OPs

* update mkldnn_op_list

* update mkldnn_op_list, remove interp

* add error check for ExecutionContext

* update mkldnn_op_list, remove transpose2_grad

* remove interpolate mkldnn

* remove fill_constant mkldnn

* opt HasAttr in DygraphExecutionContext

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_black_list

* update mkldnn_op_list, add assert error op

* solve cudnn related op

* fix error

* add mkldnn fallback in phi_utils.cc

* remove mkldnn fallback in phi_utils.cc

* opt code implementation

* polish Copyright License

ef1c8759

Add unsigned int8 scale propagation (#46378) · c72b3bfa

由 joanna.wozna.intel 提交于 10月 13, 2022

* Add unsigned int8 propagation

* Add or modify unit tests

* Correct concat scale checking

* Apply review suggestions

* Corrections

c72b3bfa

12 10月, 2022 8 次提交
- S
  fix wz review (#46937) · cdc44a54
  由 sunli 提交于 10月 12, 2022
```
* fix wz review

* update code
```
  cdc44a54
- W
  
  [Eager] polish the place setting code (#46840) · 01baa0b6
  由 Weilong Wu 提交于 10月 12, 2022
  
  01baa0b6
- Z
  
  [Paddle-TRT]support shape tensor is the input of trt-subgraph (#46482) · f2a778c9
  由 zhoutianzi666 提交于 10月 12, 2022
  
  f2a778c9
- L
  clean code of interpretercore (#46891) · 5303b66b
  由 Leo Chen 提交于 10月 12, 2022
```
* refactor

* refine code
```
  5303b66b
- W
  
  test=infer-coverage (#46924) · 21fab90d
  由 Wangzheee 提交于 10月 12, 2022
  
  21fab90d
- W
  
  remove all control_vars in IR graph (#46888) · bf1dc548
  由 weishengying 提交于 10月 12, 2022
  
  bf1dc548
- Z
  
  support generating code of opmaker for backward op invoke forward op (#46912) · 227ab74d
  由 zyfncg 提交于 10月 12, 2022
  
  227ab74d
- S
  Fix some operators when the tensor.numel() > INT32_MAX (#46767) · e896567e
  由 sneaxiy 提交于 10月 12, 2022
```
* fix some ops for int64 range

* update error message
```
  e896567e

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致