- 16 Jun 2022, 3 commits
-
Committed by Leo Chen
* fix xpu kp compilation
* add depends
-
Committed by Leo Chen
* lazy creating work queue
* fix dry_run
-
Committed by gongweibao
-
- 15 Jun 2022, 17 commits
-
Committed by huzhiqiang
-
Committed by Yiqun Liu
* Optimize prod's python implementation for dygraph.
* Change key_dim to head_dim.
* Add comment in unittest.
* Disable TF32 in unittest.
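A rough usage sketch of the op whose dygraph Python path this commit optimizes; the tensor values and shapes below are illustrative and not taken from the PR.

```python
import paddle

# Illustrative call to paddle.prod in dygraph mode (values are made up).
x = paddle.to_tensor([[1.0, 2.0], [3.0, 4.0]])
out = paddle.prod(x, axis=1)  # product along axis 1 -> [2., 12.]
print(out.numpy())
```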
-
Committed by fwenguang
-
Committed by Leo Chen
-
Committed by fwenguang
-
Committed by fwenguang
-
Committed by Allen Guo
* Decoupling ipu sharding and modeling (#665)
* feat(shard): decoupling shard setting with modeling.
* fix(shard): split test cases to avoid failure.
* fix(shard): add function docs and fix typo.
* test(shard): add tests.
* test(shard): more test case.
* fix(): change ipu_index/stage default value to -1.
* fix format
Co-authored-by: Nczr-gc <96037699+czr-gc@users.noreply.github.com>
-
Committed by zhangbo9674
* set_state_dict does not use the state_dict hook
* add ut
* refine doc
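A minimal sketch of the set_state_dict round trip this change touches; the Linear layer and its shape are illustrative only.

```python
import paddle

# Save and restore parameters on a Layer; set_state_dict is the entry point
# affected by the change above (the layer choice is arbitrary).
layer = paddle.nn.Linear(4, 2)
state = layer.state_dict()   # collect parameters and buffers
layer.set_state_dict(state)  # load them back
```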
-
Committed by zhangkaihuo
-
Committed by Guoxia Wang
-
Committed by Leo Chen
-
Committed by zhaoyingli
* use tempfile to place temporary files
* update
* revert test_communicator
* fix test_dist_base
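A small sketch of the tempfile pattern described in the first item; the file name is hypothetical.

```python
import os
import tempfile

# Keep unit-test artifacts inside a TemporaryDirectory so they are cleaned up
# automatically when the test finishes.
with tempfile.TemporaryDirectory() as tmp_dir:
    ckpt_path = os.path.join(tmp_dir, "model.pdparams")  # hypothetical file name
    with open(ckpt_path, "w") as f:
        f.write("dummy checkpoint")
# tmp_dir and its contents are removed when the with-block exits
```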
-
Committed by zyfncg
* rename yaml file
* fix merge conflict
* fix infrt
-
Committed by zhouweiwei2014
* add some kernels (csr*dense->csr, dense*dense->csr) of SparseTensor matmul
* fix CI
* fix CI
* fix comment
* fix comment
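For orientation, here is the csr*dense product these kernels implement, sketched with SciPy rather than Paddle's sparse API (which this log does not show).

```python
import numpy as np
from scipy import sparse

# CSR sparse matrix times a dense matrix; SciPy returns a dense result here,
# while the kernels above also cover CSR-valued outputs.
csr = sparse.csr_matrix(np.array([[0.0, 2.0], [3.0, 0.0]]))
dense = np.ones((2, 3))
print(csr @ dense)
```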
-
Committed by Zhong Hui
-
Committed by Yiqun Liu
Use int64_t in GetGpuLaunchConfig1D and ElementwiseKernel as index type to support large tensors. (#43506)
* Change some data types from int to int64_t in GetGpuLaunchConfig1D to support large tensors.
* Use int64_t in ElementwiseKernel as index type to support large tensors.
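The reason for the wider index type: element counts above 2^31 - 1 overflow a 32-bit index. A quick check:

```python
# A 50000 x 50000 float tensor already has more elements than int32 can index.
INT32_MAX = 2**31 - 1        # 2,147,483,647
numel = 50_000 * 50_000      # 2,500,000,000
print(numel > INT32_MAX)     # True -> a 32-bit index would overflow
```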
-
Committed by Ruibiao Chen
* Refactor port.h
* Remove some unnecessary code
* Fix CI errors
-
- 14 Jun 2022, 20 commits
-
Committed by Li Min
-
Committed by Jiabin Yang
* fix starganv2
* fix starganv2 stop_gradient end error
* fix edvr_starganv2
* fix mul kernel to fix optional ddx
* fix typo
-
Committed by Ruibiao Chen
* Support sequential run for standalone executor
* Add UTs
* Fix test_standalone_multiply_write
* Remove unnecessary UTs
-
Committed by cambriconhsq
-
Committed by zhaoying9105
* [MLU]: add elementwise_max MLU kernel
* [MLU]: add int32 support for elementwise_max MLU kernel
-
Committed by zhaoying9105
-
Committed by Haohongxiang
-
Committed by tianshuo78520a
-
Committed by Sing_chan
-
Committed by WangZhen
* Refine ifelse early return
-
Committed by yaozhixin
* update paddle.distributed.launch
* add sample code
* update shell
* fix typo
* fix typo
* update docs
* rm code
* fix doc 2
* fix doc 3
* fix doc 4
Co-authored-by: Nroot <root@sgjur-pod004-1.ipu.graphcore.cn>
-
Committed by sneaxiy
-
Committed by Yuang Liu
-
Committed by Ligoml
-
Committed by Sing_chan
* open inference_ut;test=windows_ci_inference
* inference_ut needs onnx;test=windows_ci_inference
* disable trt_split_converter_test; use higher parallel level
* too high a parallel level will cause ut timeout
-
Committed by Zhang Jun
-
Committed by xiongkun
* change logic for optimize
* modify
* optimize the backward speed of EinsumOp
* add cache optimizer for einsum op
* EinsumOp: fix new dygraph mode error
* fix bug
* change Cache->InnerCache
* fix code
* fix
* add nan inf utils for einsum op
* add as_extra
* memory optimizer for einsum
* update code
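A usage sketch of the einsum op being optimized here; the equation and shapes are illustrative only.

```python
import paddle

# Matrix multiply expressed through einsum; the commits above target the
# backward pass and caching of this op, not its user-facing API.
x = paddle.rand([2, 3])
y = paddle.rand([3, 4])
out = paddle.einsum("ij,jk->ik", x, y)  # shape [2, 4]
print(out.shape)
```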
-
Committed by Zhang Ting
* fix the bug that _DataLoaderIterMultiProcess uses time to generate the seed
* use np.random.randint to generate a base seed
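A sketch of the fix described above (not Paddle's actual code): draw the base seed from np.random.randint instead of the wall clock, then offset it per worker.

```python
import numpy as np

# Wall-clock seeds can collide when workers start at the same instant; a random
# base seed plus a per-worker offset avoids that. The worker count is hypothetical.
base_seed = np.random.randint(0, 2**31 - 1)
worker_seeds = [base_seed + worker_id for worker_id in range(4)]
print(worker_seeds)
```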
-
Committed by Sing_chan
-
Committed by Sing_chan
-