Commits · 13cf4cede2c563b622335079dde391ef5c57a63c · PaddlePaddle / Paddle

15 Jun, 2022 · 11 commits
- [IPU] Decoupling ipu sharding and modeling (#43164) · 13cf4ced
  Committed by Allen Guo on Jun 15, 2022
```
* Decoupling ipu sharding and modeling (#665)

* feat(shard): decoupling shard setting with modeling.

* fix(shard): split test cases to avoid failure.

* fix(shard): add function docs and fix typo.

* test(shard): add tests.

* test(shard): more test case.

* fix(): change ipu_index/stage default value to -1.

* fix format
Co-authored-by: czr-gc <96037699+czr-gc@users.noreply.github.com>
```
- set_state_dict not use state_dict hook (#43407) · 1d0d6594
  Committed by zhangbo9674 on Jun 15, 2022
```
* set_state_dict not use state_dict hook

* add ut

* refine doc
```
- Conv3 support bias (#43458) · 64e2f10c
  Committed by zhangkaihuo on Jun 15, 2022
- modify index dtype from int to int64_t of concat_and_split_functor (#43479) · 81abaaf5
  Committed by Guoxia Wang on Jun 15, 2022
- revert -Wno-error=maybe-uninitialized (#43511) · a89060ac
  Committed by Leo Chen on Jun 15, 2022
- place all save/load path into temporary directory (#43451) · 0c51f241
  Committed by zhaoyingli on Jun 15, 2022
```
* use tempfile to place temporary files

* update

* revert test_communicator

* fix test_dist_base
```
- Rename yaml (#43470) · fcd32950
  Committed by zyfncg on Jun 15, 2022
```
* rename yaml file

* fix merge conflict

* fix infrt
```
- add some kernels(csr*dense->csr, dense*dense->csr) of SparseTensor matmul (#42935) · 346efe96
  Committed by zhouweiwei2014 on Jun 15, 2022
```
* add some kernel(csr*dense->csr, dense*dense->csr) of SparseTensor matmul

* fix CI

* fix CI

* fix comment

* fix comment
```
- [DOC] fix dist api document(#43394) · 19eb0eb8
  Committed by Zhong Hui on Jun 15, 2022
- Use int64_t in GetGpuLaunchConfig1D and ElementwiseKernel as index type to... · 15577630
  Committed by Yiqun Liu on Jun 15, 2022
```
Use int64_t in GetGpuLaunchConfig1D and ElementwiseKernel as index type to support large tensor. (#43506)

* Change some data type from int to int64_t in GetGpuLaunchConfig1D to support large tensor.

* Use int64_t in ElementwiseKernel as index type to support large tensor.
```
- Refactor dynload/port.h (#43431) · 332fdd1e
  Committed by Ruibiao Chen on Jun 15, 2022
```
* Refactor port.h

* Remove some unnecessary code

* Fix CI errors
```
  332fdd1e
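
Two commits in this log, #43451 above and #43376 further down, move files written by tests into temporary directories. The following is a minimal Python sketch of that general pattern using the standard `tempfile` module. It is not taken from the Paddle sources; the `SaveLoadTest` class and the `model.params` file name are hypothetical.

```python
import os
import tempfile
import unittest


class SaveLoadTest(unittest.TestCase):
    """Hypothetical test case; illustrates the pattern only."""

    def setUp(self):
        # One private directory per test, so nothing lands in the working tree.
        self.temp_dir = tempfile.TemporaryDirectory()

    def tearDown(self):
        # Remove the directory and everything the test wrote into it.
        self.temp_dir.cleanup()

    def test_round_trip(self):
        # Hypothetical file name; every save/load path is built under temp_dir.
        path = os.path.join(self.temp_dir.name, "model.params")
        with open(path, "wb") as f:
            f.write(b"\x00\x01")
        with open(path, "rb") as f:
            self.assertEqual(f.read(), b"\x00\x01")
```

Creating the directory in `setUp` and cleaning it up in `tearDown` keeps tests from colliding on shared file names when they run in parallel.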
14 Jun, 2022 · 26 commits
- fix is_test bug in fused_feedforward. (#43508) · 193ab32c
  Committed by Li Min on Jun 14, 2022
- [Eager] Fix edvr starganv2 (#43471) · c62a7e25
  Committed by Jiabin Yang on Jun 14, 2022
```
* fix starganv2

* fix starganv2 stop_gradient end error

* fix edvr_starganv2

* fix mul kernel to fix optional ddx

* fix typo
```
- Support sequential run GPU OPs for standalone executor (#43243) · 8cec1271
  Committed by Ruibiao Chen on Jun 14, 2022
```
* Support sequential run for standalone executor

* Add UTs

* Fix test_standalone_multiply_write

* Remove unnecessary UTs
```
- [MLU] add mlu kernel for depthwise conv2d op (#43359) · 077f3788
  Committed by cambriconhsq on Jun 14, 2022
- [MLU]: add elementwise_max mlu kernel (#43365) · ceb6b3f1
  Committed by zhaoying9105 on Jun 14, 2022
```
* [MLU]: add elementwise_max mlu kernel

* [MLU]: add int32 support for elementwise maxk MLU kernel
```
- [MLU]: add log log10 log2 MLU kernel (#43360) · 4642e8c4
  Committed by zhaoying9105 on Jun 14, 2022
- fix warning infos of recompute (#43495) · ed6f1f90
  Committed by Haohongxiang on Jun 14, 2022
- fix whl check (#43415) · b1f77b4d
  Committed by tianshuo78520a on Jun 14, 2022
- [pre-commit] make pre-commit a single pipeline (#43469) · cb2958ae
  Committed by Sing_chan on Jun 14, 2022
- [Dy2St]Refine ifelse early return (#43328) · 1950a360
  Committed by WangZhen on Jun 14, 2022
```
* Refine ifelse early return
```
- [IPU]update paddle.distributed.launch (#43311) · 083d769b
  Committed by yaozhixin on Jun 14, 2022
```
* update paddle.distributed.launch

* add sample code

* update shell

* fix typo

* fix typo

* update docs

* rm code

* fix doc 2

* fix doc 3

* fix doc 4
Co-authored-by: root <root@sgjur-pod004-1.ipu.graphcore.cn>
```
- fix update loss scaling (#43487) · 0e6462d6
  Committed by sneaxiy on Jun 14, 2022
- [cuda graph] partial program with cuda graph under static mode (#43440) · d83d59dd
  Committed by Yuang Liu on Jun 14, 2022
- updata_README (#43391) · db58dd27
  Committed by Ligoml on Jun 14, 2022
- [windows CI]open inference_ut in windows-inference pipeline (#43446) · 058c52b6
  Committed by Sing_chan on Jun 14, 2022
```
* open inference_ut;test=windows_ci_inference

* inference_ut need onnx;test=windows_ci_inference

* disable trt_split_converter_test; use higher parallel level

* too high parallel will cause ut timeout
```
- fix compiling werror (#43337) · c6421019
  Committed by Zhang Jun on Jun 14, 2022
- [ Make FLAGS_einsum_opt as default ] Einsum memory optimization (#43397) · 83abec60
  Committed by xiongkun on Jun 14, 2022
```
* change logic for optimize

* modifty

* optimize the backward speed of EinsumOp

* add cache optimizer for einsum op

* EinsumOp: fix new dygraph mode error

* fix bug

* change Cache->InnerCache

* fix code

* fix

* add nan inf utils for einsum op

* add as_extra

* memory optimizer for einsum

* update code
```
- fix the bug that _DataLoaderIterMultiProcess use time to generate the seed (#43318) · 2106f668
  Committed by Zhang Ting on Jun 14, 2022
```
* fix the bug that _DataLoaderIterMultiProcess use time to generate the seed

* use np.random.randint to generate a base seed
```
- 【code format check upgrade】 step3：enable clang-format sort these infrt files's headers (#43333) · 403b127b
  Committed by Sing_chan on Jun 14, 2022
- 【code format check upgrade】 step3：enable clang-format sort these cinn files's headers(#43329) · d14e3698
  Committed by Sing_chan on Jun 14, 2022
- Use tempfile to place all the temporary files. Modify some code structure. (#43376) · 95f66c26
  Committed by freeliuzc on Jun 14, 2022
- fix cmake-lint problems. (#43406) · 59f89236
  Committed by Wilber on Jun 14, 2022
```
* cmake-lint

* update
```
- Add a library size pruning tool for Jetson inference (#43453) · d74d1838
  Committed by Shang Zhizhou on Jun 14, 2022
```
* test=document_fix

* test=document_fix; add patch file

* test=document_fix;update style

* test=document_fix;update patch file

* test=document_fix;remove useless patch file
```
- fix bug of infer shape for slice (#43443) · e0a01461
  Committed by zyfncg on Jun 14, 2022
- [Eager] Fix custom op error (#43463) · 42754088
  Committed by Jiabin Yang on Jun 14, 2022
```
* fix custom op error

* fix code error
```
- Fix numpy 1.20+ deprecation warnings (#42929) · 90cf2299
  Committed by zlsh80826 on Jun 14, 2022
```
* Replace np.bool/np.bool8 with np.bool_

* Replace np.object with np.object_

* Replace np.complex with np.complex128

* Replace np.float with np.float64

* Replace np.int with np.int_

* Rerun pre-commit for newer pre-commit configuration

* Use builtin bool instead of np.bool_ based on the context
```
  90cf2299
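
The replacements listed in #42929 follow from NumPy 1.20 deprecating the aliases `np.bool`, `np.object`, `np.complex`, `np.float`, and `np.int`. Below is a minimal sketch of the replacement forms, assuming only that NumPy is installed. The helper `make_mask` and the variable names are hypothetical and not from the Paddle code base.

```python
import numpy as np


def make_mask(n):
    # Before the change this might have read dtype=np.bool (deprecated alias).
    # The builtin bool works wherever a plain boolean dtype is meant.
    return np.zeros(n, dtype=bool)


mask = make_mask(3)

# Explicit NumPy scalar types named in the commit message.
weights = np.ones(3, dtype=np.float64)               # instead of np.float
counts = np.arange(3, dtype=np.int_)                 # instead of np.int
phases = np.zeros(3, dtype=np.complex128)            # instead of np.complex
labels = np.array(["a", 1, None], dtype=np.object_)  # instead of np.object
```

The builtin `bool` and `np.bool_` produce the same array dtype, which is why the last bullet of the commit can pick whichever reads better in context.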
13 Jun, 2022 · 3 commits
- [MLU] add UTs for mlu interp_v2(bilinear). (#43386) · a0363d18
  Committed by Chenxiao Niu on Jun 13, 2022
- [MLU]add lookup_table_v2 op and fix amp feature of bert with mlu device (#43366) · 67bd5d9c
  Committed by qipengh on Jun 13, 2022
- add mlu interp_v2(nearest&bilinear). (#43383) · affe25b7
  Committed by Chenxiao Niu on Jun 13, 2022

PaddlePaddle / Paddle · synced successfully about 1 year ago
