Commits · 1cb4c154d1f5a6d550bcbf861e62cb37f0c87ca0 · PaddlePaddle / Paddle

26 Jul, 2021 3 commits

Fix the bug in paddle.distributed.split demo. The paddle.distributed.split op... · 1cb4c154

Committed by 李季 on Jul 26, 2021

Fix the bug in paddle.distributed.split demo. The paddle.distributed.split op can only be used in static mode. (#34306)

* fix the bug in paddle.distributed.split demo

1cb4c154

[NPU] add hard_sigmoid (#34094) · b5d8f43e

Committed by furnace on Jul 26, 2021

* [NPU] add hard_sigmoid

* [NPU] delete check_dygraph=False and max_relative_error

* [NPU] delete debug codes

* [NPU] add more test cases

* [NPU] add api test TestHardsigmoidAPI

* [NPU] temp delete hard_sigmoid to resolve conflicts

* [NPU] resolve conflicts

b5d8f43e

Fix for failing CI (test_activation_mkldnn_op.py) (#34329) · 81dec05a
Committed by jakpiase on Jul 26, 2021
```
* fixed CI failing

* removed unnecessary imports
```
81dec05a

23 Jul, 2021 6 commits
- Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy... · 577fdde5
  Committed by Aurelius84 on Jul 23, 2021
```
Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348)

This reverts commit 609f8225.
```
  577fdde5
- disable test_dataloader_dataset,test=document_fix (#34353) · 0f60998e
  Committed by YUNSHEN XIE on Jul 23, 2021
  
  0f60998e
- Logical Ops support more data types (#34141) · 27417f1f
  Committed by will-jl944 on Jul 23, 2021
```
* logical ops support int8, int16, int32, int64, float, double

* update docs of logical ops

* fix npu and xpu logical ops

* fix npu and xpu logical ops

* fix bug in xpu logical op code

* update test_logical_op_npu and test_logical_op_xpu

* correct error type
```
  27417f1f
- [NPU] add index_sample_op_npu and tests (#34239) · 63f6ce7b
  Committed by ronnywang on Jul 23, 2021
```
* add index_sample_op_npu and tests

* update
```
  63f6ce7b
- fix bug for num_iters in fit/evaluate (#34059) · 08c5b1d1
  Committed by shangliang Xu on Jul 23, 2021
  
  08c5b1d1
- add npu_sampling_id and tests (#34302) · 04288091
  Committed by ronnywang on Jul 23, 2021
  
  04288091
22 Jul, 2021 11 commits
- copy found_inf to cpu in advance to improve performance (#34274) · 781f4028
  Committed by Leo Chen on Jul 22, 2021
```
* copy found_inf to cpu in advance to improve performance

* add npu test

* add npu test

* refine code

* refine memcpy op

* fix adam
```
  781f4028
- [Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181) · 609f8225
  Committed by Aurelius84 on Jul 22, 2021
```
* modify into program_id

* fix cache_info declare problem

* fix python int to C long problem

* modify point to reference

* add ENVS
```
  609f8225
- fix m1 not found leaf7_features (#34309) · 832c3894
  Committed by tianshuo78520a on Jul 22, 2021
  
  832c3894
- [NPU] update NPU ci tests, test=npu_aarch64 (#34272) · e0da9666
  Committed by Qi Li on Jul 22, 2021
```
* [NPU] update NPU ci tests, test=npu_aarch64

* [NPU] fix x86 build and add disable_ut for NPU, test=npu_aarch64

* [NPU] address review comments, test=develop
```
  e0da9666
- Added sigmoid BF16 FWD/BWD kernels and gelu BF16 BWD kernel (#34216) · 5d3c89cf
  Committed by jakpiase on Jul 22, 2021
```
* added sigmoid BF16 FWD/BWD and gelu BF16 BWD

* added newline at EOF

* switched from lambdas to local functions

* changed function names
```
  5d3c89cf
- enable amp unsupported_fp16_list for npu (#34314) · b0a2f005
  Committed by Leo Chen on Jul 22, 2021
  
  b0a2f005
- Add int16 kernel for lookup_table and dequantize_abs_max op (#34275) · 85e531a9
  Committed by cc on Jul 22, 2021
```
* add int16 kernel for lookup_table and dequantize_abs_max op
```
  85e531a9
- fix hapi fleet bug in static mode (#34311) · 13991b5e
  Committed by Jiaqi Liu on Jul 22, 2021
  
  13991b5e
- Fix the save logic for the qat save unit test. (#34273) · 2fa3d59e
  Committed by Zhen Wang on Jul 22, 2021
  
  2fa3d59e
- fix index error in conv2d_transpose (#34270) · 24c7087f
  Committed by wangguanzhong on Jul 22, 2021
  
  24c7087f
- Support getitem by ellipsis index in dynamic mode (#34267) · 82339ed1
  Committed by zyfncg on Jul 22, 2021
```
* Support getitem by ellipsis index in dynamic mode

* change some code style
```
  82339ed1
21 Jul, 2021 6 commits
- fix DataLoader memory leak. test=develop (#34140) · 6fc33a0c
  Committed by Kaipeng Deng on Jul 21, 2021
  
  6fc33a0c
- add more info to tensor.grad warning message (#34264) · 32cb0f5a
  Committed by chentianyu03 on Jul 21, 2021
  
  32cb0f5a
- [NPU] Refine npu unit tests (#34240) · 712f9fe5
  Committed by Leo Chen on Jul 21, 2021
```
* add npu unittest only if WITH_ASCEND_CL is ON

* remove @unittest.skipIf, since these unittests will only be created when WITH_ASCEND_CL is ON

* open dygraph test for npu test
```
  712f9fe5
- fix os.setsid in windows (#34278) · f50a67eb
  Committed by kuizhiqing on Jul 21, 2021
  
  f50a67eb
- trt reduce_mean supported. (#34204) · aff14962
  Committed by wenbin on Jul 21, 2021
```
* reduce_mean supported. test=allcase

* ut. test=allcase

* test=develop

* ut.test=allcase

* correct name. test=allcase

* correct UT. test=allcase

* correct UT.test=develop

* remove op

* UT

* add convert

* fix timeout issue

* more uts

* more ut

* correct ut
```
  aff14962
- [CPU-PSLIB] Add consistency inspection of op's embedding name and sparse... · 2f76bb8b
  Committed by Fan Zhang on Jul 21, 2021
```
[CPU-PSLIB] Add consistency inspection of op's embedding name and sparse table name in config_fleet.py, test=develop (#34249)
```
  2f76bb8b
20 Jul, 2021 6 commits
- fix crop_tensor op doc (#34263) · c8fb6fc4
  Committed by Thomas Young on Jul 20, 2021
```
* fix crop_tensor op doc

* update code example test=document_fix
```
  c8fb6fc4
- Revert "fix cifar label dimension. test=develop (#33475)" (#34242) · 1f6f2235
  Committed by Kaipeng Deng on Jul 20, 2021
```
This reverts commit 6c110344.
```
  1f6f2235
- [Dy2Stat] Support nested Sequential container (#34246) · 6a9610ed
  Committed by 0x45f on Jul 20, 2021
```
* support nested Sequential container

* rename model path
```
  6a9610ed
- [hybrid optim] reduce send/recv times for recompute, test=develop (#34248) · 3a5f1f22
  Committed by Yuang Liu on Jul 20, 2021
  
  3a5f1f22
- change strided_slice when step<0. (#34205) · 7f2b5be3
  Committed by WeiXin on Jul 20, 2021
```
* change strided_slice when step<0.

* add unittest for paddle.strided_slice

* polish unittest
```
  7f2b5be3
- [hybrid parallel] Optimize pipeline memory (#34230) · a74208c1
  Committed by WangXi on Jul 20, 2021
  
  a74208c1
19 Jul, 2021 8 commits

fix the order of unfold parameters (#34156) · 056b8741
Committed by lzzyzlbb on Jul 19, 2021
```
* fix the order of unfold parameters
```
056b8741
Update while loop (#34229) · 6fbb975d
Committed by Chen Long on Jul 19, 2021
```
* update readme test=document_fix

* update while loop docs test=document_fix
```
6fbb975d
[NPU] add is_empty_op_npu, test=develop (#34234) · d4fb5c68
Committed by Qi Li on Jul 19, 2021

d4fb5c68
Fix format in requantize mkldnn op (#34137) · 1dfd857b
Committed by joanna.wozna.intel on Jul 19, 2021

1dfd857b

[amp] pass found_inf to adam to support skip_update (#34176) · 9bc59673

Committed by Leo Chen on Jul 19, 2021

* pass found_inf to adam

* add unittest

* fix bug

* refine unittest

* change unit test's directory

* disable unittest on cpu

9bc59673

move the recv op to the beginning of the forward/backward phase for pipeline (#34197) · cc007dce
Committed by lilong12 on Jul 19, 2021
```
* mv recv to head, test=develop
```
cc007dce

Add Cuda event and stream API (#32460) · 9c7f6af5

Committed by chentianyu03 on Jul 19, 2021

* add cuda event and stream api

* add cuda event and stream api

* add get_current_stream api

* add get_current_stream api

* init streams

* modify get_current_stream

* modify get_current_stream

* add synchronize func

* add current_stream doc and test file

* move get_current_stream into CUDA macro

* move CudaEvent into CUDA macro

* move _get_current_stream and _device_synchronize into cuda macro

* modify the macro of cuda stream and event

* add test case for synchronize

* add paddle.devices.cuda module

* event and stream support hip

* add doc for stream and event class

* move cuda stream and event into single pybind

* add cuda_streams_py.cc to cmakelist

* add _device_synchronize and _get_current_stream to core module

* add test case for cudastream and cudaevent

* move __all__ in streams.py

* fix test fail

* add cuda to devices __all__

* fix current_stream doc writing error

* move devices to device directory, and merge device.py into __init__.py

* add required:gpu to sample codes

* remove cuda directory from device/__init__.py

9c7f6af5

enabled bf16 tests in prelu (#34196) · 68f51239
Committed by jakpiase on Jul 19, 2021

68f51239

PaddlePaddle / Paddle synced successfully about 1 year ago
