提交 · e7b476c15d9b00ea338b12f9066f553dd833fd95 · BaiXuePrincess / Paddle

28 9月, 2020 4 次提交
- L
  
  Revert "Initialize gloo for low level collective apis (#27356)", test=document_fix (#27665) · 36c04102
  由 lilong12 提交于 9月 28, 2020
  
  36c04102
- W
  add paddle.fluid._cuda_synchronize (#27595) · c68a0313
  由 wanghuancoder 提交于 9月 28, 2020
```
* add paddle.fluid._cuda_synchronize, test=develop

* fix bug about core_avx core_noavx, test=develop

* delete CPUPlace and XPUPlace, test=develop
```
  c68a0313
- L
  Support assignment to a Variable in dynamic mode but not deal with backward. (#27471) · 074a71bd
  由 liym27 提交于 9月 28, 2020
```
* Support assignment to a Variable in dynamic mode. Note: not deal with backward.

* Rewrite VarBase __setitem__ for high-performance.

* try to test 3 means to do __setitem__ and test the performance of 3 means.

* Retain the means of the highest performance: C++ code and don't trace op.
```
  074a71bd
- L
  Initialize gloo for low level collective apis (#27356) · fa73e4a2
  由 lilong12 提交于 9月 28, 2020
```
* add gloo initializer, test=develop
```
  fa73e4a2
27 9月, 2020 1 次提交

add support to float64 input of warpctc op. (#27399) · 1501a80f

由 Li Fuchen 提交于 9月 27, 2020

* add float64 input to ctc_loss

* modified error message of  warpctc

* update repo and tag of warpctc

* add test for warpctc with float64 input

* modified warpctc.cmake to make sure build always

* resolved sample code bug of warpctc

* add core.ops in warpctc dygraph

* fix a bug of test

1501a80f

26 9月, 2020 1 次提交
- J
  
  Add conv2d bfloat16 support (#27325) · b0ee1405
  由 joanna.wozna.intel 提交于 9月 26, 2020
  
  b0ee1405
23 9月, 2020 1 次提交

Make the Bind Method of Tensor more automatic (#27270) · 1e1ae5c5

由 Zhou Wei 提交于 9月 23, 2020

* Makes the Bind Method more intelligent

* Makes the Bind Method more intelligent

* fix unittest

* fix unittest

* fix conflict

1e1ae5c5

21 9月, 2020 2 次提交

[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112) · aba759ba

由 Leo Chen 提交于 9月 21, 2020

* support use add instead of sum to do gradient accumulation

* add inplace addto pass

* add grad_add op and inplace addto pass

* remove debug code

* code refine

* fix bug when sereral sum ops inserts at same op_idx

* fix Flags type

* add addto attribute for conv3d

* fix ut

* code clean

* fix type

aba759ba

Quant op dev (#25932) · 02606d45

由 huangxu96 提交于 9月 21, 2020

* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.

* Finished channel-wise quantize strategy in imperative quantization.

* Added Cuda code of ChannelWiseQuantDequantMaxAbsOP
Add Cuda code of ChannelWiseQuantDequantMaxAbsOp

* Add quant_axis for channel_wise quant.

* fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement.

* Added some assert infomation and fixed some coding style mistakes.

02606d45

15 9月, 2020 1 次提交
- W
  
  [Pass Compatible] Bind python compatible. (#27262) · f827665a
  由 Wilber 提交于 9月 15, 2020
  
  f827665a
14 9月, 2020 2 次提交

J

Add bfloat16 passes (#26999) · 1483ea23
由 joanna.wozna.intel 提交于 9月 14, 2020

1483ea23

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210

由 Zhen Wang 提交于 9月 14, 2020

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update the process of gradients updating skipping if the gradients have infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when find infinite grads.

* add the unit test for UpdateLossScaling Layer.

d708b210

08 9月, 2020 1 次提交

Enhance ops to support LoD as input for dygraph detection models. (#25316) · a28ae86e

由 wangguanzhong 提交于 9月 08, 2020

* enhance collect_op for dygraph, test=develop

* enhance detection ops with lod, test=develop

* support none bbox left in generate_proposals, test=develop

* unfiy MultiLevelRoisNum, test=develop

* update core.ops, test=develop

* add op register for new input & output, test=develop

a28ae86e

07 9月, 2020 1 次提交
- W
  
  Refine python inference api (#26958) · 63212541
  由 Wilber 提交于 9月 07, 2020
  
  63212541
04 9月, 2020 1 次提交
- Y
  
  add cuda generator (#26786) · 7f3e6ca5
  由 yaoxuefeng 提交于 9月 04, 2020
  
  7f3e6ca5
03 9月, 2020 1 次提交
- J
  
  Add bfloat16 data type (#25402) · 95e1434b
  由 joanna.wozna.intel 提交于 9月 03, 2020
  
  95e1434b
02 9月, 2020 1 次提交
- J
  Restore "Add mkldnn bfloat16 option to C-API " (#26882) · 0627a319
  由 joanna.wozna.intel 提交于 9月 02, 2020
```
* Add mkldnn bfloat16 option to C-API

* Add test for bfloat16 gpu

* Change coverage test

* Repair capi_gpu test
```
  0627a319
01 9月, 2020 1 次提交
- 石
  Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854) · ced6e87e
  由石晓伟提交于 9月 01, 2020
```
This reverts commit 02083bda.
```
  ced6e87e
31 8月, 2020 2 次提交

Add use of global flag 'use_mkldnn' to layer_helper (#26497) · 885c61f0

由 arlesniak 提交于 8月 31, 2020

* get use of global 'use_mkldnn' in layer_helper

* update for CI

* update for CI, relu test

* update for CI, relu test added, make FLAGS_use_mkldnn a public flag

* added more strict tests, fixes after review

* fixes after review

* fixes after review, CI stuff

885c61f0

Y

fleet add save with whitelist test=develop (#23376) · a47d92d8
由 yaoxuefeng 提交于 8月 31, 2020

a47d92d8

28 8月, 2020 4 次提交

W
refine paddle inference api (#26774) · 68e0560c
由 Wilber 提交于 8月 28, 2020
```
* refine paddle inference api
Co-authored-by: Nnhzlx <nhzlx.dragon@gmail.com>
```
68e0560c

Refine paddle.manual_seed (#26496) · 844583c8

由 Leo Chen 提交于 8月 28, 2020

* refine manual seed

* fix ci problem

* fix unittests

* fix unittest

* set is_init_py=false in manual_seed

* fix unittest

* fix bernoulli_op

* fix(unittest): change random_seed to manual_seed

* 🐞fix(unittest): fix manual_seed

* trigger ci

* fix test_sentiment

* fix test_imperative_save_load

* fix test_uniform_random_op

* fix test_uniform_random_op

* fix test_jit_save_load

* merge develop

* fix manual_seed

* fix manual_seed

* use global engine

* use shared_ptr

* fix double free

* fix bug

* fix bug

* fix bug

* fix test bug

* fix test bug

* fix test bug

* fix ci

844583c8

J
Add mkldnn bfloat16 option to C-API (#26676) · 02083bda
由 joanna.wozna.intel 提交于 8月 28, 2020
```
* Add mkldnn bfloat16 option to C-API

* Add test for bfloat16 gpu

* Change coverage test
```
02083bda

Update the demo code and the doc of varbase.backward. (#26506) · f9066e6a

由 Zhen Wang 提交于 8月 28, 2020

* update the demo code and the doc of varbase.backward.

* update the doc of the fake interface `paddle.fluid.Variable`.

* remove BackwardStrategy.

f9066e6a

27 8月, 2020 1 次提交
- L
  [api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552) · 1c681383
  由 lilong12 提交于 8月 27, 2020
```
add collective op for cpu using gloo and paddle.distributed.* apis
```
  1c681383
25 8月, 2020 2 次提交

Z
improve unique op (#26537) · 0a895bc0
由 Zhang Ting 提交于 8月 25, 2020
```
* add unique_v2 op

* remove unique_v2 op

* update doc
```
0a895bc0

optimized transformation form tensor to numpy (#26447) · c1f5df52

由 wanghuancoder 提交于 8月 25, 2020

* optimized transformation form tensor to numpy, test=develop

* optimized transformation form tensor to numpy, pass pre-commit, test=develop

* modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop

* modify py:array construct, test=develop

* fix _fetch_var to use deep copy, test=develop

c1f5df52

24 8月, 2020 2 次提交

api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear (#26399) · 422a1620

由 wanghuancoder 提交于 8月 24, 2020

* api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear, test=develop

* api2.0 fix code examples, test=develop

* modify test_bilinear_api, about place,to_tensor , test=develop

* re pass pre-commit, test=develop

* Update common.py

* fix BilinearTensorProduct ci error, test=develop

422a1620

W
add op_function_generator.exe retry in windows, test=develop (#26591) · 6e823cfe
由 wanghuancoder 提交于 8月 24, 2020
```
add op_function_generator.exe retry in windows
```
6e823cfe

23 8月, 2020 1 次提交
- W
  
  add paddle.gather for API2.0 (#26455) · ebf9b212
  由 wangchaochaohu 提交于 8月 23, 2020
  
  ebf9b212
21 8月, 2020 1 次提交

support Baidu Kunlun AI Accelerator (#25959) · 138ecf24

由 QingshuChen 提交于 8月 21, 2020

* support Baidu AI Accelerator
  * test=kunlun

* minor
 * test=kunlun

* support xpu op in separate file
 * test=kunlun

* update XPU error message and remove duplicated code

 * test=kunlun

* minor
 * test=kunlun

* minor
 * test=kunlun

138ecf24

19 8月, 2020 1 次提交
- C
  Add SyncBatchNorm (#26032) · 56890dc7
  由 ceci3 提交于 8月 19, 2020
```
* add SyncBatchNorm,test=develop
```
  56890dc7
18 8月, 2020 2 次提交
- L
  
  Print user-friendly error message in core.ops [part 2] (#26377) · 049ac56c
  由 Leo Chen 提交于 8月 18, 2020
  
  049ac56c
- Y
  
  add cpu random Generator (#26013) · 23261ff4
  由 yaoxuefeng 提交于 8月 18, 2020
  
  23261ff4
17 8月, 2020 1 次提交
- L
  Print user-friendly error message in core.ops (#26261) · 672578a7
  由 Leo Chen 提交于 8月 17, 2020
```
* print user-friendly error message

* adjust error sumary
```
  672578a7
16 8月, 2020 2 次提交
- W
  
  [API2.0] add op for cudnn version query test=develop (#26180) · 0b81d763
  由 wangchaochaohu 提交于 8月 16, 2020
  
  0b81d763
- W
  
  [API2.0] add Device api (set_device and get_device)(#26103) · bb11cbc2
  由 wangchaochaohu 提交于 8月 16, 2020
  
  bb11cbc2
15 8月, 2020 1 次提交

expose and unify the Tensor concepts to the user (#25978) · 6de463d3

由 Zhou Wei 提交于 8月 15, 2020

* expose and unify the Tensor concepts to the user

* expose tensor to user

* add copy place for Tensor

* add copy place for Tensor

* add note

* add macro PADDLE_WITH_CUDA

* remove RUN_TYPE=DIST

* fix some error

6de463d3

14 8月, 2020 1 次提交
- Z
  
  fix_copy_if_different (#25868) · 20147ace
  由 Zhou Wei 提交于 8月 14, 2020
  
  20147ace
13 8月, 2020 1 次提交

Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e

由 Leo Chen 提交于 8月 13, 2020

* add auto_cast, test=develop

* add loss scaler, test=develop

* add comments, test=develop

* refine code, test=develop

* refine code, test=develop

* do not set flags automatically, test=develop

* fix custom op bug, test=develop

* add more test, test=develop

* refine enable logic, test=develop

* enable amp test with GPU, test=develop

* add unittest

* add test for found_inf

* follow comments

* follow comments

* remove global variable, use singleton

* add some notes

* update comments

* update comments

* update comments

* add use_dynamic_loss_scaling argument

* refine found_inf

* refine found_inf

2d95280e

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致