提交 · dcdd18aeb6296244d6f2bce1b7bc037f6ceaab55 · BaiXuePrincess / Paddle

21 1月, 2020 1 次提交
- L
  
  change std::cout to log(INFO), vlog (#22316) (#22337) · 90ce4aea
  由 lidanqing 提交于 1月 21, 2020
  
  90ce4aea
20 1月, 2020 1 次提交
- T
  integrated HALF_ASYNC to communicator (#21869) (#22343) · fa4e0e82
  由 tangwei12 提交于 1月 20, 2020
```
* add half_async in the communicator
* fix DistributedStrategy
```
  fa4e0e82
17 1月, 2020 1 次提交
- Q
  
  Fix infer_shape in compling for elementwise_op (#22291) (#22353) · 410a5356
  由 qingqing01 提交于 1月 17, 2020
  
  410a5356
16 1月, 2020 1 次提交
- A
  
  Add caching mechanizm to requantize_mkldnn_op (#22267) · 35bab4f2
  由 Adam 提交于 1月 16, 2020
  
  35bab4f2
14 1月, 2020 3 次提交
- 1
  Bug fix for sparse recorder (#21969) (#22245) · 2e834eab
  由 123malin 提交于 1月 14, 2020
```
* test=develop, bug fix for sparse recorder
```
  2e834eab
- F
  
  add backward gradient computation for op argsort. cherry-pick #22203. test=release/1.7 (#22233) · 681d908e
  由 FlyingQianMM 提交于 1月 14, 2020
  
  681d908e
- Z
  
  support the fusion of batch_norm and relu for AMP. test=release/1.7 (#22210) · c63a63d5
  由 Zhen Wang 提交于 1月 14, 2020
  
  c63a63d5
10 1月, 2020 1 次提交
- B
  
  Improve ngraph file line coverage (#22155) · 298ee7d2
  由 baojun 提交于 1月 09, 2020
  
  298ee7d2
09 1月, 2020 2 次提交

test Optimizer in dygraph (#21949) · d0f0a252

由 zhongpu 提交于 1月 09, 2020

* test Optimizer in dygraph, test=develop

* add optest for Optimizer in dygraph, test=develop

* fix adagrad optimizer, test=develop

* fix dpsgd optimizer, test=develop

* fix test_optimizer.py, test=develop

* fix dpsgd optimizer, this op only support cpu, test=develop

* add optest for optimizer, test=develop

* add description for dpsgd, test=develop

* add rmsprop to white_list in unused_var_check.cc, test=develop

* polish code style, test=develop

* polish code style, test=develop

* delete seed attribute for DpsgdOptimizer, test=develop

* change testing to debugging, test=develop

d0f0a252

石

[Feature] Lite subgraph (#22114) · ad0dfb17
由石晓伟提交于 1月 09, 2020

ad0dfb17

08 1月, 2020 3 次提交

Refine stack op to improve xlnet performance, test=develop (#22142) · 3d4f2aa6

由 zhaoyuchen2018 提交于 1月 08, 2020

stack's wait cost a lot of cpu time, use cuda kernel to do memory copy
will reduce cpu time.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

3d4f2aa6

L

add double register op_data_type of pad2d and fix compile error, test=develop (#22075) · 64a40442
由 liu zhengxi 提交于 1月 08, 2020

64a40442

Support prroi_pool_op with Tensor and LoDTensor rois (#20649) · 6ea38091

由 Double_V 提交于 1月 08, 2020

1. Add a new input named batch_roi_nums for prroi_pool_op. batch_roi_nums includes the number of roi for each image in batch when rois is Tensor. This information is saved in rois's lod when rois is LoDTensor.
2. add grad check to prroi_pool_op and solve unnormal X grad diff in CPU.

6ea38091

07 1月, 2020 4 次提交
- Z
  Fix windows build not kernel issue, test=develop (#22105) · 3dbd4087
  由 zhaoyuchen2018 提交于 1月 07, 2020
```
windows conv_fusion failed as no kernel， explicit declare lambda
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  3dbd4087
- C
  Update pyramid related OP (#21372) · 418abc92
  由 Chengmo 提交于 1月 07, 2020
```
* add special way to add distribute vars， Update Pyramid hash op
```
  418abc92
- F
  add erf op (#21785) · 14aebc7a
  由 Feiyu Chan 提交于 1月 07, 2020
```
* add erf op and python interface.

* add fp16 support for erf op.

* add unitests for erf op and its python interface.
```
  14aebc7a
- C
  
  replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109) · ba8414d3
  由 Chen Weihang 提交于 1月 07, 2020
  
  ba8414d3
06 1月, 2020 4 次提交

support elu_op double grad (#21822) · fab4b076

由 Double_V 提交于 1月 06, 2020

* support elu activation double grad,test=develop

* delete the code commit in .cc,test=develop

* fix relu test unpass, test=develop

* add elu double grad kernel and unit test

* add caculate dX in elu double grad functor, test=develop

* update the commit code,test=develop

fab4b076

Add TRT support for BERT (#21135) · 0a51098a

由 Pei Yang 提交于 1月 06, 2020

* add gelu plugin

* align trt bert with gpu

* add support for fused fc with relu,

* add unittest for bert trt

0a51098a

J

[MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088) · b0b27ff6
由 Jacek Czaja 提交于 1月 06, 2020

b0b27ff6
1
add distributed_strategy (#21710) · 7fb817d4
由 123malin 提交于 1月 06, 2020
```
* add distributed_strategy
```
7fb817d4

05 1月, 2020 1 次提交
- J
  
  [MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747) · ad8a9cb8
  由 Jacek Czaja 提交于 1月 05, 2020
  
  ad8a9cb8
04 1月, 2020 1 次提交
- K
  
  polish cross_entropy ENFORCE (#22056) · 34c57120
  由 Kaipeng Deng 提交于 1月 04, 2020
  
  34c57120
03 1月, 2020 5 次提交

S
register int/int64_t/float16 in pow/square kernel,test=develop (#22023) · 7f4abaf2
由 SunAhong1993 提交于 1月 03, 2020
```
* register int/int64_t/float16 in  pow/square kernel,test=develop

* add abs/square/exp type,test=develop
```
7f4abaf2

由 Leo Chen 提交于 1月 03, 2020

* fix test_conv2d_ngraph for grad diff, test=develop

* register NoNeedBufferVarsInference for max_pool_grad_op, test=develop

* refine error message, test=develop

* fix numpy, test=develop

* disable test conv2d_ngraph_op, test=develop
Co-authored-by: NZhang Ting <709968123@qq.com>

3f653c83

Add the first implememtation of fusion_group op (#19621) · d4832077

由 Yiqun Liu 提交于 1月 03, 2020

* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
test=develop

* Call CUDA driver api to launch the kernel compiled by nvrtc.
test=develop

* Disable for mac and windows.
test=develop

* Refine the codes to support manually specified num_threads and workload_per_thread.
test=develop

* Refine the CUDA kernel to support large dims.
test=develop

* Add DeviceCodePool to manage all device codes.

* Add the first implementation fusion_group op.

* Add unit-test for fusion_group op.

* Add the check of result.

* Add the check of nvrtc in unit-test.
test=develop

* Add comment to explain the inputs, outputs and features of fusion_group op.
test=develop

* Disable fusion_group op for mac and windows.
test=develop

* Make the compiling of device code return status instead of hanging up.
test=develop

* Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.

* Unify fusion_group_op's input and output names.
test=develop

* Add the check of CUDA driver library in unittest.
test=develop

* Refine the calling of PADDLE_ENFORCE.
test=develop

d4832077

M

[DNNL] 3D Fully-Connected (#21746) · 61921084
由 Michał Gallus 提交于 1月 03, 2020

61921084
F
fix generate_proposal_labesl op (#21793) · aa2ed0dc
由 FDInSky 提交于 1月 03, 2020
```
* test=develop fix generate_proposal_labesl op
```
aa2ed0dc

02 1月, 2020 2 次提交

C
update error log for batch_norm_grad (#22017) · 95d79b6d
由 ceci3 提交于 1月 02, 2020
```
* update error information about batch_norm_grad

* update bn,test=develop
```
95d79b6d

fix integer overflow in match_matrix (#22036) · c53b62eb

由 Aurelius84 提交于 1月 02, 2020

* fix integer overflow in match_matrix test=develop

* fix integer overflow in match_matrix test=develop

* fix typo test=develop

c53b62eb

31 12月, 2019 1 次提交
- W
  
  polish code test=develop (#22014) · 64baee41
  由 wangchaochaohu 提交于 12月 31, 2019
  
  64baee41
30 12月, 2019 1 次提交
- D
  
  fix broadcast bug;test=develop (#21898) · b7697f62
  由 danleifeng 提交于 12月 30, 2019
  
  b7697f62
27 12月, 2019 3 次提交

Refine multihead kernel, align block to 32 (#21961) · 8859ddd6

由 zhaoyuchen2018 提交于 12月 27, 2019

* Refine multihead kernel, align block to 32

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine log comments

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

8859ddd6

add shuffle batch op (#21674) · cee2ccb0

由 zhoushiyu 提交于 12月 27, 2019

* add shuffle batch op, test=develop, test=document_preview

* fix size_t conflict and check_output test=develop, test=document_preview

* fix bug test=develop, test=document_preview

* add unittest of shuffle_batch layer test=develop, test=document_preview

* fix py coverage and op input type, test=develop, test=document_preview

* fix py coverage, test=develop

* fix en doc, test=develop

* move to contrib test=develop

* add unique_name test=develop

* invoke shuffle_batch in contrib.layers test=develop

cee2ccb0

M
make reverse op support negative axis (#21925) · c3e19549
由 mapingshuo 提交于 12月 27, 2019
```
* make reverse op support negative axis
```
c3e19549

26 12月, 2019 2 次提交
- A
  Remove double registered dataType in Pad2d (#21942) · 10d68469
  由 Aurelius84 提交于 12月 26, 2019
```
* fix compile error in CUDA10 test=develop

* remove double in pad2d test=develop
```
  10d68469
- H
  fix aucop stat shape (#21846) · 27decacb
  由 hutuxian 提交于 12月 26, 2019
```
* fix stat shape back in global auc scenario
* add UT to cover global auc
```
  27decacb
25 12月, 2019 3 次提交
- A
  add register op_data_type of pad/expand_as et.al (#21718) · 5cb2c741
  由 Aurelius84 提交于 12月 25, 2019
```
* add register op_data_type test=develop

* fix register bug in isfinite op test=develop

* rm int int64_t in pad2d gradKernel  test=develop
```
  5cb2c741
- H
  
  fix matmul error message; test=develop (#21885) · 30d000f8
  由 hong 提交于 12月 25, 2019
  
  30d000f8
- Z
  
  remove patch command and file of cares to Improved quality of Paddle Repo (#21776) · a01663ca
  由 zhouwei25 提交于 12月 25, 2019
  
  a01663ca

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致