提交 · 9e29d3ebed59a62366a6197821dd9e39b3255a94 · BaiXuePrincess / Paddle

11 2月, 2020 3 次提交

H

【OpPorting Example】DEMO OF FIX COMPILE&RUNTIME LOD_EQUALITY (#22460) · 9e29d3eb
由 huzhiqiang 提交于 2月 11, 2020

9e29d3eb

Improve transpose performance with tile sm copy, test=develop (#22311) · 54970444

由 zhaoyuchen2018 提交于 2月 11, 2020


* Refine code, fix select tile error,test=develop

* Refine element type and some comments, test=develop

* Refine comments and gpu utils, test=develop

* Remove some useless condition

* Refine floor and ceil, test=develop

* refine for loop. test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

54970444

Compile without nccl deps. [1/2] (#22509) · a90fa540

由 Wilber 提交于 2月 11, 2020

支持不依赖nccl进行编译。[1/2]

多卡下，如果没有打开WITH_NCCL开关编译，多卡不能通信，则只能选择一张卡使用。
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>

a90fa540

10 2月, 2020 3 次提交
- W
  Compile without nccl deps. [2/2] (#22484) · de009152
  由 Wilber 提交于 2月 10, 2020
```
Compile without nccl deps. [1/2]
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
```
  de009152
- Y
  Fix dismatch of std::max's arguments type on windows. (#22507) · 4b2227e9
  由 Yiqun Liu 提交于 2月 10, 2020
```
test=develop
```
  4b2227e9
- W
  
  fix test_fusion_seqpool_concat lod level between compile and runtime (#22488) · 870f4658
  由 Wilber 提交于 2月 10, 2020
  
  870f4658
07 2月, 2020 4 次提交
- Z
  Fix the integer overflow problem of sequence2batch (#22479) · a61d0952
  由 Zhong Hui 提交于 2月 07, 2020
```
Fix the  integer overflow problem in the op of sequence2batch, change the int32_t to size_t，
In the /paddle/fluid/operators/math/sequence2batch.h#L122.
```
  a61d0952
- C
  Add weight quantization in post_training_quanzitaion (#22445) · 197913eb
  由 cc 提交于 2月 07, 2020
```
* support weight quantization in post_training_quanzitaion, test=develop
* add test for weight quantization, test=develop
```
  197913eb
- T
  refine reshape_op shape error message (#22480) · 7c9ce097
  由 Tao Luo 提交于 2月 07, 2020
```
test=develop
```
  7c9ce097
- L
  optimize performance of interpolate op (#22436) · 2b1386b2
  由 LielinJiang 提交于 2月 07, 2020
```
* optimize interpolate op, test=develop
```
  2b1386b2
06 2月, 2020 1 次提交

Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f

由 Yiqun Liu 提交于 2月 06, 2020

Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)

* Add log in memory::Copy for debug purpose.

* Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one.

* Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one.
test=develop

* Change the type of second_dim from size_t to int64_t.
test=develop

44b45b9f

05 2月, 2020 2 次提交

add WITH_NCCL option for cmake. (#22384) · 7bc4b095

由 Wilber 提交于 2月 05, 2020

cmake选项中添加了WITH_NCCL，显示指定是否编译NCCL的部分代码，WITH_NCCL默认打开，但如果WITH_GPU为OFF，则关闭WITH_NCCL

添加了PADDLE_WITH_NCCL定义

单机单卡能够关闭NCCL编译，多卡的话需要默认打开NCCL，如果关闭NCCL，则只能使用单卡
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>

7bc4b095

fix sigmoid cudnn bug (#22439) · 943cb8c6

由 Tao Luo 提交于 2月 05, 2020

* Sigmoid bug fix, test=develop

* fix code format

test=develop
Co-authored-by: NManjunath Bhat <manjunathbhat9920@gmail.com>

943cb8c6

04 2月, 2020 1 次提交
- 石
  
  remove anakin from code, test=develop (#22420) · e1b0d7cb
  由石晓伟提交于 2月 04, 2020
  
  e1b0d7cb
02 2月, 2020 1 次提交
- L
  Update the precision of pad, pad2d, pad_constant_like's unit tests from fp32 to fp64 (#22394) · 0404e7a9
  由 liu zhengxi 提交于 2月 02, 2020
```
* update the ut precision of pad pad2d pad_constant_like from fp32 to fp64, test=develop
```
  0404e7a9
31 1月, 2020 2 次提交

[DNNL] Fix accuracy in INT8 FC (#22404) · 269db0d1

由 Michał Gallus 提交于 1月 31, 2020

* Enable quantize to reorder to nchw as well

* Correct FC MKL-DNN input dim requirements to accept 3D

* Improve DNNL FC format, error and 3D input handling

test=develop

* Improve error checking in FC

test=develop

* Improve PADDLE_ENFORCE messages in fc-related files

* Remove data layout attribute from obligatory pass args

test=develop

* Fix message in fc_mkldnn_pass to be logically correct

test=develop

269db0d1

J

[UT coverage]Remove unnecessary transpose op registration (#22402) · fb3086fd
由 joanna.wozna.intel 提交于 1月 31, 2020

fb3086fd

25 1月, 2020 1 次提交
- L
  
  [UT Coverage]Improve sum_mkldnn_op line coverage (#22275) · ade50226
  由 lidanqing 提交于 1月 25, 2020
  
  ade50226
23 1月, 2020 1 次提交
- W
  
  improve elementwise_add_mkldnn_op test code coverage (#22359) · 92462e94
  由 Wojciech Uss 提交于 1月 23, 2020
  
  92462e94
22 1月, 2020 1 次提交
- C
  
  add benchmark flag for conv_transpose (#22389) · 20f30dd6
  由 ceci3 提交于 1月 22, 2020
  
  20f30dd6
21 1月, 2020 1 次提交
- C
  Fix GEO-SGD init & send Bug (#22375) · 8f36c395
  由 Chengmo 提交于 1月 21, 2020
```
* test=develop, fix geo Send & Init
```
  8f36c395
19 1月, 2020 2 次提交
- Z
  
  update unittest accuracy to float64 for relu, prelu, maxout (#22273) · c6f888e5
  由 zhupengyang 提交于 1月 19, 2020
  
  c6f888e5
- W
  
  Optimize the depthwise op test=develop (#22265) · 0d8b222b
  由 wangchaochaohu 提交于 1月 19, 2020
  
  0d8b222b
17 1月, 2020 2 次提交
- Q
  
  Fix infer_shape in compling for elementwise_op (#22291) · 2d20869c
  由 qingqing01 提交于 1月 17, 2020
  
  2d20869c
- T
  integrated HALF_ASYNC to communicator (#21869) · 82bc814a
  由 tangwei12 提交于 1月 17, 2020
```
* add half_async in the communicator
* fix DistributedStrategy
```
  82bc814a
16 1月, 2020 4 次提交
- W
  
  remove unused code test=develop (#22327) · 1e932ecc
  由 wangchaochaohu 提交于 1月 17, 2020
  
  1e932ecc
- L
  Remove unused inputs for some operators (#22284) · 3e5744aa
  由 Leo Chen 提交于 1月 16, 2020
```
* remove unused inputs, test=develop

* remove unused inputs, test=develop

* update dtype, test=develop

* remove unused inputs, test=develop

* update op_use_default_grad_op_maker, tese=develop

* resolve conflicts, test=develop

* follow comments, test=develop

* update center_loss_grad, test=develop
```
  3e5744aa
- Z
  
  fix typo in error message (#22312) · 805328e1
  由 zhangchunle 提交于 1月 16, 2020
  
  805328e1
- L
  
  change std::cout to log(INFO), vlog (#22316) · 895f8da7
  由 lidanqing 提交于 1月 16, 2020
  
  895f8da7
15 1月, 2020 1 次提交

Remove disable flag in test_fsp_op.py (#22171) · faba4b11

由 Bai Yifan 提交于 1月 15, 2020

* fix fsp_op, test=develop

* fix fsp grad op maker, test=develop

* update op_use_default_grad_op_maker.spec, test=develop

faba4b11

13 1月, 2020 2 次提交
- A
  
  Add caching mechanizm to requantize_mkldnn_op (#22223) · 9942d9ed
  由 Adam 提交于 1月 13, 2020
  
  9942d9ed
- 1
  Bug fix for sparse recorder (#21969) · 985bceac
  由 123malin 提交于 1月 13, 2020
```
* test=develop, bug fix for sparse recorder
```
  985bceac
10 1月, 2020 3 次提交

F
add backward gradient computation for op argsort (#22203) · 443a713c
由 FlyingQianMM 提交于 1月 10, 2020
```
* add backward gradient computation for op argsort test=developo

* use pre-commit test=develop
```
443a713c

Add bn and relu fuse pass (#22048) · 46189b16

由 Zhen Wang 提交于 1月 10, 2020

* add bn and relu fuse pass

* add op attr assert and dtype assert

* fix some inputs&&outputs bugs for the fused op and pattern.

* add the unittest for fuse_bn_act_pass. test=develop

* use normative enforce statements. test=develop

* add the cpu test. test=develop

* add the support of batch_size=1 for the bn with relu op. test=develop

* add the error type for paddle throws. test=develop

* add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop

46189b16

B

Improve ngraph file line coverage (#22155) · 298ee7d2
由 baojun 提交于 1月 09, 2020

298ee7d2

09 1月, 2020 2 次提交

test Optimizer in dygraph (#21949) · d0f0a252

由 zhongpu 提交于 1月 09, 2020

* test Optimizer in dygraph, test=develop

* add optest for Optimizer in dygraph, test=develop

* fix adagrad optimizer, test=develop

* fix dpsgd optimizer, test=develop

* fix test_optimizer.py, test=develop

* fix dpsgd optimizer, this op only support cpu, test=develop

* add optest for optimizer, test=develop

* add description for dpsgd, test=develop

* add rmsprop to white_list in unused_var_check.cc, test=develop

* polish code style, test=develop

* polish code style, test=develop

* delete seed attribute for DpsgdOptimizer, test=develop

* change testing to debugging, test=develop

d0f0a252

石

[Feature] Lite subgraph (#22114) · ad0dfb17
由石晓伟提交于 1月 09, 2020

ad0dfb17

08 1月, 2020 3 次提交

Refine stack op to improve xlnet performance, test=develop (#22142) · 3d4f2aa6

由 zhaoyuchen2018 提交于 1月 08, 2020

stack's wait cost a lot of cpu time, use cuda kernel to do memory copy
will reduce cpu time.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

3d4f2aa6

L

add double register op_data_type of pad2d and fix compile error, test=develop (#22075) · 64a40442
由 liu zhengxi 提交于 1月 08, 2020

64a40442

Support prroi_pool_op with Tensor and LoDTensor rois (#20649) · 6ea38091

由 Double_V 提交于 1月 08, 2020

1. Add a new input named batch_roi_nums for prroi_pool_op. batch_roi_nums includes the number of roi for each image in batch when rois is Tensor. This information is saved in rois's lod when rois is LoDTensor.
2. add grad check to prroi_pool_op and solve unnormal X grad diff in CPU.

6ea38091

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致