提交 · 1a7962be9773e692583bb7d4d86e3db6477f248a · BaiXuePrincess / Paddle

11 2月, 2020 4 次提交

Paddlebox about box_wrapper (#22497) · 1a7962be

由 hutuxian 提交于 2月 11, 2020

Refine PaddleBox Framework, Main functions: 
* Add MetricMsg util class, which can calculate metrics like AUC, bucket_error, COPC.
* Replace FeedPass with new interface: BeginFeedPass & EndFeedPass
* Refactor Pull/Push Sparse Function in box_wrapper.
* Use CUDA Kernel to copy keys and copy feasign between tensor and boxps struct.
* Cache copied keys in pull sparse in order to reuse it in push period.

1a7962be

H

【OpPorting Example】DEMO OF FIX COMPILE&RUNTIME LOD_EQUALITY (#22460) · 9e29d3eb
由 huzhiqiang 提交于 2月 11, 2020

9e29d3eb

Improve transpose performance with tile sm copy, test=develop (#22311) · 54970444

由 zhaoyuchen2018 提交于 2月 11, 2020


* Refine code, fix select tile error,test=develop

* Refine element type and some comments, test=develop

* Refine comments and gpu utils, test=develop

* Remove some useless condition

* Refine floor and ceil, test=develop

* refine for loop. test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

54970444

Compile without nccl deps. [1/2] (#22509) · a90fa540

由 Wilber 提交于 2月 11, 2020

支持不依赖nccl进行编译。[1/2]

多卡下，如果没有打开WITH_NCCL开关编译，多卡不能通信，则只能选择一张卡使用。
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>

a90fa540

10 2月, 2020 3 次提交
- W
  Compile without nccl deps. [2/2] (#22484) · de009152
  由 Wilber 提交于 2月 10, 2020
```
Compile without nccl deps. [1/2]
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
```
  de009152
- Y
  Fix dismatch of std::max's arguments type on windows. (#22507) · 4b2227e9
  由 Yiqun Liu 提交于 2月 10, 2020
```
test=develop
```
  4b2227e9
- W
  
  fix test_fusion_seqpool_concat lod level between compile and runtime (#22488) · 870f4658
  由 Wilber 提交于 2月 10, 2020
  
  870f4658
07 2月, 2020 4 次提交
- Z
  Fix the integer overflow problem of sequence2batch (#22479) · a61d0952
  由 Zhong Hui 提交于 2月 07, 2020
```
Fix the  integer overflow problem in the op of sequence2batch, change the int32_t to size_t，
In the /paddle/fluid/operators/math/sequence2batch.h#L122.
```
  a61d0952
- C
  Add weight quantization in post_training_quanzitaion (#22445) · 197913eb
  由 cc 提交于 2月 07, 2020
```
* support weight quantization in post_training_quanzitaion, test=develop
* add test for weight quantization, test=develop
```
  197913eb
- T
  refine reshape_op shape error message (#22480) · 7c9ce097
  由 Tao Luo 提交于 2月 07, 2020
```
test=develop
```
  7c9ce097
- L
  optimize performance of interpolate op (#22436) · 2b1386b2
  由 LielinJiang 提交于 2月 07, 2020
```
* optimize interpolate op, test=develop
```
  2b1386b2
06 2月, 2020 1 次提交

Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f

由 Yiqun Liu 提交于 2月 06, 2020

Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)

* Add log in memory::Copy for debug purpose.

* Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one.

* Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one.
test=develop

* Change the type of second_dim from size_t to int64_t.
test=develop

44b45b9f

05 2月, 2020 2 次提交

add WITH_NCCL option for cmake. (#22384) · 7bc4b095

由 Wilber 提交于 2月 05, 2020

cmake选项中添加了WITH_NCCL，显示指定是否编译NCCL的部分代码，WITH_NCCL默认打开，但如果WITH_GPU为OFF，则关闭WITH_NCCL

添加了PADDLE_WITH_NCCL定义

单机单卡能够关闭NCCL编译，多卡的话需要默认打开NCCL，如果关闭NCCL，则只能使用单卡
Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>

7bc4b095

fix sigmoid cudnn bug (#22439) · 943cb8c6

由 Tao Luo 提交于 2月 05, 2020

* Sigmoid bug fix, test=develop

* fix code format

test=develop
Co-authored-by: NManjunath Bhat <manjunathbhat9920@gmail.com>

943cb8c6

04 2月, 2020 1 次提交
- 石
  
  remove anakin from code, test=develop (#22420) · e1b0d7cb
  由石晓伟提交于 2月 04, 2020
  
  e1b0d7cb
02 2月, 2020 1 次提交
- L
  Update the precision of pad, pad2d, pad_constant_like's unit tests from fp32 to fp64 (#22394) · 0404e7a9
  由 liu zhengxi 提交于 2月 02, 2020
```
* update the ut precision of pad pad2d pad_constant_like from fp32 to fp64, test=develop
```
  0404e7a9
31 1月, 2020 2 次提交

[DNNL] Fix accuracy in INT8 FC (#22404) · 269db0d1

由 Michał Gallus 提交于 1月 31, 2020

* Enable quantize to reorder to nchw as well

* Correct FC MKL-DNN input dim requirements to accept 3D

* Improve DNNL FC format, error and 3D input handling

test=develop

* Improve error checking in FC

test=develop

* Improve PADDLE_ENFORCE messages in fc-related files

* Remove data layout attribute from obligatory pass args

test=develop

* Fix message in fc_mkldnn_pass to be logically correct

test=develop

269db0d1

J

[UT coverage]Remove unnecessary transpose op registration (#22402) · fb3086fd
由 joanna.wozna.intel 提交于 1月 31, 2020

fb3086fd

25 1月, 2020 1 次提交
- L
  
  [UT Coverage]Improve sum_mkldnn_op line coverage (#22275) · ade50226
  由 lidanqing 提交于 1月 25, 2020
  
  ade50226
23 1月, 2020 1 次提交
- W
  
  improve elementwise_add_mkldnn_op test code coverage (#22359) · 92462e94
  由 Wojciech Uss 提交于 1月 23, 2020
  
  92462e94
22 1月, 2020 1 次提交
- C
  
  add benchmark flag for conv_transpose (#22389) · 20f30dd6
  由 ceci3 提交于 1月 22, 2020
  
  20f30dd6
21 1月, 2020 1 次提交
- C
  Fix GEO-SGD init & send Bug (#22375) · 8f36c395
  由 Chengmo 提交于 1月 21, 2020
```
* test=develop, fix geo Send & Init
```
  8f36c395
19 1月, 2020 2 次提交
- Z
  
  update unittest accuracy to float64 for relu, prelu, maxout (#22273) · c6f888e5
  由 zhupengyang 提交于 1月 19, 2020
  
  c6f888e5
- W
  
  Optimize the depthwise op test=develop (#22265) · 0d8b222b
  由 wangchaochaohu 提交于 1月 19, 2020
  
  0d8b222b
17 1月, 2020 2 次提交
- Q
  
  Fix infer_shape in compling for elementwise_op (#22291) · 2d20869c
  由 qingqing01 提交于 1月 17, 2020
  
  2d20869c
- T
  integrated HALF_ASYNC to communicator (#21869) · 82bc814a
  由 tangwei12 提交于 1月 17, 2020
```
* add half_async in the communicator
* fix DistributedStrategy
```
  82bc814a
16 1月, 2020 4 次提交
- W
  
  remove unused code test=develop (#22327) · 1e932ecc
  由 wangchaochaohu 提交于 1月 17, 2020
  
  1e932ecc
- L
  Remove unused inputs for some operators (#22284) · 3e5744aa
  由 Leo Chen 提交于 1月 16, 2020
```
* remove unused inputs, test=develop

* remove unused inputs, test=develop

* update dtype, test=develop

* remove unused inputs, test=develop

* update op_use_default_grad_op_maker, tese=develop

* resolve conflicts, test=develop

* follow comments, test=develop

* update center_loss_grad, test=develop
```
  3e5744aa
- Z
  
  fix typo in error message (#22312) · 805328e1
  由 zhangchunle 提交于 1月 16, 2020
  
  805328e1
- L
  
  change std::cout to log(INFO), vlog (#22316) · 895f8da7
  由 lidanqing 提交于 1月 16, 2020
  
  895f8da7
15 1月, 2020 1 次提交

Remove disable flag in test_fsp_op.py (#22171) · faba4b11

由 Bai Yifan 提交于 1月 15, 2020

* fix fsp_op, test=develop

* fix fsp grad op maker, test=develop

* update op_use_default_grad_op_maker.spec, test=develop

faba4b11

13 1月, 2020 2 次提交
- A
  
  Add caching mechanizm to requantize_mkldnn_op (#22223) · 9942d9ed
  由 Adam 提交于 1月 13, 2020
  
  9942d9ed
- 1
  Bug fix for sparse recorder (#21969) · 985bceac
  由 123malin 提交于 1月 13, 2020
```
* test=develop, bug fix for sparse recorder
```
  985bceac
10 1月, 2020 3 次提交

F
add backward gradient computation for op argsort (#22203) · 443a713c
由 FlyingQianMM 提交于 1月 10, 2020
```
* add backward gradient computation for op argsort test=developo

* use pre-commit test=develop
```
443a713c

Add bn and relu fuse pass (#22048) · 46189b16

由 Zhen Wang 提交于 1月 10, 2020

* add bn and relu fuse pass

* add op attr assert and dtype assert

* fix some inputs&&outputs bugs for the fused op and pattern.

* add the unittest for fuse_bn_act_pass. test=develop

* use normative enforce statements. test=develop

* add the cpu test. test=develop

* add the support of batch_size=1 for the bn with relu op. test=develop

* add the error type for paddle throws. test=develop

* add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop

46189b16

B

Improve ngraph file line coverage (#22155) · 298ee7d2
由 baojun 提交于 1月 09, 2020

298ee7d2

09 1月, 2020 2 次提交

test Optimizer in dygraph (#21949) · d0f0a252

由 zhongpu 提交于 1月 09, 2020

* test Optimizer in dygraph, test=develop

* add optest for Optimizer in dygraph, test=develop

* fix adagrad optimizer, test=develop

* fix dpsgd optimizer, test=develop

* fix test_optimizer.py, test=develop

* fix dpsgd optimizer, this op only support cpu, test=develop

* add optest for optimizer, test=develop

* add description for dpsgd, test=develop

* add rmsprop to white_list in unused_var_check.cc, test=develop

* polish code style, test=develop

* polish code style, test=develop

* delete seed attribute for DpsgdOptimizer, test=develop

* change testing to debugging, test=develop

d0f0a252

石

[Feature] Lite subgraph (#22114) · ad0dfb17
由石晓伟提交于 1月 09, 2020

ad0dfb17

08 1月, 2020 2 次提交
- Z
  Refine stack op to improve xlnet performance, test=develop (#22142) · 3d4f2aa6
  由 zhaoyuchen2018 提交于 1月 08, 2020
```
stack's wait cost a lot of cpu time, use cuda kernel to do memory copy
will reduce cpu time.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  3d4f2aa6
- L
  
  add double register op_data_type of pad2d and fix compile error, test=develop (#22075) · 64a40442
  由 liu zhengxi 提交于 1月 08, 2020
  
  64a40442

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致