Commits · 49523ea1898ddaea4e166ae5a603fec577e165c9 · BaiXuePrincess / Paddle

03 Sep, 2019 2 commits
- replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586) · 49523ea1
  Committed by Tao Luo on Sep 03, 2019
```
* remove unused PADDLE_ASSERT(_IS_NOT_ERROR)

* replace PADDLE_ASSERT with PADDLE_ASSERT_MSG

test=develop
```
- Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506) · abaf87be
  Committed by gongweibao on Sep 03, 2019
```
Change backward_guard to optimize_guard to maximize the allreduce overlap
```
02 Sep, 2019 3 commits

Delete pserver complete file before executor running. (#19468) · 57f0f0f2
Committed by gongweibao on Sep 02, 2019

add padding in linear_chain_crf op (#19583) · 4a7e6deb

Committed by JesseyXujin on Sep 02, 2019

* add padding in linear_chain_crf op

* modify API.spec

* add linear_chain_crf_op.cc and linear_chain_crf_op.h

* remove useless unit test, test=develop

* modify API.spec, test=develop

* remove some blanks in nn.py, test=develop

* fix some bugs on nn.py and API.spec, test=develop

* fix nn.py, test=develop

* fix API.spec, test=develop

* fix bug of CI test in test_linear_chain_crf_op.py

* fix bug of CI test in test_linear_chain_crf_op.py, test=develop

* remove paddle_enforce, test=develop

* remove paddle_enforce, test=develop

* remove paddle_enforce, test=develop

* remove paddle_enforce, test=develop

* remove paddle_enforce, test=develop

* remove paddle_enforce, test=develop

* modify nn.py, test=develop

* fix API.spec, test=develop

* fix unittest bug, test=develop

fix the compilation issue on windows caused by mkl_CSRMM (#19533) · 84c72801
Committed by zhouwei25 on Sep 02, 2019

01 Sep, 2019 1 commit

[MKL-DNN] Refactoring Softmax (#19312) · cef95ee3

Committed by Jacek Czaja on Sep 01, 2019

* - First set of modifications

- Compilation fixes

- compilation fix

- Another compilation fix

- Moved AcquireSoftmaxPrimitiveDescriptor call into handler

- MKL-DNN Softmax PD refactor

test=develop

- Compilation fix

test=develop

- another compilation fix

- cosmetics

test=develop

- Compilation fix

- Fix to crash when softmax backward is created

* - Fixes after review of softmax refactoring

test=develop

31 Aug, 2019 2 commits

Paddlebox Framework (#18982) · c756b5d2

Committed by hutuxian on Aug 31, 2019

* Support looking up embeddings from BoxPS.
* Add a _pull_box_sparse op; for now this op is not exposed to users.
* Add a BoxHelper class, providing 'BeginPass', 'EndPass', 'FeedPass' functions and so on.
* Add 'BoxPSDataset' in python code.
* Add a compile option WITH_BOX_PS and a macro PADDLE_WITH_BOX_PS.
* Add UT.
* For more details, please refer to: https://github.com/PaddlePaddle/Paddle/pull/18982

remove reset recordio usage (#19519) · 5dce1da6
Committed by Zeng Jinle on Aug 31, 2019

30 Aug, 2019 7 commits

add gather_nd op and unit test (#19366) · 85914f7a
Committed by ShenLiang on Aug 30, 2019
```
* fixed the code for coverage

* fixed the document,test=document_preview test=develop
```

[MKL-DNN] Fix to face model on AVX512 platforms (#19282) · ecd9f330

Committed by Jacek Czaja on Aug 30, 2019

- Refactor step 1

- Compilation fix

- Yet another compilation fix

- Even more compilation fix

- Lint fixes

test=develop

- Removed deprecated PADDLE_ENFORCE occurrence

test=develop

- Candidate fix to BN forward

- Lint fixes

test=develop

- Refactoring in data_layout_transform

- compilation fix

- Another compilation fix

- Step further into darkness

- Yet another compilation fix

- Yet another compilation fix

- missing header

- compilation fix

- Added MKLDNN -> Paddle conversion in fetch op

test=develop

- Compilation fix

test=develop

- Lint

test=develop

- Mul fix

- Fix to MKLDNN MUL op and Elementwise MUL UT

test=develop

- Workaround for different weights with groups representation Paddle vs
  MKL-DNN.

test=develop

- Candidate fix for 5D convolution with groups

- Refactor of fix for conv3d and conv2d in fetch op

test=develop

- Compilation fix

- Still same compilation fix

- Compilation fix

- Compilation fix

- Reverted refactoring of fixes

- Adapted test_conv2d_int8_mkldnn so it expects data in NCHW format
  not NHWC

test=develop

- minor fix in UT

test=develop

- Lint fixes

test=develop

Modify the dropout op to multi-thread (#19504) · e8405e5c
Committed by GaoWei8 on Aug 30, 2019
```
* Modify the dropout op to multi-thread
test=develop

* define parallel
test=develop
```

Change ugly PADDLE_ENFORCE_EQ in recurrent_op.cc (#19470) · 2916caa2
Committed by Huihuang Zheng on Aug 30, 2019
```
test=develop
```

change var name padding_num to padding_value (#19498) · 9dde5640
Committed by Liufang Sang on Aug 30, 2019

Add sequence_topk_avg_pooling Op (#19442) · 5b5379b3
Committed by Aurelius84 on Aug 30, 2019
```
* add topk_avg_pooling

* refine api doc and modify api.spec test=develop
```

remove unused assert.h (#19529) · 02270b3e
Committed by Tao Luo on Aug 30, 2019
```
test=develop
```

29 Aug, 2019 4 commits
- clean up intel labeled TODOs (#19476) · ba368bf6
  Committed by lidanqing on Aug 29, 2019
```
test=develop
```
- fix softmax seg fault in AVX, test=develop (#19487) · 11f2f784
  Committed by Zeng Jinle on Aug 29, 2019
- refine inplace inference registry, test=develop (#19032) · 5c8f210c
  Committed by Zeng Jinle on Aug 29, 2019
- Increase num_iteration_per_drop_scope (#19075) · b6d1d890
  Committed by chengduo on Aug 29, 2019
```
* increase num_iteration_per_drop_scope
test=develop

* Fix bug of while_op
test=develop

* fix bug of whileOp
test=develop
```
28 Aug, 2019 3 commits

fix row_conv_op to force it support lodtensor and tensor input simultaneously,... · 1d0f0431

Committed by Double_V on Aug 28, 2019

fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412)

Support Tensor input for row_conv_op

Fix the correctness of async mode at distributed training (#18863) · 65c73684

Committed by tangwei12 on Aug 28, 2019

* fix correctness of the communicator

* fix a bug in send thread when sending var context is empty, test=develop

* add lookup_table_prefetch_op and prefetch optimize, test=develop

* remove remote prefetch GPU supported

* word2vec force with CPU, test=develop

* test dist remote lookup table force with CPU, test=develop

Update ngraph engine for multiple threading (#19155) · 6421c61a
Committed by baojun on Aug 27, 2019
```
* update for multiple threading
test=develop

* remove PADDLE_ENFORCE test=develop
```

27 Aug, 2019 3 commits

supports multiple NCCL communicators preserved in NCCLCommContext (#19407) · efb05ba2

Committed by Yi Liu on Aug 27, 2019

* supports multiple NCCL communicators preserved in NCCLCommContext
test=develop

* add ut for c_comm_init_all operator and fix cuda resource release problem
test=develop

Delete useless ex-scope in recurrent op (#19426) · 56dd7653
Committed by Huihuang Zheng on Aug 27, 2019

Support Tensor input with padding for warpctc op (#19322) · 482ce818

Committed by vincentXiyu on Aug 27, 2019

* support tensor input with padding for warpctc op

* merge with develop

* test=develop

* modified python API examples test=develop

* nn.py is modified for code coverage test=develop

* update documents info about warpctc op in API.spec test=develop

* add test_warpctc_with_padding in test_layers test=develop

* add warning log for cuda_version back to warpctc_op.cc

* modify API.spec for warpctc op test=develop

* modify API.spec

* update warpctc test to new CompiledProgram API test=develop

* modify code examples for warpctc op test=develop

* modify API.spec for warpctc op test=develop

* modify API.spec for warpctc op test=develop

26 Aug, 2019 2 commits
- Change TensorCopy in recurrent_op to ShareDataWith (#19319) · 12d29f4d
  Committed by Huihuang Zheng on Aug 26, 2019
- fix distribute transpiler GRPC error code 4, RPC Deadline (#18984) · 19dac67e
  Committed by tangwei12 on Aug 26, 2019
```
* fix sync mode hang in transpiler
* remove sync mode in send/recv
* replace PADDLE_ENFORCE with PADDLE_ENFORCE_NE
```
22 Aug, 2019 3 commits

Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153) · 2e3ee579

Committed by 翟飞跃 on Aug 22, 2019

* Implement the operator with sparse matrix multiply

* Update the URL of mklml library.

test=develop

* Disable MKLML implementation on non-Linux platforms.

test=develop

* optimize bp with mkl sparse matrix
test=develop

Enhance OpTest to check the consistency of operators when using and not using inplace (#19101) · a9d5fc51

Committed by Leo Chen on Aug 22, 2019

* add pybind interface to get all inplace ops, test=develop

* enhance OpTest to check the consistency of operators when using and not using inplace, test=develop

* handle corner cases in op_test, test=develop

* support outputs without tensor holder_, like XShape in reshape_op, test=develop

* fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop

* use reshape_grad instead of reshape in FlattenGradOp, test=develop

* fix error debug dims info for variables like XShape, test=develop

* change computational order in sum_op to relieve computation difference using inplace, test=develop

* add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop

* follow sneaxiy's comments, test=develop

* remove unused DefaultGradOpDescMaker in mkldnn op, test=develop

Supports diagonal initialization in uniform_random op (#19299) · 0d29cf18

Committed by Aurelius84 on Aug 22, 2019

* add diag init in Uniform_random op test=develop

* modify api.spec test=develop

* fix unform_batch_size_like maker test=develop

* add diag_num and diag_step assert check test=develop

21 Aug, 2019 2 commits
- Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237) · 97d1db18
  Committed by Adam on Aug 21, 2019
```
* Add generalized Conv+Activation MKLDNN fuse pass creation Part2
test=develop

* Undefined behaviour of GetAttrIfExists<> FIX
test=develop
```
- fix generate mask fpn, test=develop (#19301) · 37428952
  Committed by wangguanzhong on Aug 21, 2019
20 Aug, 2019 3 commits

Fix elementwise performance poor issue (#19278) · 5296294d
Committed by zhaoyuchen2018 on Aug 20, 2019
```
For small cases, using a 1D block is better than a 2D block.

Refer to this issue: #19275
```

Use sparse matrix to implement fused emb_seq_pool operator (#19064) · b9203958

Committed by Yihua Xu on Aug 20, 2019

* Implement the operator with sparse matrix multiply

* Update the URL of mklml library.

test=develop

* Disable MKLML implementation on non-Linux platforms.

test=develop

* Ignore the deprecated status for windows

test=develop

optimize the realization of cuda dropout (#19136) · 6e326ca2

Committed by wangchaochaohu on Aug 20, 2019

* cuda optimize for dropout

* remove tmp swp file

* fix compile error test=develop

* test=develop optimize the cuda realization of dropout op

* remove unused code test=develop

* remove tmp file test=develop

19 Aug, 2019 5 commits

Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213) · 76c95af0

Committed by Zhaolong Xing on Aug 19, 2019

* fix mask rcnn bug:
1. affine channel fuse (diff)
2. condition block op (memory leak)
3. merge lod tensor op (diff)
4. memory optim (diff)
test=develop

* fix ci about PADDLE_ENFORCE
fix merge lod infer op ut
test=develop

Remove warning in batch_norm_op (#19260) · 5fc8de44
Committed by qingqing01 on Aug 19, 2019

Add match_matrix_tensor op (#18525) · 78a3d837

Committed by Aurelius84 on Aug 19, 2019

* add match_matrix_tensor op test=develop

* fix ignore unittest if with_mkl=off test=develop

* clean code and rm is_test param test=develop

* modify API.spec test=develop

* rm useless code in search_compute.h test=develop

* modify api.spec test=develop

* modify default_grad.spec test=develop

* Add API test code test=develop

* clean code in search_computer.h

* modify PADDLE_ENFORCE and clean search_compute.h test=develop

* fix code style test=develop

merge develop to solve conflict, also fix API doc, test=develop (#18823) · 5b6673c4
Committed by Zeng Jinle on Aug 19, 2019

add fl_listen_and_serv &fl_transpiler,test=develop (#19091) · 539c8707

Committed by zhang wenhui on Aug 19, 2019

Add the fl_listen_and_serv op for federated learning; fl_distribute_transpiler adds this op to the pserver program. This op just listens on the endpoint and performs sum & scale.

BaiXuePrincess / Paddle: up to date with the fork source project