提交 · 65c7368400319b4c5ea93c47ec761b1c29b7ca0f · PaddlePaddle / Paddle

28 8月, 2019 2 次提交

Fix the correctness of async mode at distributed training (#18863) · 65c73684

由 tangwei12 提交于 8月 28, 2019

* fix correctness of the communicator

* fix a bug in send thread when sending var context is empty, test=develop

* add lookup_table_prefetch_op and prefetch optimize, test=develop

* remove remote prefetch GPU supported

* word2vec force with CPU, test=develop

* test dist remote lookup table force with CPU, test=develop

65c73684

B
Update ngraph engine for multiple threading (#19155) · 6421c61a
由 baojun 提交于 8月 27, 2019
```
* update for multiple threading
test=develop

* remove PADDLE_ENFORCE test=develop
```
6421c61a

27 8月, 2019 3 次提交

supports multiple NCCL communicators preserved in NCCLCommContext (#19407) · efb05ba2

由 Yi Liu 提交于 8月 27, 2019

* supports multiple NCCL communicators preserved in NCCLCommContext
test=develop

* add ut for c_comm_init_all operator and fix cuda resource release problem
test=develop

efb05ba2

H

Delete useless ex-scope in recurrent op (#19426) · 56dd7653
由 Huihuang Zheng 提交于 8月 27, 2019

56dd7653

Support Tensor input with padding for warpctc op (#19322) · 482ce818

由 vincentXiyu 提交于 8月 27, 2019

* support tensor input with padding for warpctc op

* merge with develop

* test=develop

* modified python API examples test=develop

* nn.py is modified for code coverage test=develop

* update documents info about warpctc op in API.spec test=develop

* add test_warpctc_with_padding in test_layers test=develop

* add warning log for cuda_version back to warpctc_op.cc

* modify API.spec for warpctc op test=develop

* modify API.spec

* update warpctc test to new CompiledProgram API test=develop

* modify code examples for warpctc op test=develop

* modify API.spec for warpctc op test=develop

* modify API.spec for warpctc op test=develop

482ce818

26 8月, 2019 2 次提交
- H
  
  Change TensorCopy in recurrent_op to ShareDataWith (#19319) · 12d29f4d
  由 Huihuang Zheng 提交于 8月 26, 2019
  
  12d29f4d
- T
  fix distribute transpiler GRPC error code 4, RPC Deadline (#18984) · 19dac67e
  由 tangwei12 提交于 8月 26, 2019
```
* fix sync mode hang in transpiler
* remove sync mode in send/recv
* replace PADDLE_ENFORCE with PADDLE_ENFORCE_NE
```
  19dac67e
22 8月, 2019 3 次提交

翟

Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153) · 2e3ee579

由翟飞跃提交于 8月 22, 2019

* Implement the operator with sprase matrix multiply

* Update the URL of mklml library.

test=develop

* Disable MKLML implematation when using no-linux.

test=develop

* optimize bp with mkl sparse matrix
test=develop

2e3ee579

Enhance OpTest to check the consistency of operators when using and not using inplace (#19101) · a9d5fc51

由 Leo Chen 提交于 8月 22, 2019

* add pybind interface to get all inplace ops, test=develop

* enhance OpTest to check whether the consistency of operator when using and not using inplace, test=develop

* handle corner cases in op_test, test=develop

* support outputs without tensor holder_, like XShape in reshape_op, test=develop

* fix bug, some op has GradOpMaker, but actually no grad_op in OpInfoMap, test=develop

* use reshape_grad instead of reshape in FlattenGradOp, test=develop

* fix error debug dims info for variables like XShape, test=develop

* change computational order in sum_op to relieve computation difference using inplace, test=develop

* add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop

* follow sneaxiy's comments, test=develop

* remove unused DefaultGradOpDescMaker in mkldnn op, test=develop

a9d5fc51

Supports diagonal initialization in uniform_random op (#19299) · 0d29cf18

由 Aurelius84 提交于 8月 22, 2019

* add diag init in Uniform_random op test=develop

* modify api.spec test=develop

* fix unform_batch_size_like maker test=develop

* add diag_num and diag_step assert check test=develop

0d29cf18

21 8月, 2019 2 次提交
- A
  Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237) · 97d1db18
  由 Adam 提交于 8月 21, 2019
```
* Add generalized Conv+Activation MKLDNN fuse pass creation Part2
test=develop

* Undefined behaviour of GetAttrIfExists<> FIX
test=develop
```
  97d1db18
- W
  
  fix generate mask fpn, test=develop (#19301) · 37428952
  由 wangguanzhong 提交于 8月 21, 2019
  
  37428952
20 8月, 2019 3 次提交

Z
Fix elementwise performance poor issue (#19278) · 5296294d
由 zhaoyuchen2018 提交于 8月 20, 2019
```
For small case use 1D block is better than 2D block.

Refer to this issue: #19275
```
5296294d

Use sparse matrix to implement fused emb_seq_pool operator (#19064) · b9203958

由 Yihua Xu 提交于 8月 20, 2019

* Implement the operator with sprase matrix multiply

* Update the URL of mklml library.

test=develop

* Disable MKLML implematation when using no-linux.

test=develop

* Ignore the deprecated status for windows

test=develop

b9203958

optimize the realization of cuda dropout (#19136) · 6e326ca2

由 wangchaochaohu 提交于 8月 20, 2019

* cuda optimie for dropout

* remove tmp swp file

* fix compile error test=develop

* test=develop optimize the cuda realization of dropout op

* remove unsed code test=develop

* remove tmp file test=develop

6e326ca2

19 8月, 2019 6 次提交

Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213) · 76c95af0

由 Zhaolong Xing 提交于 8月 19, 2019

* fix mask rcnn bug:
1. affine channel fuse (diff)
2. condition block op (memory leak)
3. merge lod tensor op (diff)
4. memroy optim (diff)
test=develop

* fix ci aboud PADDLE_ENFOCE
fix merge lod infer op ut
test=develop

76c95af0

Q

Remove warning in batch_norm_op (#19260) · 5fc8de44
由 qingqing01 提交于 8月 19, 2019

5fc8de44

Add match_matrix_tensor op (#18525) · 78a3d837

由 Aurelius84 提交于 8月 19, 2019

* add matrch_matrix_tensor op test=develop

* fix ignore unittest if with_mkl=off test=develop

* clean code and rm is_test param test=develop

* modify API.spec test=develop

* rm useless code in search_compute.h test=develop

* modify api.spec test=develop

* modify default_grad.spec test=develop

* Add API test code test=develop

* clean code in search_computer.h

* modify PADDLE_ENFORCE and clean search_compute.h test=develop

* fix code style test=develop

78a3d837

Z

merge develop to solve conflict, also fix API doc, test=develop (#18823) · 5b6673c4
由 Zeng Jinle 提交于 8月 19, 2019

5b6673c4

add fl_listen_and_serv &fl_transpiler,test=develop (#19091) · 539c8707

由 zhang wenhui 提交于 8月 19, 2019

add fl_listen_and_serv op for Federated_learning and fl_distribute_transpiler add this op to pserver program . This op just listen the endpoint and sum&scale.

539c8707

S
change PADDLE_ENFORCE to PADDLE_ENFORCE_CUDA_SUCCESS (#19205) · af0fbd90
由 silingtong123 提交于 8月 19, 2019
```
* print error code if cuda related API fails
```
af0fbd90

18 8月, 2019 1 次提交
- G
  Unset unittests http_proxy env to avoid timeout. (#19269) · fd4b15a2
  由 gongweibao 提交于 8月 18, 2019
```
Unset unittests http_proxy env to avoid timeout.
```
  fd4b15a2
16 8月, 2019 2 次提交
- K
  fix temporal_shift OP PADDLE_ENFORCE. test=develop (#19161) · 2848cb79
  由 Kaipeng Deng 提交于 8月 16, 2019
```
* fix temporal_shift OP PADDLE_ENFORCE. test=develop

* fix HasInput/HasOutpu ENFORECE. test=develop
```
  2848cb79
- Z
  
  move_flags_to_unified_files_for_management, test=develop (#19224) · 708bd979
  由 Zeng Jinle 提交于 8月 16, 2019
  
  708bd979
15 8月, 2019 2 次提交

A
Add generalized Conv+Activation MKLDNN fuse pass creation (#19072) · b837689e
由 Adam 提交于 8月 15, 2019
```
test=develop
```
b837689e

Add padding support for crf_decoding (#19057) · 50b1cab1

由 Yibing Liu 提交于 8月 15, 2019

* Add padding support for crf_decoding

* Fixes in comupte kernel

test=develop

* Update API Spec

test=develop

* Update API.spec

test=develop

* Avoid using paddle_enforce

test=develop

* Fix enforce

test=develop

50b1cab1

14 8月, 2019 3 次提交
- C
  Fix gather op bug (#19168) · b5ba801e
  由 chengduo 提交于 8月 14, 2019
```
* fix gather op bug
test=develop
```
  b5ba801e
- L
  Remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR() (#19166) · 80eab822
  由 Leo Chen 提交于 8月 14, 2019
```
* remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR(), test=develop

* remove SplitIdsOpGradMaker since it is buggy and not tested, update spec file, test=develop
```
  80eab822
- C
  Use CUDAPinnedPlace in buffered_reader (#19112) · c70a97f4
  由 chengduo 提交于 8月 14, 2019
```
Use CUDAPinnedPlace in buffered_reader
```
  c70a97f4
13 8月, 2019 1 次提交

Instag Implemention (#18394) · 6ac32d09

由 Jiawei Wang 提交于 8月 13, 2019

* instag lod tensor impl

* First PR for instag

* First PR for instag

* Before adding Selection Rows.

* Change name from instag to filter_instag, add upgrade the impl of filter_instag

* Change name from instag to filter_instag, add upgrade the impl of filter_instag

* Fix yapf error in gradient_checker.py to pass Travis-CI

* Fix Filter Instag Grad test=develop

* Fix Filter Instag Grad test=develop

* 1) Fix API.spec, add filter_instag Op. 2) Add Vector Support for CUDA. test=develop

* Impl Loss_weight and empty output handler

* change Loss Weight datatype to Float32, and add Loss Weight as 2nd output

* 1) Support Tensor Input(without LOD) 2) Add Unit test

* Filter By Instag Final test=develop

* Update API.spec for filter_by_instag test=develop

* Update API.spec for filter_by_instag 2 test=develop

* Add Filter By Instag Coverage

* code format of test_layers.py

* code format test_layers.py test=develop

* Make API args more readable test=develop

* Make API args more readable and pass code format test=develop

* Filter By Instag Op, Rename Map to Index Map test=develop

* Filter By Instag Op, code format err in filter_by_instag_op.cc  test=develop

* Filter by instag op: code format of cpp files test=develop

* Filter by instag Op: Api spec modification test=develop

* Filter by instag Op: Api spec doc id modification test=develop

* Filter by instag Op: Api spec and doc preview  test=develop test=document_preview

* Filter By Instag Op, fix doc erro test=document_preview test=develop

* Filter By Instag Op, fix doc err and Api spec test=document_preview test=develop

* Filter By Instag Op, fix Api spec test=document_preview test=develop

* Filter By Instag Op, fix Paddle Encoforce deprecated warning test=document_preview test=develop

* Filter By Instag Op, fix Paddle Encoforce deprecated and code format warning test=document_preview test=develop

6ac32d09

12 8月, 2019 5 次提交
- H
  Add hard swish op (new op) (#19001) · 20f18930
  由 huangjun12 提交于 8月 12, 2019
```
* add hard_swish activation op (new op)
test=develop

* remove redundancy files

* modify document content of HardSwish OP

* add API test in test_layers.py

* add dynamic_graph for test_hard_swish
```
  20f18930
- J
  Replace Relu with bounded Relu in MobileNetV2 quantization (#18988) · bce72c7f
  由 joanna.wozna.intel 提交于 8月 12, 2019
```
test=develop
```
  bce72c7f
- W
  
  refine infer shape in box decoder and assign op, test=develop (#19118) · 1fc242a7
  由 wangguanzhong 提交于 8月 12, 2019
  
  1fc242a7
- G
  Polish fleet API to support cuda collective mode and nccl2 mode. (#18966) · 29d87812
  由 gongweibao 提交于 8月 12, 2019
```
Polish fleet API to support cuda collective mode and nccl2 mode
```
  29d87812
- K
  fix code too big test=develop (#19111) · 945f3cf6
  由 Kevin 提交于 8月 12, 2019
```
Fix seq_pool failed when input dims is too large.
Resolve issue #3023
```
  945f3cf6
09 8月, 2019 4 次提交

Z

remove unused inplace act codes, test=develop (#19079) · 88f111f8
由 Zeng Jinle 提交于 8月 09, 2019

88f111f8

add eye op, kernel and unitest test=develop (#18980) · 4397cb31

由 ShenLiang 提交于 8月 09, 2019

* add eye op,test=document_preview test=develop

* fix the API.spec, test=develop

* fix the document, test=document_preview test=develop

* add unitest for CI coverage, test=develop

4397cb31

Add trilinear_interp OP (#18711) · f86fead6

由 Kaipeng Deng 提交于 8月 09, 2019

* add trilinear interp. test=develop

* fix unittest. test=develop

* add python api and test_layers. test=develop

* refine API.spec. test=develop

* fix format. test=develop

* add python API test. test=develop

* format code. test=develop

* refine code strcuture. test=develop

* fix format

* fix doc. test=develop

* fix converage. test=develop

* fix format. test=develop

f86fead6

Z
optimize error message for "embedding" and "cross_entropy" OP (#18765) · c2063217
由 Zhang Ting 提交于 8月 09, 2019
```
* optimize error message, test=develop

* optimize error message, test=develop
```
c2063217

06 8月, 2019 1 次提交
- Y
  Add the check of lod in sequence_softmax kernel. (#18996) · a445c335
  由 Yiqun Liu 提交于 8月 06, 2019
```
* Add the check of lod in sequence_softmax kernel.
test=develop

* Refine the comments.
test=develop
```
  a445c335

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功