提交 · b3a3e6f60ce05a9e75f56ff097296d192c0e801a · Crayon鑫 / Paddle

13 11月, 2019 1 次提交
- G
  Use 2 cards for hallreduce unit test. (#21085) · a5fc291f
  由 gongweibao 提交于 11月 13, 2019
```
use 2 cards test=develop
```
  a5fc291f
12 11月, 2019 5 次提交

Add Asypadding for conv fusion. (#21041) · 4a544762

由 zhaoyuchen2018 提交于 11月 12, 2019

* Add Asypadding for conv fusion.

test=develop

reference: pr/20042

* Fix eigen build link error

* Change back file mode

* Use math function & add more checks.

4a544762

W

Fix dgc buffer illegal & reuse velocity (#21012) · de5d3ff6
由 WangXi 提交于 11月 12, 2019

de5d3ff6

modify the implementation of save_persistables and save_inference_model for... · 53148e06

由 lilong12 提交于 11月 12, 2019

modify the implementation of save_persistables and save_inference_model for fleet collective mode (#20802)

* modify the implementation of  save_persistables and save_inference_model functions for fleet collective, test=develop

* add ut, test=develop

53148e06

C
fix instance norm (#21042) · f62a9291
由 ceci3 提交于 11月 12, 2019
```
* fix instance norm

* update unitest,test=develop
```
f62a9291

fix the computation for dx (grad for x) for prelu operation. (#20949) · e249d9a3

由 lilong12 提交于 11月 12, 2019

* set the default value of alpha for prelu to 0.25, test=develop

* add the call to __syncthreads(), test=develop

* fix the implementation of cpu prelu, test=develop

* repair the implementation of element mode prelu, test=develop

* modify test_prelu_op.py, test=develop

e249d9a3

11 11月, 2019 3 次提交

H

Add basic Python Cond Layer (#21050) · e64d55f0
由 Huihuang Zheng 提交于 11月 11, 2019

e64d55f0
H

Disable cudnn_conv in unit tests. (#21080) · dcf371b6
由 Huihuang Zheng 提交于 11月 11, 2019

dcf371b6

Add the check of lod_level between compile-time and runtime. (#20961) · 35f17ae2

由 Yiqun Liu 提交于 11月 11, 2019

* Add the check of lod_level between compile-time and runtime.
test=develop

* Fix bug in check_compile_vs_runtime.
test=develop

* Fix the check of output when it is dispensiable or intermediate.
test=develop

* Share lod of x to out in match_matrix_tensor op in compile-time.

* Implement GetLoDLevel in InferShapeContext.

* Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op.
test=develop

* Enable check_compile_vs_runtime in test_match_matrix_tensor.

* Add the implementation of SetLoDLevel in InferShapeContext.

* Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead.

* Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead.

* Refine some ops and unittests.
test=develop

* Fix a typo.
test=develop

* Remove the check of var type, and change int to int32_t.
test=develop

* Add unittest for Get/SetLoDLevel.
test=develop

35f17ae2

08 11月, 2019 3 次提交

Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835

由 joanna.wozna.intel 提交于 11月 08, 2019

* Add transpose2 INT8 for mkl-dnn

test=develop

* Fix test_transpose_int8_mkldnn

test=develop

* Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"

This reverts commit 34011bdb, reversing
changes made to 2ce6473f.

* Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""

This reverts commit 23754dd7.

* Add template to TransposeMKLDNNHandler

test=develop

* Resolve conflict

test=develop

* Restore get_size and refactor

test=develop

77c20835

L

add op locality_aware_nms, test=develop (#20976) · 06063b70
由 LielinJiang 提交于 11月 08, 2019

06063b70

fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation,... · 26a6e27a

由 liym27 提交于 11月 08, 2019

fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)

* fix bug in pool/conv/conv_transpose:
1. It should be stride[i] not stride[0] in UpdatePaddingAndDilation;
2. fix bug of func  _get_padding_with_SAME in test_conv/conv_transpose_op.py;
3. fix bug of the computation process in function conv2dtranspose_forward_naive.
test=develop

* change test to make the data of different dimensions different. test=develop

26a6e27a

07 11月, 2019 2 次提交
- A
  Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062) · 3fda695b
  由 Adam 提交于 11月 07, 2019
```
* Add asymetric padding support for mkldnn pooling
test=develop

* Add asymetric padding support for mkldnn conv
test=develop

* Add asymetric padding support for mkldnn conv_transpose
test=develop
```
  3fda695b
- H
  Add select_input_op and select_output_op (#21016) · 1957192f
  由 Huihuang Zheng 提交于 11月 07, 2019
```
These ops are useful in control flow.
```
  1957192f
06 11月, 2019 2 次提交
- H
  fix uniform random (#21009) · 72e0969b
  由 hong 提交于 11月 06, 2019
```
* fix uniform random; test=develop

* add uniform random test; test=develop
```
  72e0969b
- L
  
  Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017) · f0e95a60
  由 liym27 提交于 11月 06, 2019
  
  f0e95a60
05 11月, 2019 2 次提交

H

Add grad_name Property for Class Variable (#20991) · 4cf96cd3
由 Huihuang Zheng 提交于 11月 05, 2019

4cf96cd3

simplify master+patch，remove ins when size != merge_size or has conflict slot (#20913) · 1d1a0793

由 xujiaqi01 提交于 11月 05, 2019

* remove duplicate code and duplicate config of master+patch
* drop all ins which has conflict slot or size < merge_size
* user only need to set merge size，if ins num of same id is not equal to merge size, just drop these ins
* user must make sure master data and patch data has no same slot whose feasigns are both non-zero, otherwise these ins will be dropped. (slot list should still be the same of both master and patch)
* test=develop

1d1a0793

04 11月, 2019 1 次提交
- Z
  
  lrn supports channel_last input, test=develop (#20954) · de9bec60
  由 Zhang Ting 提交于 11月 04, 2019
  
  de9bec60
02 11月, 2019 1 次提交

add launch_ps module so that we can launch a parameter server trainin… (#20936) · a6747a6e

由 Dong Daxiang 提交于 11月 02, 2019

* add launch_ps module so that we can launch a parameter server training job
1) a user can specify worker_num and server_num
2) parameter server can be killed after all workers exit
3) unit test is added
test=develop

a6747a6e

01 11月, 2019 3 次提交

L

tensor.set() supports array list and remove unused code, test=develop (#20959) · 2c3c579b
由 Leo Chen 提交于 11月 01, 2019

2c3c579b

Update Tensor.set() to support float16 (#19964) · 9974e407

由 Leo Chen 提交于 11月 01, 2019

* don't expose numerous Tensor.set(), test=develop

* fix condition, test=develop

* fix float16 bug, test=develop

* feed should be Tensor or np.array, not Variable or number, test=develop

* use forcecast to copy numpy slice to new array, test=develop

* remove float16-uint16 hacking, test=develop

9974e407

1
Optimize decay (#20816) · 20cdff0e
由 123malin 提交于 11月 01, 2019
```
* update pserver decay blocks

* update distributed notify handler
```
20cdff0e

31 10月, 2019 5 次提交

Fix Paddle Cloud role maker (#20860) · 16596f64

由 Chengmo 提交于 10月 31, 2019

* fix PaddleCloud Role maker & add warning in distribute transpiler  & change rpc_retry_times

16596f64

L

Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912) · 59de8e12
由 liym27 提交于 10月 31, 2019

59de8e12

maxout supports channel_last input (#20846) · 8d1e9f0f

由 Zhang Ting 提交于 10月 31, 2019

* maxout support channel_last input, test=develop

* modified details of Input(X) and Attr(groups, axis) in doc, test=develop

8d1e9f0f

GradMaker for dygraph (#19706) · 8c4573a3

由 hong 提交于 10月 31, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* optimize grad maker; test=develop

* optimize grad maker

* test

* grad make optim; test=develop

* fix unittest bugs; test=develop

* add dygraph grad op maker and split_op

* grad op maker refactor; test=develop

* add dygraph grad maker; test=develop

* fix op deformable_conv_v1_op bug; test=develop

* fix deformable_conv prroi pool bugs;

* fix new op grad op maker bug; test=develop

* fix split by ref bug; test=develop

* fix dygraph auto prune bug; test=develop

* fix test_trace bug; test=develop

* fix fused emb seq pool bug; test=develop

* remove useless code in op_desc file; test=develop

* remove useless code, StrVarBaseNode; test=develop

* fix review issues; test=develop

* fix rank_loss grad maker; test=develop

* remove flag in VarBase; test=develop

* fix distributed_notify_op compile bug ; test=develop

* fix reshape op double grad; test=develop

* fix expand as op; test=develop

* add impertive type_defs.h for demo_train; test=develop

* fix inference lib cmake; test=develop

* fix inference lib; test=develop

* fix infernce_lib; test=develop

* fix inference cmake; test=develop

* fix inference lib; test=develop

* fix inference lib; test=develop

* remove condition dygraph grad maker, modify local name; test=develop

* fix split grad maker bug; test=develop

* fix pyramid_op bug; test=develop

* change travis time out limit; test=develop

* restore travis; test=develop

* change timeout limit; test=develop

8c4573a3

Refine the cache of program, context and scope in executor. (#18483) · 16e4d026

由 Yiqun Liu 提交于 10月 31, 2019

* Refine the cache of program, context and scope in executor.
test=develop

* Refine the unittest test_executor_and_use_program_cache.

* Add the test the PaddingRNN with use_program_cache=True.
test=develop

* Remove a check.
test=develop

* Refine the unittest to check whether it is correct when setting use_program_cache=True.
test=develop

16e4d026

30 10月, 2019 1 次提交
- W
  fix jit_matmul bug test=develop (#20886) · b4897600
  由 Wilber 提交于 10月 30, 2019
```
* fix jit_matmul bug 

* update jit matmul and add test
```
  b4897600
29 10月, 2019 8 次提交

save load problem fix and new feature add (#20823) · ff0886a9

由 hong 提交于 10月 29, 2019

* fix persistable;

* fix save load bugs; test=develop

* fix bug; test=develop

* add example for new io api; test=develop

* addd example; test=develop

ff0886a9

Add Sequential api (#20789) · 2058bab1

由 Youwei Song 提交于 10月 29, 2019

* add Sequential api
test=develop

* fix unittest
test=develop

* refine code sample

* test=develop

2058bab1

support Tensor for split and concat, support -1 in num_or_sections, add check... · 6802539a

由 liym27 提交于 10月 29, 2019

support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)

* improve split and concat op:
1. support Tensor for argument 'dim' in split op.
2. support Tensor for argument 'axis' in concat op.
test=develop

* redefine function GetDataFromTensor and set unknown output shape to - 1.
test=develop

* add check: Attr(sections) match Input(X). test=develop

* support Tensor for attr(sections) and attr(sections) can contain -1.
add check for attr(sections).
test=develop

* modify error message for concat and call Resize only when necessary. test=develop

6802539a

Check and correct the output's lod_level in DynamicRNN related operators (#19144) · 6fcfd32e

由 Yiqun Liu 提交于 10月 29, 2019

* Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime.
test=develop

* Add comment for ReorderLoDTensorByRank op.

* Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time.
test=develop

* ShrinkRNNMemory op should call ShareLoD for compile time.
test=develop

* Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool.
test=develop

* Refine the unittest of DynamicRNN.
test=develop

* Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE.
test=develop

6fcfd32e

Z

fix py_reader combination ut, test=develop (#20861) · da9e9dd0
由 Zeng Jinle 提交于 10月 29, 2019

da9e9dd0

improve unsqueeze op to support int, Tensor for argument axes (#20824) · 84d221b6

由 liym27 提交于 10月 29, 2019

* improve unsqueeze op to support int, Tensor and Tensor list for argument axes.
test=develop

* call Resize only when necessary. test=develop

84d221b6

S
Make shape tensor support int32 (#20757) · 03d7f3dd
由 silingtong123 提交于 10月 29, 2019
```
*  Make shape tensor support int32
```
03d7f3dd
H

Add shape and type check at read_op (#20754) · 95ba4bd2
由 Huihuang Zheng 提交于 10月 29, 2019

95ba4bd2

28 10月, 2019 2 次提交
- A
  
  add pyramid_hash_op (#20698) · aacd16db
  由 Aurelius84 提交于 10月 28, 2019
  
  aacd16db
- W
  
  Fix roi_perspective_transform op (#20764) · c8e49be2
  由 whs 提交于 10月 28, 2019
  
  c8e49be2
25 10月, 2019 1 次提交

fix several sparse table issuses (#20686) · 48669aa8

由 xujiaqi01 提交于 10月 25, 2019

* no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto.
* add find_distributed_lookup_table_grads instead of hard code GRAD
* support embedding stop gradient. push sparse has error before fix this.* 
* fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this.
* fix pull sparse, skip slots which do not have embedding.
* fix collect feasign label info, skip slots which do not have embedding.
* support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables.
* test=develop

48669aa8

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致