提交 · 25ffa8445d09000495a28efb96b33010df5d5ea2 · Crayon鑫 / Paddle

05 11月, 2019 2 次提交

T
refine murmurhash3_x64_128 for bloom_filter (#20996) · 25ffa844
由 Tao Luo 提交于 11月 05, 2019
```
test=develop
```
25ffa844

Support NoNeedBufferVarsInference in dygraph backward (#20868) · 878a40f5

由 Zeng Jinle 提交于 11月 05, 2019

* support no need buffer vars in dygraph, test=develop

* fix inference compilation error, test=develop

* update no_need_buffer_vars_inference, test=develop

* add unittests for no_need_buffer_vars_context, test=develop

* refine no_need_buffer_vars by return ref, test=develop

* polish some codes, test=develop

878a40f5

04 11月, 2019 4 次提交
- W
  
  refine code for code reuse test=develop (#20988) · bf379fef
  由 wangchaochaohu 提交于 11月 04, 2019
  
  bf379fef
- Z
  
  lrn supports channel_last input, test=develop (#20954) · de9bec60
  由 Zhang Ting 提交于 11月 04, 2019
  
  de9bec60
- L
  
  fix diff in dequantize op between cpu and gpu test=develop (#20953) · 9b666cae
  由 Liufang Sang 提交于 11月 04, 2019
  
  9b666cae
- Z
  
  fix bug in grad_op compute for dygraph, test=develop (#20975) · 065804d3
  由 zhongpu 提交于 11月 04, 2019
  
  065804d3
02 11月, 2019 1 次提交

fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable... · c5341496

由 Wilber 提交于 11月 02, 2019

fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960)

fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param

c5341496

01 11月, 2019 8 次提交
- Z
  fix the bug of conv_transpose cudnn kernel, test=develop (#20958) · f4f85831
  由 Zhang Ting 提交于 11月 01, 2019
```
fix the bug of conv_transpose cudnn kernel: before version 1.6, the data_format is AnyLayout in inference model. When use version 1.6 and load the model which is saved by previous version, the error occurs.  This is because the cudnn kernel in version 1.6 is not compitable with Anylayout setting.
```
  f4f85831
- W
  
  gpu info query refine test=develop (#20904) · 7695b713
  由 wangchaochaohu 提交于 11月 01, 2019
  
  7695b713
- L
  
  tensor.set() supports array list and remove unused code, test=develop (#20959) · 2c3c579b
  由 Leo Chen 提交于 11月 01, 2019
  
  2c3c579b
- W
  
  And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935) · eec4fa90
  由 WangXi 提交于 11月 01, 2019
  
  eec4fa90
- L
  Update Tensor.set() to support float16 (#19964) · 9974e407
  由 Leo Chen 提交于 11月 01, 2019
```
* don't expose numerous Tensor.set(), test=develop

* fix condition, test=develop

* fix float16 bug, test=develop

* feed should be Tensor or np.array, not Variable or number, test=develop

* use forcecast to copy numpy slice to new array, test=develop

* remove float16-uint16 hacking, test=develop
```
  9974e407
- Z
  Fix gru as small frame_size has error. (#20922) · 7f3a445e
  由 zhaoyuchen2018 提交于 10月 31, 2019
```
seems shuffle_sync cannot handle small size

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  7f3a445e
- Z
  
  refine pe when exception raises, test=develop (#20894) · b0c0ffb9
  由 Zeng Jinle 提交于 11月 01, 2019
  
  b0c0ffb9
- 1
  Optimize decay (#20816) · 20cdff0e
  由 123malin 提交于 11月 01, 2019
```
* update pserver decay blocks

* update distributed notify handler
```
  20cdff0e
31 10月, 2019 10 次提交

Fix Paddle Cloud role maker (#20860) · 16596f64

由 Chengmo 提交于 10月 31, 2019

* fix PaddleCloud Role maker & add warning in distribute transpiler  & change rpc_retry_times

16596f64

L

Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912) · 59de8e12
由 liym27 提交于 10月 31, 2019

59de8e12

maxout supports channel_last input (#20846) · 8d1e9f0f

由 Zhang Ting 提交于 10月 31, 2019

* maxout support channel_last input, test=develop

* modified details of Input(X) and Attr(groups, axis) in doc, test=develop

8d1e9f0f

Y

Optimize the kernel implementation of layernorm with openmp (#20895) · b6260f38
由 Yihua Xu 提交于 10月 31, 2019

b6260f38

GradMaker for dygraph (#19706) · 8c4573a3

由 hong 提交于 10月 31, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* optimize grad maker; test=develop

* optimize grad maker

* test

* grad make optim; test=develop

* fix unittest bugs; test=develop

* add dygraph grad op maker and split_op

* grad op maker refactor; test=develop

* add dygraph grad maker; test=develop

* fix op deformable_conv_v1_op bug; test=develop

* fix deformable_conv prroi pool bugs;

* fix new op grad op maker bug; test=develop

* fix split by ref bug; test=develop

* fix dygraph auto prune bug; test=develop

* fix test_trace bug; test=develop

* fix fused emb seq pool bug; test=develop

* remove useless code in op_desc file; test=develop

* remove useless code, StrVarBaseNode; test=develop

* fix review issues; test=develop

* fix rank_loss grad maker; test=develop

* remove flag in VarBase; test=develop

* fix distributed_notify_op compile bug ; test=develop

* fix reshape op double grad; test=develop

* fix expand as op; test=develop

* add impertive type_defs.h for demo_train; test=develop

* fix inference lib cmake; test=develop

* fix inference lib; test=develop

* fix infernce_lib; test=develop

* fix inference cmake; test=develop

* fix inference lib; test=develop

* fix inference lib; test=develop

* remove condition dygraph grad maker, modify local name; test=develop

* fix split grad maker bug; test=develop

* fix pyramid_op bug; test=develop

* change travis time out limit; test=develop

* restore travis; test=develop

* change timeout limit; test=develop

8c4573a3

support dump param of model into afs (#20302) · 59bcdc8a

由 Thunderbrook 提交于 10月 31, 2019

* support dump param to afs
test=develop

* code style
test=develop

* code style
test=develop

* dump param
test=develop

* dump param
test=develop

* dump param
test=develop

* dump param
test=develop

59bcdc8a

C

Add parameter init check add run_startup_progrom error message for fc(mul) (#20906) · 768551b2
由 Chen Weihang 提交于 10月 31, 2019

768551b2
Z

fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897) · c18f1bd7
由 Zhang Ting 提交于 10月 31, 2019

c18f1bd7
C

Polish and arrange code in enforce.h (#20901) · 3358455c
由 Chen Weihang 提交于 10月 31, 2019

3358455c

Refine the cache of program, context and scope in executor. (#18483) · 16e4d026

由 Yiqun Liu 提交于 10月 31, 2019

* Refine the cache of program, context and scope in executor.
test=develop

* Refine the unittest test_executor_and_use_program_cache.

* Add the test the PaddingRNN with use_program_cache=True.
test=develop

* Remove a check.
test=develop

* Refine the unittest to check whether it is correct when setting use_program_cache=True.
test=develop

16e4d026

30 10月, 2019 5 次提交
- W
  fix jit_matmul bug test=develop (#20886) · b4897600
  由 Wilber 提交于 10月 30, 2019
```
* fix jit_matmul bug 

* update jit matmul and add test
```
  b4897600
- Y
  Move the codes of fused operators to operators/fused directory. (#20881) · 03ba0fda
  由 Yiqun Liu 提交于 10月 30, 2019
```
* Move the codes of fused operators to operators/fused directory.
test=develop

* Correct the op name in cmake.

* Change the use of PADDLE_ENFORCE.
test=develop
```
  03ba0fda
- L
  
  add c++ unique_name_generator, test=develop (#20871) · a9bc92c3
  由 Leo Chen 提交于 10月 30, 2019
  
  a9bc92c3
- Z
  
  fix select_rows mergeadd bug, test=develop (#20876) · d4289125
  由 zhang wenhui 提交于 10月 30, 2019
  
  d4289125
- Z
  
  refine err msg of allocator, test=develop (#20879) · c51722c8
  由 Zeng Jinle 提交于 10月 30, 2019
  
  c51722c8
29 10月, 2019 9 次提交

save load problem fix and new feature add (#20823) · ff0886a9

由 hong 提交于 10月 29, 2019

* fix persistable;

* fix save load bugs; test=develop

* fix bug; test=develop

* add example for new io api; test=develop

* addd example; test=develop

ff0886a9

support Tensor for split and concat, support -1 in num_or_sections, add check... · 6802539a

由 liym27 提交于 10月 29, 2019

support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)

* improve split and concat op:
1. support Tensor for argument 'dim' in split op.
2. support Tensor for argument 'axis' in concat op.
test=develop

* redefine function GetDataFromTensor and set unknown output shape to - 1.
test=develop

* add check: Attr(sections) match Input(X). test=develop

* support Tensor for attr(sections) and attr(sections) can contain -1.
add check for attr(sections).
test=develop

* modify error message for concat and call Resize only when necessary. test=develop

6802539a

W

strided_slice perforamnce improvement test=develop (#20852) · 28ca2e5f
由 wangchaochaohu 提交于 10月 29, 2019

28ca2e5f

Check and correct the output's lod_level in DynamicRNN related operators (#19144) · 6fcfd32e

由 Yiqun Liu 提交于 10月 29, 2019

* Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime.
test=develop

* Add comment for ReorderLoDTensorByRank op.

* Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time.
test=develop

* ShrinkRNNMemory op should call ShareLoD for compile time.
test=develop

* Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool.
test=develop

* Refine the unittest of DynamicRNN.
test=develop

* Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE.
test=develop

6fcfd32e

Implement a pass detect fusion group of elementwise op (#19884) · b5f3be83

由 Yiqun Liu 提交于 10月 29, 2019

* Add fusion_group_pass and elementwise pattern.

* Rewrite the detector of elementwise group.
test=develop

* Add a comment in codegen.

* Add more unittest cases.
test=develop

* Move code_generator related code to fusion_group directory.

* Correct the including path.

* Add the definition of SubGraph and finish the insert of fusion_group op in pass.

* Insert graph_vis_pass in tester to visualize the graph for debug.

b5f3be83

improve unsqueeze op to support int, Tensor for argument axes (#20824) · 84d221b6

由 liym27 提交于 10月 29, 2019

* improve unsqueeze op to support int, Tensor and Tensor list for argument axes.
test=develop

* call Resize only when necessary. test=develop

84d221b6

S
Make shape tensor support int32 (#20757) · 03d7f3dd
由 silingtong123 提交于 10月 29, 2019
```
*  Make shape tensor support int32
```
03d7f3dd
H

Add shape and type check at read_op (#20754) · 95ba4bd2
由 Huihuang Zheng 提交于 10月 29, 2019

95ba4bd2
Z

lazy init of allocators, test=develop (#20854) · bb8d7783
由 Zeng Jinle 提交于 10月 29, 2019

bb8d7783

28 10月, 2019 1 次提交
- A
  
  add pyramid_hash_op (#20698) · aacd16db
  由 Aurelius84 提交于 10月 28, 2019
  
  aacd16db

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致