提交 · 3358455c86b2f1a0ff72892ea361f7bfe43fda7e · Crayon鑫 / Paddle

31 10月, 2019 2 次提交

C

Polish and arrange code in enforce.h (#20901) · 3358455c
由 Chen Weihang 提交于 10月 31, 2019

3358455c

Refine the cache of program, context and scope in executor. (#18483) · 16e4d026

由 Yiqun Liu 提交于 10月 31, 2019

* Refine the cache of program, context and scope in executor.
test=develop

* Refine the unittest test_executor_and_use_program_cache.

* Add the test the PaddingRNN with use_program_cache=True.
test=develop

* Remove a check.
test=develop

* Refine the unittest to check whether it is correct when setting use_program_cache=True.
test=develop

16e4d026

30 10月, 2019 5 次提交
- W
  fix jit_matmul bug test=develop (#20886) · b4897600
  由 Wilber 提交于 10月 30, 2019
```
* fix jit_matmul bug 

* update jit matmul and add test
```
  b4897600
- Y
  Move the codes of fused operators to operators/fused directory. (#20881) · 03ba0fda
  由 Yiqun Liu 提交于 10月 30, 2019
```
* Move the codes of fused operators to operators/fused directory.
test=develop

* Correct the op name in cmake.

* Change the use of PADDLE_ENFORCE.
test=develop
```
  03ba0fda
- L
  
  add c++ unique_name_generator, test=develop (#20871) · a9bc92c3
  由 Leo Chen 提交于 10月 30, 2019
  
  a9bc92c3
- Z
  
  fix select_rows mergeadd bug, test=develop (#20876) · d4289125
  由 zhang wenhui 提交于 10月 30, 2019
  
  d4289125
- Z
  
  refine err msg of allocator, test=develop (#20879) · c51722c8
  由 Zeng Jinle 提交于 10月 30, 2019
  
  c51722c8
29 10月, 2019 9 次提交

save load problem fix and new feature add (#20823) · ff0886a9

由 hong 提交于 10月 29, 2019

* fix persistable;

* fix save load bugs; test=develop

* fix bug; test=develop

* add example for new io api; test=develop

* addd example; test=develop

ff0886a9

support Tensor for split and concat, support -1 in num_or_sections, add check... · 6802539a

由 liym27 提交于 10月 29, 2019

support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)

* improve split and concat op:
1. support Tensor for argument 'dim' in split op.
2. support Tensor for argument 'axis' in concat op.
test=develop

* redefine function GetDataFromTensor and set unknown output shape to - 1.
test=develop

* add check: Attr(sections) match Input(X). test=develop

* support Tensor for attr(sections) and attr(sections) can contain -1.
add check for attr(sections).
test=develop

* modify error message for concat and call Resize only when necessary. test=develop

6802539a

W

strided_slice perforamnce improvement test=develop (#20852) · 28ca2e5f
由 wangchaochaohu 提交于 10月 29, 2019

28ca2e5f

Check and correct the output's lod_level in DynamicRNN related operators (#19144) · 6fcfd32e

由 Yiqun Liu 提交于 10月 29, 2019

* Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime.
test=develop

* Add comment for ReorderLoDTensorByRank op.

* Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time.
test=develop

* ShrinkRNNMemory op should call ShareLoD for compile time.
test=develop

* Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool.
test=develop

* Refine the unittest of DynamicRNN.
test=develop

* Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE.
test=develop

6fcfd32e

Implement a pass detect fusion group of elementwise op (#19884) · b5f3be83

由 Yiqun Liu 提交于 10月 29, 2019

* Add fusion_group_pass and elementwise pattern.

* Rewrite the detector of elementwise group.
test=develop

* Add a comment in codegen.

* Add more unittest cases.
test=develop

* Move code_generator related code to fusion_group directory.

* Correct the including path.

* Add the definition of SubGraph and finish the insert of fusion_group op in pass.

* Insert graph_vis_pass in tester to visualize the graph for debug.

b5f3be83

improve unsqueeze op to support int, Tensor for argument axes (#20824) · 84d221b6

由 liym27 提交于 10月 29, 2019

* improve unsqueeze op to support int, Tensor and Tensor list for argument axes.
test=develop

* call Resize only when necessary. test=develop

84d221b6

S
Make shape tensor support int32 (#20757) · 03d7f3dd
由 silingtong123 提交于 10月 29, 2019
```
*  Make shape tensor support int32
```
03d7f3dd
H

Add shape and type check at read_op (#20754) · 95ba4bd2
由 Huihuang Zheng 提交于 10月 29, 2019

95ba4bd2
Z

lazy init of allocators, test=develop (#20854) · bb8d7783
由 Zeng Jinle 提交于 10月 29, 2019

bb8d7783

28 10月, 2019 5 次提交
- A
  
  add pyramid_hash_op (#20698) · aacd16db
  由 Aurelius84 提交于 10月 28, 2019
  
  aacd16db
- Z
  
  remove some unnecessary logs in pe, test=develop (#20848) · 98103d30
  由 Zeng Jinle 提交于 10月 28, 2019
  
  98103d30
- C
  
  delete paddle infershape enforce marco (#20832) · 8b59ac3a
  由 Chen Weihang 提交于 10月 28, 2019
  
  8b59ac3a
- W
  
  Fix roi_perspective_transform op (#20764) · c8e49be2
  由 whs 提交于 10月 28, 2019
  
  c8e49be2
- C
  Replace risky GetInputType method with secure IndicateVarDataType interface (#20668) · 26cc1fe5
  由 Chen Weihang 提交于 10月 28, 2019
```
* replace part of the old implementation, test=develop

* restore concat op, test=develop

* update all ops implemention & delete GetDataTypeOfVar func, test=develop
```
  26cc1fe5
25 10月, 2019 3 次提交

fix several sparse table issuses (#20686) · 48669aa8

由 xujiaqi01 提交于 10月 25, 2019

* no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto.
* add find_distributed_lookup_table_grads instead of hard code GRAD
* support embedding stop gradient. push sparse has error before fix this.* 
* fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this.
* fix pull sparse, skip slots which do not have embedding.
* fix collect feasign label info, skip slots which do not have embedding.
* support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables.
* test=develop

48669aa8

fix bug in reshape: (#20781) · cf717fd6

由 Yamei-Lee 提交于 10月 25, 2019

consider the situation that shape of input can contain more than one -1.
test=develop

cf717fd6

Make formatted ENFORCE stack adapt to more situations (#20826) · 1d1552d1

由 Chen Weihang 提交于 10月 25, 2019

* Make formatted ENFORCE stack adapt to more situations and polish details, test=develop

* restore template message position, test=develop

1d1552d1

24 10月, 2019 9 次提交
- Z
  
  add some docs to jit.trace, test=develop (#20811) · 378fc4fb
  由 Zeng Jinle 提交于 10月 24, 2019
  
  378fc4fb
- Z
  All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756) · 5a8d885d
  由 Zhang Ting 提交于 10月 24, 2019
```
* All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview

* fix the bug that attr(offsets) should be initialized, test=develop
```
  5a8d885d
- D
  
  fix fp16 grid_size for size=1; test=develop (#20812) · 9171f737
  由 danleifeng 提交于 10月 24, 2019
  
  9171f737
- Z
  
  refine err msg of allocator, test=develop (#20804) · cd1c4043
  由 Zeng Jinle 提交于 10月 24, 2019
  
  cd1c4043
- Z
  Add more error debug message to Operator::Run (#20793) · ac813bba
  由 Zeng Jinle 提交于 10月 24, 2019
```
* add more err msg, test=develop

* add more unittests, test=develop
```
  ac813bba
- T
  make search_compute support avx default (#20779) · efbdad05
  由 Tao Luo 提交于 10月 24, 2019
```
* make search_compute support avx only

* clean search_compute.h

* rename sse_axpy to avx_axpy

test=develop

* update CMakeLists.txt

test=develop
```
  efbdad05
- Z
  add PADDLE_ENFORCE for dygraph to optimize error throw (#19783) · 3556514e
  由 zhongpu 提交于 10月 24, 2019
```
* add PADDLE_ENFORCE for dygraph to optimize error throw, test=develop

* fix some error, test=develop

* delete PADDLE_ENFORCE_EQ in VarBase::NewVarBase, test=develop
```
  3556514e
- W
  
  Fix DGC algorithm flow to make it the same as paper (#20758) · 250e72d2
  由 WangXi 提交于 10月 24, 2019
  
  250e72d2
- W
  
  fix codetest for windows make test=develop (#20796) · ba45dce3
  由 wangchaochaohu 提交于 10月 24, 2019
  
  ba45dce3
23 10月, 2019 7 次提交
- Z
  [Dygraph to static graph]JIT/Trace (#20775) · 8ff6b289
  由 Zeng Jinle 提交于 10月 23, 2019
```
* jit/trace 1st version, test=develop

* add more unittests, test=develop
```
  8ff6b289
- Z
  Fix multihead op bug. (#20783) · 6e6eab07
  由 zhaoyuchen2018 提交于 10月 23, 2019
```
The op should handle k=1024

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  6e6eab07
- L
  Revert "fix_depthwise_conv_cudnn, test=develop (#20712)" (#20782) · dfa0549f
  由 lvmengsi 提交于 10月 23, 2019
```
This reverts commit dc229b41.
```
  dfa0549f
- W
  
  Add norm_by_time for warpctc op in padding mode. (#17580) · 4c7d196d
  由 whs 提交于 10月 23, 2019
  
  4c7d196d
- P
  Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and... · e89c16b9
  由 Pei Yang 提交于 10月 23, 2019
```
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)

* fix pool2d trt converter, test=develop

* add fix for split op converter, test=develop
```
  e89c16b9
- T
  
  del uninstall protobuf (#20769) · 1105b932
  由 tianshuo78520a 提交于 10月 23, 2019
  
  1105b932
- T
  mv sampcd_processor.py to tools/ (#20761) · 2f5f19df
  由 Tao Luo 提交于 10月 23, 2019
```
* mv sampcd_processor.py to tools

test=develop test=document_fix

* update example script

test=develop test=document_fix
```
  2f5f19df

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致