提交 · 2b896c1f6bd78abc32df3af1f3102f805004a614 · PaddlePaddle / Paddle

19 4月, 2020 1 次提交

Support LoDTensorArray in fetch (#23645) · 2b896c1f

由 guofei 提交于 4月 19, 2020

* Support LoDTEnsorArray in fetch op

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

* Support LoDTensorArray in fetch

test=develop

2b896c1f

17 4月, 2020 2 次提交
- G
  Modify documents of executor and randn and fix other errors (#23879) · d8ca66da
  由 gfwm0502 提交于 4月 17, 2020
```
test=develop
```
  d8ca66da
- G
  OP/API (While/while_loop/DynamicRNN) : Error Message Enhancement (#23896) · a7563602
  由 gfwm0502 提交于 4月 17, 2020
```
As the title
```
  a7563602
15 4月, 2020 1 次提交
- G
  OP(compare/get_places/shrink_rnn_memory) error message enhancement (#23780) · af149f25
  由 gfwm0502 提交于 4月 15, 2020
```
As the title.
```
  af149f25
14 4月, 2020 1 次提交
- K
  optimize compare and logical ops error info, add test case for this ops · dd3ae023
  由 kinghuin 提交于 4月 14, 2020
```
* optimize compare and logical ops error info
* add out and cond dtype test
```
  dd3ae023
10 4月, 2020 1 次提交
- Z
  Solve the conflict of ops with the same name, test for CI. (#23573) · 84cd45f6
  由 Zhen Wang 提交于 4月 10, 2020
```
* solve the conflict of ops with the same name. test=develop
```
  84cd45f6
09 4月, 2020 2 次提交
- H
  API/OP (ConditionalBlock) error message enhancement (#23480) · a82ce2b1
  由 Huihuang Zheng 提交于 4月 09, 2020
```
API/OP (ConditionalBlock) error message enhancement (#23480)
```
  a82ce2b1
- Y
  
  Op(fetch) error message enhancement. (#23542) · 4489f0d3
  由 Yiqun Liu 提交于 4月 09, 2020
  
  4489f0d3
08 4月, 2020 3 次提交
- L
  
  OP (tensor_array_read_write) error message enhancement. test=develop (#23468) · dc225ed2
  由 liym27 提交于 4月 08, 2020
  
  dc225ed2
- Y
  
  Enhance the error message of feed_op. (#23526) · 55d0c8fd
  由 Yiqun Liu 提交于 4月 08, 2020
  
  55d0c8fd
- H
  OP (recurrent) error message enhancement (#23481) · 71b5f1d2
  由 Huihuang Zheng 提交于 4月 08, 2020
```
* OP (recurrent) error message enhancement
```
  71b5f1d2
05 4月, 2020 1 次提交
- T
  Revert "Solve the conflict of ops with the same name. (#23199)" (#23494) · 0b583235
  由 Tao Luo 提交于 4月 05, 2020
```
This reverts commit abe3e690.
test=develop
```
  0b583235
04 4月, 2020 2 次提交

Delete Ref & VectorRef and add GetDataSafely (#22997) · 16315d3d

由 Chen Weihang 提交于 4月 04, 2020

* delete invalid check inferface Ref & VectorRef, test=develop

* fix vector ref delete error, test=develop

* try the new check inferface, test=develop

* change all related code with new check macro, test=develop

* remove static assert, test=develop

* polish detail, test=develop

* skip coverage problem, test=develop

* add new check macro, test=develop

16315d3d

Z
Solve the conflict of ops with the same name. (#23199) · abe3e690
由 Zhen Wang 提交于 4月 04, 2020
```
* solve the conflict of ops with the same name. test=develop
```
abe3e690

03 4月, 2020 1 次提交

update linspace, equal operators to API 2.0 (#23274) · a2e10930

由 channings 提交于 4月 03, 2020

* update linspace, equal operators to API 2.0, test=develop

* equal support higher performance CUDA kernel, test=develop

* update comment of equal&linspace operator, test=develop

* update comment of equal&linspace operator, test=develop

a2e10930

01 4月, 2020 1 次提交
- W
  refine the error message (#23212) · 69e3f993
  由 wangchaochaohu 提交于 4月 01, 2020
```
* refine the error message of tensor_array_read_write Op
```
  69e3f993
09 3月, 2020 1 次提交

Imperative tracer refactoring (#22457) · d33c4343

由 Zeng Jinle 提交于 3月 09, 2020

* refine grad maker, test=develop

* refactor tracer stage 1, test=develop

* merge develop to solve conflict third times, test=develop

d33c4343

12 2月, 2020 1 次提交

Add support for dynamic_decode(while) training. (#22231) · 31b54646

由 Guo Sheng 提交于 2月 12, 2020

* Add support for dynamic_decode(while) training. test=develop

* Fix assign_op and tensor_array_read_write_op after solving conflict. test=develop

* Fix test_rnn_decode_api.py. test=develop

* Refine docs for apis in rnn.py. test=develop

* Adjust outputs of dynamic_decode. test=develop

* Remove the force_cpu update in assign_op. test=develop

* Remove the force_cpu update in assign_op. test=develop

* Make RNNCell.get_initial_states support batch_dim_idx argument. test=develop

* Rename _create_array_outof_while as _create_array_out_of_while in rnn.py.
test=develop

31b54646

16 1月, 2020 1 次提交
- Z
  
  fix typo in error message (#22312) · 805328e1
  由 zhangchunle 提交于 1月 16, 2020
  
  805328e1
06 1月, 2020 1 次提交
- J
  
  [MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088) · b0b27ff6
  由 Jacek Czaja 提交于 1月 06, 2020
  
  b0b27ff6
24 12月, 2019 1 次提交
- G
  
  Modify the while_loop API (#21844) · 46f9184a
  由 guofei 提交于 12月 24, 2019
  
  46f9184a
19 12月, 2019 1 次提交
- G
  Make While Op could run on GPU place and add while_loop unittest (#21672) · 8b7c50f4
  由 guofei 提交于 12月 19, 2019
```
1. Make while_op accept GPU conditional data
2. Add more complex test cases for while_loop API
```
  8b7c50f4
17 12月, 2019 1 次提交
- H
  
  Fix That conditional_block_op Doesn't Have InferShape (#21733) · 0677a1c1
  由 Huihuang Zheng 提交于 12月 17, 2019
  
  0677a1c1
06 12月, 2019 2 次提交

Polish op registry codes (#21561) · 0f888836

由 Zeng Jinle 提交于 12月 06, 2019

* polish infer shape registry, test=develop

* modify some operators registry, test=develop

0f888836

Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72

由 Huihuang Zheng 提交于 12月 06, 2019

Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests.

Fix bugs:

1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op.

2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var.

This PR also did some code clean up:
1. Print the var name when sgd_op catches shape error so that it is easier to debug
2. Fix a typo: dicta -> dict

1dcf6a72

29 11月, 2019 2 次提交

Fix Cond Bug for Nested Control Flow (#21340) · 630be319

由 Huihuang Zheng 提交于 11月 29, 2019

* Commit before merging develop

test=develop

* Backup after working with Huihuang logs

* Commit before deleting Huihuang debug loggings

* Commit before debug

test=develop

* Fix bug commit

test=develop

* Backup of fixing bugs

test=develop

* Clean up code

test=develop

* Fix a bug in sum_op

test=develop

630be319

J

[MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375) · cd43c444
由 Jacek Czaja 提交于 11月 29, 2019

cd43c444

31 10月, 2019 1 次提交

GradMaker for dygraph (#19706) · 8c4573a3

由 hong 提交于 10月 31, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* optimize grad maker; test=develop

* optimize grad maker

* test

* grad make optim; test=develop

* fix unittest bugs; test=develop

* add dygraph grad op maker and split_op

* grad op maker refactor; test=develop

* add dygraph grad maker; test=develop

* fix op deformable_conv_v1_op bug; test=develop

* fix deformable_conv prroi pool bugs;

* fix new op grad op maker bug; test=develop

* fix split by ref bug; test=develop

* fix dygraph auto prune bug; test=develop

* fix test_trace bug; test=develop

* fix fused emb seq pool bug; test=develop

* remove useless code in op_desc file; test=develop

* remove useless code, StrVarBaseNode; test=develop

* fix review issues; test=develop

* fix rank_loss grad maker; test=develop

* remove flag in VarBase; test=develop

* fix distributed_notify_op compile bug ; test=develop

* fix reshape op double grad; test=develop

* fix expand as op; test=develop

* add impertive type_defs.h for demo_train; test=develop

* fix inference lib cmake; test=develop

* fix inference lib; test=develop

* fix infernce_lib; test=develop

* fix inference cmake; test=develop

* fix inference lib; test=develop

* fix inference lib; test=develop

* remove condition dygraph grad maker, modify local name; test=develop

* fix split grad maker bug; test=develop

* fix pyramid_op bug; test=develop

* change travis time out limit; test=develop

* restore travis; test=develop

* change timeout limit; test=develop

8c4573a3

29 10月, 2019 1 次提交

Check and correct the output's lod_level in DynamicRNN related operators (#19144) · 6fcfd32e

由 Yiqun Liu 提交于 10月 29, 2019

* Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime.
test=develop

* Add comment for ReorderLoDTensorByRank op.

* Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time.
test=develop

* ShrinkRNNMemory op should call ShareLoD for compile time.
test=develop

* Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool.
test=develop

* Refine the unittest of DynamicRNN.
test=develop

* Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE.
test=develop

6fcfd32e

14 10月, 2019 1 次提交
- C
  Add sub-scope check in RecurrentOp (#20468) · 36c85ef4
  由 chengduo 提交于 10月 13, 2019
```
* fix recurrent bug
test=develop
```
  36c85ef4
11 10月, 2019 1 次提交

modify english api (#20159) · 2893cd1a

由 Wilber 提交于 10月 11, 2019

* modify english api test=develop test=document_fix

- leaky_relu
- less_than
- log
- logical_and
- logical_or
- logical_xor
- logical_not

2893cd1a

18 9月, 2019 1 次提交
- Z
  
  fix gc bug in controlflow ops, test=develop (#19827) · 3fd3b663
  由 Zeng Jinle 提交于 9月 18, 2019
  
  3fd3b663
05 9月, 2019 1 次提交
- T
  paddle::framework::vectorize() templatization (#19627) · d6c85c96
  由 Tao Luo 提交于 9月 05, 2019
```
test=develop
```
  d6c85c96
30 8月, 2019 1 次提交

[MKL-DNN] Fix to face model on AVX512 platforms (#19282) · ecd9f330

由 Jacek Czaja 提交于 8月 30, 2019

- Refactor step 1

- Compilation fix

- Yet another compilation fix

- Even more compilation fix

- Lint fixes

test=develop

- Removed deprectaed PADDLE_ENFORCE occurance

test=develop

- Candidate fix to BN forward

- Lint fixes

test=develop

- Refactoring in data_layout_transform

- compilation fix

- Another comppilation fix

- Step further into darkness

- Yet another compilation fix

- Yet another compilation fix

- missing header

- compilation fix

- Added MKLDNN -> Paddle conversion in fetch op

test=develop

- Compilation fix

test=develop

- Lint

test=develop

- Mul fix

- Fix to MKLDNN MUL op and Elementwise MUL UT

test=develop

- Workaround for diffrent weights with groups representation Paddle vs
  MKL-DNN.

test=develop

- Candidate fix for 5D convolution with groups

- Refactor of fix for conv3d and conv2d in fetch op

test=develop

- Compilation fix

- Still same compilation fix

- Compilation fix

- Compilation fix

- Reverted refactoring of fixes

- Adapted test_conv2d_int8_mkldnn so it exects data in NCHW format
  not NHWC

test=develop

- minor fix in UT

test=develop

- Lint fixes

test=develop

ecd9f330

29 8月, 2019 1 次提交

Increase num_iteration_per_drop_scope (#19075) · b6d1d890

由 chengduo 提交于 8月 29, 2019

* increase num_iteration_per_drop_scope
test=develop

* Fix bug of while_op
test=develop

* fix bug of whileOp
test=develop

b6d1d890

02 8月, 2019 1 次提交

Open gc by default (#18836) · 7ac748ad

由 Zeng Jinle 提交于 8月 02, 2019

* open gc by default, test=develop

* fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop

* fix conditional_block op eager deletion bug, test=develop

* add some comments to reviewers, test=develop

7ac748ad

19 7月, 2019 1 次提交

Support memory eager deletion on recurrent OP (#17710) · 89bc3fd8

由 Huihuang Zheng 提交于 7月 19, 2019

Test PaddingRNN on V100 GPU device.

Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.

GPU memory (MiB): 6414 (this PR) vs 6837 (without this PR)
Speed (steps/s): 10.28 (this PR) vs 9.89 (without this PR)

89bc3fd8

08 7月, 2019 1 次提交

Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532) · 88b52a27

由 Zhaolong Xing 提交于 7月 08, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

88b52a27

10 5月, 2019 1 次提交

Double backward of conv2d. (#17211) · e32c9888

由 qingqing01 提交于 5月 10, 2019

* Add conv2d_grad_grad_op
* Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
    - Now use it in conv2d_grad_grad.
    - Will simply the searching code in conv2d and conv2d_grad in next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support to fetch empty variables，return None in Python.

e32c9888

17 4月, 2019 1 次提交
- Y
  Update logical_op.cc · 8cff2b42
  由 Yan Chunwei 提交于 4月 17, 2019
```
test=develop
```
  8cff2b42

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功