提交 · fcf53e55ff22c54175efdb32ac367c6e04f19900 · PaddlePaddle / Paddle

20 9月, 2019 6 次提交

A
support 2-level lod of input in sequence_pool (#19839) · fcf53e55
由 Aurelius84 提交于 9月 20, 2019
```
* support 2-level lod of input in sequence_pool test=develop

* fix lod level bug in .cu test=develop
```
fcf53e55
C
refine optimier function (#19886) · ae31faaa
由 chengduo 提交于 9月 20, 2019
```
test=developt
```
ae31faaa
Z
group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614) · 93364b45
由 Zhang Ting 提交于 9月 20, 2019
```
1. group_norm support data_layout=NHWC
2. modified doc of group_norm
```
93364b45

modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287) · 439d95e1

由 Zhang Ting 提交于 9月 20, 2019

modified interpolate_op to support tensor attribute

1. the parameter out_shape of image_resize、resize_nearest/bilinear/trilinear can be a list or a 1-D tensor variable. If a list, each element can be an integer or a tensor variable with shape: [1].

2. the parameter scale of above Ops can be a 1-D tensor variable.
modified document of image_resize, resize_nearest, resize_bilinear, resize_trilinear and add some code example.

439d95e1

add crop_tensor_op, test=develop, test=document_preview (#19314) · b3888941

由 Zhang Ting 提交于 9月 20, 2019

add crop_tensor op. The main difference with crop is :

1. If the argument shape is a list, each element is an integer or a tensor variable with shape: [1]. This way is suitable for the case that the shape may be changed each iteration.

2. If the argument shape is a variable. Its rank must be 1. In crop op, the rank of shape must be the same as x

offsets can be a list, in which each element is an integer or a tensor variavle with shape: [1].

b3888941

C
refine executor bug info (#19887) · 1f686744
由 chengduo 提交于 9月 20, 2019
```
test=develop
```
1f686744

19 9月, 2019 9 次提交

F

hide with inference optim API (#17355) · fe18cfdb
由 flame 提交于 9月 19, 2019

fe18cfdb
A
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606) · b125e327
由 Aurelius84 提交于 9月 19, 2019
```
* Remove constraint that last dimension is forced to be 1 in cross_entropy
test=develop

* modify labels last dims test=develop
```
b125e327
G
change _origin_program test=develop (#19863) · e8d3745c
由 gongweibao 提交于 9月 19, 2019
```
change _origin_program test=develop
```
e8d3745c

add precise roi pooling op test=develop (#18960) · a7c440d3

由 wopeizl 提交于 9月 19, 2019

* add precise roi pooling op test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* detail the description test=develop

* test=develop

* elaborate the doc for return type test=develop

* test=develop

a7c440d3

Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6

由 Yiqun Liu 提交于 9月 19, 2019

* Add fc_elementwise_layernorm_fuse pass and unittest.

* Add fused_fc_elementwise_layernorm op and its GPU kernel.
test=develop

* Apply fc_elementwise_layernorm_fuse_pass to GPU inference.

* Add the setting of attrs in the definition of binary_op.
test=develop

* Add comment.

* Implement the unittest.
test=develop

* Change the unittest name of layer_norm.
test=develop

3cd985a6

W
distribute.launch use poll to query subprocess (#19853) · 8c2c8dc6
由 WangXi 提交于 9月 18, 2019
```
distribute.launch use poll to query subprocess
```
8c2c8dc6

Disable test_dygraph_mnist_fp16.py (#19844) · 8e927327

由 chengduo 提交于 9月 19, 2019

* Fix std::ostream& operator<<(std::ostream& os, const Tensor& t)
test=develop

* Fix test_dygraph_mnist_fp16
test=develop

* disable test_dygraph_mnist_fp16
test=develop

* revert tensor_util.cc fix
test=develop

8e927327

J
Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714) · d9db94d7
由 Jie Fang 提交于 9月 19, 2019
```
Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus
```
d9db94d7

Strided slice (#19642) · 47af618f

由 wangchaochaohu 提交于 9月 19, 2019

* strided_slice op basic function test=develop

* test=develop rewrite and fix

* fix bug test=develop

* fix for the PADDLE_ENFORCE usage

* add some unit testw

* fix for the aip  test and copright and fix test=develop

* fix API.spec test=develop

* fix API.spec test=develop

* add axis parameter test=develop

* fix for the build error test=develop

* fix python api  test=develop

* fix the build test=develop

* fix build test=develop

* fix API spec test=develop

* test=develop add some comment and single op test

* fix API spece test=develop

* fix test=develop

* fix test=develop

* fix api test=develop

* fix api test=develop

* fix API.spec test=develop

* fix typo test=develop

* fix API.spec test=develop

* fix API typo test=develop

* fix doc and API.spec test=develop

47af618f

18 9月, 2019 6 次提交
- Z
  
  remove some flags and add comments to some flags, test=develop (#19813) · 13ca364c
  由 Zeng Jinle 提交于 9月 18, 2019
  
  13ca364c
- H
  
  Return correct currrent block of a var (#19850) · 3e1e1fee
  由 Huihuang Zheng 提交于 9月 18, 2019
  
  3e1e1fee
- 1
  add retry function to try to solve grpc error code 14 (#19661) · 1bc285a5
  由 123malin 提交于 9月 18, 2019
```
* rpc retry for asycsend/get/prefetch

* test=develop, change retry vlog level to 3

* test=develop, set default grpc_retry_times is 3
```
  1bc285a5
- B
  Support dispensable student_loss in PaddleSlim distillation (#19824) · e2c6bada
  由 Bai Yifan 提交于 9月 18, 2019
```
* support_dispensable_student_loss, test=develop

* add distillation test, test=develop

* fix distillation test non convergence problem, test=develop

* fix test_distillation fail problem, test=develop
```
  e2c6bada
- L
  
  fix_roi_transform_bug (#19785) · 6d72a86b
  由 LielinJiang 提交于 9月 18, 2019
  
  6d72a86b
- Z
  [Bug fix] Disable memory reuse on feeded variables (#19835) · db26de83
  由 Zeng Jinle 提交于 9月 18, 2019
```
* fix memory reuse bug on feeding variables, test=develop

* add comments to reference count members, test=develop
```
  db26de83
17 9月, 2019 12 次提交

C
add deformable conv v1 op and cpu version of deformable conv v2 (#18500) · 00efd1d8
由 chengjuntao 提交于 9月 17, 2019
```
* add deformable conv v1 op, test=develop
```
00efd1d8
C
Add fp16 support for dygraph (#19828) · b99fc38c
由 chengduo 提交于 9月 17, 2019
```
* Add fp16 support for dygraph
test=develop

* Add unit test
test=develop
```
b99fc38c

Enhance OpTest to support double grad inplace check (#19826) · 5fbf03d6

由 Leo Chen 提交于 9月 17, 2019

* update OpTest to support double grad inplace check, test=develop

* keep consistency of _calc_output function, test=develop

5fbf03d6

fix pow op, support tensor for agument factor. (#19313) · 677e7144

由 liym27 提交于 9月 17, 2019

improve pow op according to reviews:
1. Delete unnecessary judgement statements in PowGradOpDescMaker;
2. Improve test of test_api;

overload GetKernelTypeForVar

add stop_gradient=True when attr(factor) is tensor Variable, change examples in API pow.
test=develop,test=document_preview

677e7144

add tensor support for argument shape in reshape op; (#19268) · bd89a273

由 liym27 提交于 9月 17, 2019

add support parameter inference when argument shape is a list containing integer and tensor variable;
test=develop

fix reshape op according to reviews:
1. improve or message;
2. improve test of test_api.
test=develop,test=document_preview

fix reshape op: Add error message in nn.py, test=develop

add stop_gradient=True when attr(shape) is tensor Variable.
change examples in API reshape.
test=develop,test=document_preview

bd89a273

add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208) · 88628016

由 liym27 提交于 9月 17, 2019

add support parameter inference when arguments starts or ends is a list containing integer and tensor variable;
test=develop,test=document_preview

improve slice op according to review(from hongyu). test=develop

fix slice op according to review: infer_flags, test=develop

fix slice op: improve overload operator __getitem__ to support attrs(starts and ends) are Variable.
test=develop,test=document_preview

fix test_slice_op: add TestSliceOp_decs_dim_6 to resolve conflict with test_slice_ngraph_op. test=develop

add stop_gradient=True when attr(starts) or attr(ends) is tensor Variable.
test=develop,test=document_preview

88628016

fix expand op: (#19302) · e9e3c087

由 liym27 提交于 9月 17, 2019

1. add tensor support for argument expand_times in expand op;
2. add support parameter inference when argument expand_times is a list containing integer and tensor variable;

improve expand op according to reviews:
1. add doc of ExpandTimes in expand_op.cc;
2. improve the test of test_api.

add stop_gradient=True when attr(expand_times) is tensor Variable, change code examples.
test=develop,test=document_preview

e9e3c087

X
support preload thread, optimize hdfs log, fix master+patch bug (#19695) · 6bf298bf
由 xujiaqi01 提交于 9月 17, 2019
```
* support preload thread
* sleep before fleet wrapper exit for pslib core dump
* optimize hdfs log
* fix master+patch bug
```
6bf298bf

Feature/add transform data dygraph (#19707) · cc311bdf

由 Jiabin Yang 提交于 9月 17, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* add transform_data to dygraph

* test=develop, refoctor name to make it easier to understand

* test=develop, refoctor name to make it easier to understand

* add test and change input to const ref for safety

* test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ

* add ut for data transform

* refine ut for data_transform

* test=develop, fix ut failed on parallel se-resnext

* test=develop, change one more PADDLE_ENFORCE

* add test_tracer on multiple devices

* test=develop, change place to mutable for data transform

* test=develop, add transform data on same place test and remove useless log

* test=develop, Add to do for data layout and and ut for conv2d with no bias

cc311bdf

L
cpu Conv double grad (#19672) · b76343c3
由 lvmengsi 提交于 9月 17, 2019
```
* cpu conv_grad_grad
```
b76343c3

翟

Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770) · 93c85c93

由翟飞跃提交于 9月 17, 2019

* Implement the operator with sprase matrix multiply

* Update the URL of mklml library.

test=develop

* Disable MKLML implematation when using no-linux.

test=develop

* optimize bp with mkl sparse matrix
test=develop

* tmp add fused_emb_seq layer

* Add the support of padding_idx attribute.

test=develop

* add padding_idx support
test=develop

* implement grad refer lego
test=develop

93c85c93

C
Fix example error of Variable and Operator (#19821) · 2729c174
由 chengduo 提交于 9月 17, 2019
```
* fix example error
test=develop

* Remove set_desc
test=develop
```
2729c174

16 9月, 2019 4 次提交
- R
  add unittest for square error cost op (#19746) · a0e9b7b9
  由 ruri 提交于 9月 16, 2019
```
* add unit test for square error cost op
```
  a0e9b7b9
- Z
  add kernel for squeeze_op, test=develop (#19656) · 52673956
  由 zhongpu 提交于 9月 16, 2019
```
* add kernel for squeeze_op, test=develop

* delete comment, test=develop
```
  52673956
- C
  
  Add prune_backward function to cover complicated test_program.clone situation (#19772) · 00d5375e
  由 Chen Weihang 提交于 9月 16, 2019
  
  00d5375e
- T
  fix sync_with_distributed_lookup_table, test=develop (#19737) · 6a1db204
  由 tangwei12 提交于 9月 16, 2019
```
fix wrong place with distributed_lookup_table
```
  6a1db204
12 9月, 2019 2 次提交
- A
  Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716) · 8c7e4119
  由 Aurelius84 提交于 9月 12, 2019
```
* add one_hot_v2_op to remove last_dims==1 test=develop

* add api unittest code for CI_Coverage test=develop

* improve CI_Coverage rate by adding test_with_depth test=develop
```
  8c7e4119
- J
  
  modify activation op API, delete use_cudnn args, test=develop, (#19758) · e352467c
  由 JesseyXujin 提交于 9月 12, 2019
  
  e352467c
11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功