提交 · c0a82748cf6144463977c5c12965ca5fd7708c0c · BaiXuePrincess / Paddle

11 7月, 2019 2 次提交

G

Polish backwards optimizer dependency codes and use more default values. (#18255) · c0a82748
由 gongweibao 提交于 7月 11, 2019

c0a82748

Feature/buffer_shared_inplace (#17911) · d3003a16

由 Zeng Jinle 提交于 7月 11, 2019

* feature/buffer_shared_inplace, test=develop

* refine code, test=develop

* fix elementwise_add op cpu inplace and sum inplace bug, test=develop

* add unittest and debug log, test=develop

* fix parallel_executor scope bug, polish code, test=develop

* fix sum op, activation op, single_in_place_inference bug, test=develop

* remove kLocalExecScopeName, test=develop

* fix unittest,test=develop

* fix out_var first version bug, test=develop

* follow comments,test=develop

d3003a16

10 7月, 2019 1 次提交
- L
  update dygraph api doc for web (#18550) · b6d5c74f
  由 lujun 提交于 7月 10, 2019
```
remove dygraph.enable from __all__
hidden dygraph. profiler
add doc to dygraph. no_grad
```
  b6d5c74f
09 7月, 2019 2 次提交
- P
  
  Add mkldnn int8 mul-op kernel (#17834) · 0caa08ea
  由 Physher 提交于 7月 09, 2019
  
  0caa08ea
- L
  Fix roi_perspective_transform_op bug (#18522) · 24d1c44a
  由 LielinJiang 提交于 7月 09, 2019
```
* fix transform matrix bug, test=develop

* modify API.spec
```
  24d1c44a
05 7月, 2019 3 次提交

Fix topk cannot handle 1D vector bug (#18466) · 832d8191

由 zhaoyuchen2018 提交于 7月 05, 2019

* Fix topk cannot handle 1D vector bug

Add path to handle 1D vector

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

832d8191

Hide no support (#18515) · 7586cdd5

由 Jiabin Yang 提交于 7月 05, 2019

* test=develop, fix docker with paddle nccl problem

* test=develop, hide no_support api and add ut for it

7586cdd5

Add distributions of normal and uniform (#18023) · 43e17c79

由 LielinJiang 提交于 7月 05, 2019

* add_distributions_of_normal_and_uniform

* paddle/fluid/API.spec

* modify API.spec

* modified paddle/fluid/API.spec, test=develop

* modify paddle/fluid/API.spec, test=develop

* modify paddle/fluid/API.spec, test=develop

* fix some comment, test=develop

* modify API.spec, test=develop

* add comment for init function, modify hard code, test=develop

* modify API.spec, test=develop

* modify API.spec, test=develop

* make unit test function shorter, test=develop

* modify paddle/fluid/API.spec

43e17c79

04 7月, 2019 2 次提交
- Q
  Enhance linear_lr_warmup (#18463) · 602cb6a5
  由 qingqing01 提交于 7月 04, 2019
```
* make it support float/int learning as input.
```
  602cb6a5
- C
  
  Make fuse_all_reduce_op_pass support mix_precision (#17652) · 74538573
  由 chengduo 提交于 7月 04, 2019
  
  74538573
03 7月, 2019 7 次提交
- Z
  
  support Tensor input for edit_distance op (#18162) · 7c6f2350
  由 zhoukunsheng 提交于 7月 03, 2019
  
  7c6f2350
- Z
  support Tensor input for chunk_eval op (#18226) · 26318544
  由 zhoukunsheng 提交于 7月 03, 2019
```
* test=develop
support Tensor input for chunk_eval op

* test=develop
fix testcase for chunk_eval op

* test=develop
fix typos in nn.py
```
  26318544
- Z
  
  add unique kernel and op (#17557) · 206c44e2
  由 zhoukunsheng 提交于 7月 03, 2019
  
  206c44e2
- Z
  
  upgrade hash op to support Tensor and LoDTensor input (#17998) · 71af72b1
  由 zhoukunsheng 提交于 7月 03, 2019
  
  71af72b1
- Z
  
  add ones_like op (#17388) · d3b3443d
  由 zhoukunsheng 提交于 7月 03, 2019
  
  d3b3443d
- Z
  
  add size op (#17412) · 67b48d7f
  由 zhoukunsheng 提交于 7月 03, 2019
  
  67b48d7f
- H
  Refactor for Pipeline Thread Check (#18459) · 6e0df310
  由 hutuxian 提交于 7月 03, 2019
```
move the thread-check code from train_from_dataset to a single function
add UT for the thread check function
```
  6e0df310
02 7月, 2019 2 次提交

supports collective training with programs (#18392) · a873fa84

由 Yi Liu 提交于 7月 02, 2019

1. Since allreduce op has 4 reduce types, We split these four reduce types into four ops
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and remove the device specified DeviceContext parameter in template as we already knew the target DeviceContext
3. We remove the newly added Collective op role to reduce the complexity of program and graph analysis

a873fa84

C
Add find_no_grad_vars in backward.py (#17942) · e0d8c6ac
由 chengduo 提交于 7月 02, 2019
```
* add not_been_used_vars to no_grad_set
test=develop
```
e0d8c6ac

01 7月, 2019 1 次提交

Make roi_perspective_transform op return mask and transform matrix (#18371) · 449c7a9f

由 LielinJiang 提交于 7月 01, 2019

* modify roi_perspective_transform_op to output mask and transform matrix

* modify comment

* modify comment

* modify API.spec

* update API.spec

* remove no use header, test=develop

* resolve conflict

449c7a9f

27 6月, 2019 2 次提交

add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad

由 kh2se2013 提交于 6月 27, 2019

* add WITH_COVERAGE option, default OFF

test=develop

* add coverage for python sdk

test=develop

* fix code style

* fix COVERAGE_FILE path

test=develop

* remove coverage package

test=develop

* test = develop, run coverage as module

27fb9cad

supports collective communicated training (#18175) · b7128bac

由 HaoRen 提交于 6月 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* fix comment
test=develop

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* marcro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dumple  py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

26 6月, 2019 4 次提交
- H
  
  add ut for pipeline training (#18289) · e42057cd
  由 hutuxian 提交于 6月 26, 2019
  
  e42057cd
- J
  
  test=develop, recover ocr ut on dygraph (#18166) · bd61d899
  由 Jiabin Yang 提交于 6月 26, 2019
  
  bd61d899
- Y
  Update lamb optimizer (#18333) · 23941e43
  由 Yibing Liu 提交于 6月 26, 2019
```
* Update lamb optimizer

test=develop, test=document_preview

* Regenerate api spec

test=develop, test=document_preview
```
  23941e43
- J
  
  test=develop, disable basic gru related ut (#18329) · 79bcdbbf
  由 Jiabin Yang 提交于 6月 26, 2019
  
  79bcdbbf
25 6月, 2019 1 次提交

Sequence mask support tensor (#18249) · df2eee71

由 Hongyu Liu 提交于 6月 25, 2019

* sequnce mask support max length tensor input; test=develop

* add rnn_impl.py; test=develop

* add basic gru lstm unittest; test=develop

* fix api spec; test=develop

* fix sequence_mask op bug;
test=develop
test=document_preview

* change +-*x to elmentwise_op; test=develop

* add mkl flag; test=develop

* fix rnn impl bug; test=develop

* update api spec; test=develop

* fix doc bug; test=develop

* fix lstm bugs; test=develop

df2eee71

23 6月, 2019 1 次提交
- C
  add random seed for recurrent op test (#18274) · d54e13bb
  由 chengduo 提交于 6月 23, 2019
```
test=develop
```
  d54e13bb
21 6月, 2019 2 次提交
- X
  set src_idx > 0 for bilinear_interp_op (#18238) · b58bb802
  由 xiaoting 提交于 6月 21, 2019
```
* set src_idx > 0, test=develop

* add unittest and cu, test=develop
```
  b58bb802
- G
  add more print function for timeout issue, make timeout value larger (#18219) · 7d76e34e
  由 guru4elephant 提交于 6月 21, 2019
```
* add more print function for timeout issue, make timeout value larger
```
  7d76e34e
20 6月, 2019 3 次提交

Fix create_lod_tensor (#18196) · ec970f12

由 Zeng Jinle 提交于 6月 20, 2019

* fix_create_lod_tensor, test=develop

* remove program_guard import,test=develop

* fix windows numpy default int32 error, test=develop

ec970f12

Fix slice op shape=-1 bug (#18107) · cefd0fb5

由 Hongyu Liu 提交于 6月 20, 2019

* fix slice op bug; test=develop

* fix variabel test bug; test=develop

* remove slice while true; test=develop

cefd0fb5

J
test=develop, fix test_imperative_transformer and ocr (#18127) · b3cbc5be
由 Jiabin Yang 提交于 6月 20, 2019
```
* test=develop, fix test_imperative_transformer and ocr

* test=develop, remove ocr recovery part
```
b3cbc5be

19 6月, 2019 3 次提交
- Q
  
  disable test_async_ssa_graph_executor_mnist test=develop (#18165) · 778f6acf
  由 Qiao Longfei 提交于 6月 19, 2019
  
  778f6acf
- 翟
  fix spelling errors (#17941) · 802ea509
  由翟飞跃提交于 6月 19, 2019
```
* fix spelling errors; test=develop

* Update API.spec

update md5

* Update API.spec

* change the order of api;test=develop
```
  802ea509
- J
  test=develop, add add_multi_gpu_install_check (#18157) · 991c94f1
  由 Jiabin Yang 提交于 6月 19, 2019
```
* test=develop, add add_multi_gpu_install_check

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, support multi cpu
```
  991c94f1
18 6月, 2019 1 次提交
- C
  Remove nccl dep when the number of GPU is 1 (#18158) · 4978db2c
  由 chengduo 提交于 6月 18, 2019
```
* remove nccl dep when the number of GPU is 1
test=develop
```
  4978db2c
17 6月, 2019 1 次提交
- Z
  Fix py_reader iterable bug (#18108) · 6eec66a1
  由 Zeng Jinle 提交于 6月 17, 2019
```
* fix py_reader iterable bug, test=develop

* move data from buffered_reader,test=develop
```
  6eec66a1
16 6月, 2019 2 次提交

Update backward appending stragety to support double backward and fix some bug. (#18104) · 80d2e66f

由 qingqing01 提交于 6月 16, 2019

* Update backward.py:
     - If there is no input grad var in all outputs of previous ops, do not append this op into graph.
     - Only apply this stragety when double backward.
* Update some double backward op.
* Update sum_op to judge whether a tensor is empty by numel or IsInitialized().

80d2e66f

add detection output operator for supporting retinanet (#17896) · ff83655f

由 FlyingQianMM 提交于 6月 16, 2019

* test=develop
add detection output for supporting retinanet

* test=develop
add test_layers.py

* test=develop
add API.spec

* test=develop
alter test_retinanet_detection_output.py

* test=develop
alter round 2

* test=develop
alter retinanet_detection_output

* test=develop
alter paddle/fluid/API.spec

* test=devlop
alter detection.py

* test=develop
alter retinanet_detection_output

* test=develop
alter paddle/fluid/API.spec

* test=develop
alter detection.py

* test=develop
alter API.spec

* test=develop
alter retinanet_detection_output

* test=develop
alter paddle/fluid/API.spec

* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py

* test=develop
alter python/paddle/fluid/tests/unittests/test_retinanet_detection_output.py

* test=develop
fix grammer error

* test=develop
fix grammer error

* test=develop
fix grammer error

* test=develop
alter python/paddle/fluid/tests/unittests/test_layers.py

* test=develop
alter paddle/fluid/API.spec

ff83655f

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致