Commits · 9931bc64f580cd42893b49bf73dde7a55f1ff221 · BaiXuePrincess / Paddle

27 Jun, 2019 5 commits

Remove all the code, API and doc of MKL-DNN INT8v1 (#18347) · 19da59ed
Committed by 翟飞跃 on Jun 27, 2019

19da59ed

Fix Bug-prone code of PE (#18354) · 8ed33bf9

Committed by chengduo on Jun 27, 2019

* update pe reduce config
test=develop

* drop the local_exe_scopes of the previous parallel_executor
test=develop

8ed33bf9

fix communicator with pyreader (#18350) · 999d9a59
Committed by tangwei12 on Jun 27, 2019
```
* add is_runnning in communicator, test=develop
```
999d9a59

add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad

Committed by kh2se2013 on Jun 27, 2019

* add WITH_COVERAGE option, default OFF

test=develop

* add coverage for python sdk

test=develop

* fix code style

* fix COVERAGE_FILE path

test=develop

* remove coverage package

test=develop

* test = develop, run coverage as module

27fb9cad

supports collective communicated training (#18175) · b7128bac

Committed by HaoRen on Jun 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runnable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* macro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dumple  py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

26 Jun, 2019 6 commits
- Simplify multi_box_head API in detection.py and remove assign op. (#18310) · 9047ac68
  Committed by qingqing01 on Jun 26, 2019
```
* Simplify multi_box_head API in detection.py and remove assign op.
```
  9047ac68
- add ut for pipeline training (#18289) · e42057cd
  Committed by hutuxian on Jun 26, 2019
  
  e42057cd
- test=develop, recover ocr ut on dygraph (#18166) · bd61d899
  Committed by Jiabin Yang on Jun 26, 2019
  
  bd61d899
- Update lamb optimizer (#18333) · 23941e43
  Committed by Yibing Liu on Jun 26, 2019
```
* Update lamb optimizer

test=develop, test=document_preview

* Regenerate api spec

test=develop, test=document_preview
```
  23941e43
- Fix checkpoint of Light-NAS (#18330) · 1bdfd2eb
  Committed by whs on Jun 26, 2019
```
Socket can't be pickled.
test=develop
```
  1bdfd2eb
- test=develop, disable basic gru related ut (#18329) · 79bcdbbf
  Committed by Jiabin Yang on Jun 26, 2019
  
  79bcdbbf
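
The Light-NAS checkpoint fix (#18330) in the list above gives "Socket can't be pickled" as its reason. The snippet below is a minimal, generic Python sketch of that failure mode and one common workaround; it is not the code from that commit. The `Controller` class and `roundtrip` helper are hypothetical names used only for illustration.

```python
import pickle
import socket


class Controller:
    """Hypothetical object that owns a socket alongside picklable state."""

    def __init__(self, step=0):
        self.step = step
        # A live socket cannot be serialized by pickle.
        self.sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)

    def __getstate__(self):
        # Copy the instance dict and drop the socket before pickling.
        state = self.__dict__.copy()
        state.pop("sock", None)
        return state

    def __setstate__(self, state):
        # Restore the plain state; the socket must be reopened by the caller.
        self.__dict__.update(state)
        self.sock = None


def roundtrip(obj):
    """Serialize and deserialize an object through pickle."""
    return pickle.loads(pickle.dumps(obj))
```

Excluding the socket in `__getstate__` keeps the rest of the object checkpointable; whoever restores the checkpoint is then responsible for reopening the connection.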
25 Jun, 2019 8 commits

Add install check for multigpu (#18323) · 831a3e62

Committed by Jiabin Yang on Jun 25, 2019

* test=develop, add_install_check_for_multigpu

* test=develop, refine code to use cuda_devices

831a3e62

fix lod_tensor.py grammar error, test=develop (#18308) · f88e07a0
Committed by Zeng Jinle on Jun 25, 2019

f88e07a0

Sequence mask support tensor (#18249) · df2eee71

Committed by Hongyu Liu on Jun 25, 2019

* sequence mask support max length tensor input; test=develop

* add rnn_impl.py; test=develop

* add basic gru lstm unittest; test=develop

* fix api spec; test=develop

* fix sequence_mask op bug;
test=develop
test=document_preview

* change +-*x to elmentwise_op; test=develop

* add mkl flag; test=develop

* fix rnn impl bug; test=develop

* update api spec; test=develop

* fix doc bug; test=develop

* fix lstm bugs; test=develop

df2eee71

test=develop, Revert "Add multi gpu install check" (#18313) · 9cb799be
Committed by Jiabin Yang on Jun 25, 2019
```
* Revert "Add multi gpu install check (#18229)"

This reverts commit 61ed06b2.

* test=develop, start ci
```
9cb799be

optimize communicator merge sparse gradient test=develop (#18159) · 0e08e91c

Committed by Qiao Longfei on Jun 25, 2019

* optimize communicator merge sparse gradient test=develop

* revert multithread selected rows merge add test=develop

* follow comment test=develop

0e08e91c

init black/white lists (#17847) · 172c2fac
Committed by Jie Fang on Jun 25, 2019
```
test=develop
```
172c2fac
Fix default value of fluid.memory_optimize (#18295) · e06c69c7
Committed by chengduo on Jun 25, 2019
```
* fix default value of fluid.memory_optimize
test=develop

* fix api.spec
test=develop
```
e06c69c7
fix split and sampled softmax (#18280) · 6978b2e4
Committed by Zhaolong Xing on Jun 25, 2019
```
test=develop
```
6978b2e4

24 Jun, 2019 2 commits
- add api desc for pipeline training (#18293) · 6ed73830
  Committed by hutuxian on Jun 24, 2019
  
  6ed73830
- improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs (#18261) · a736c03b
  Committed by liuwei1031 on Jun 24, 2019
```
* improve doc of lstm, sequence_enumerate, softmax_with_cross_entropy, space_to_depth APIs, test=develop

* update API.spec, test=develop
```
  a736c03b
23 Jun, 2019 4 commits
- add random seed for recurrent op test (#18274) · d54e13bb
  Committed by chengduo on Jun 23, 2019
```
test=develop
```
  d54e13bb
- improve the hint message of memory optimize, test=develop (#18260) · 4151d90c
  Committed by liuwei1031 on Jun 23, 2019
  
  4151d90c
- fix paddle cloud role maker bug (#18269) · ff399fd7
  Committed by guru4elephant on Jun 23, 2019
```
* fix paddle cloud role maker bug
```
  ff399fd7
- Fix ema's example & fp16 update (#18273) · 412951d7
  Committed by Yibing Liu on Jun 23, 2019
```
test=develop, test=document_preview
```
  412951d7
22 Jun, 2019 2 commits
- fix double buffer example (#18169) · fdf798f9
  Committed by flame on Jun 22, 2019
```
test=develop
test=document_preview
```
  fdf798f9
- fix api doc example, test=develop (#18266) · 23b8b18e
  Committed by Bai Yifan on Jun 22, 2019
  
  23b8b18e
21 Jun, 2019 13 commits

fix a bug in examples of metrics.Acc · cd9d57f5
Committed by pkpk on Jun 21, 2019

cd9d57f5
refine core cmake warning and print more info (#18248) · 68da8b2a
Committed by tensor-tang on Jun 21, 2019
```
* refine core cmake warning and print more info

test=develop

* fix comments

test=develop
```
68da8b2a
Add StaticRNN.output code example (#18251) · 32c95f17
Committed by zhaoyuchen2018 on Jun 21, 2019
```
refine StaticRNN api doc
test=develop
test=document_preview
```
32c95f17
fix yolo_box example,test=develop (#18247) · 2f0d6826
Committed by xiaoting on Jun 21, 2019

2f0d6826

fix some bug when merge sparse embedding parameters, test=develop (#18223) · 6b3d9625

Committed by songhao on Jun 21, 2019

1. fix the bug that out_put_var in SaveSelectedRows would be empty string
2. use merge_sparse_lookup_table to replace sum op for load_persistables_for_inference
3. fix the bug in _clone_var_in_block_ when the var is SELECTED_ROWS.

6b3d9625

dataset (#17973) · 3f8031e2

Committed by jiaqi on Jun 21, 2019

(1) use channel instead of vector/BlockingQueue in Dataset, to stay consistent with the existing implementation and make the code more readable and flexible (dataset single output channel or multi output channel). One previous memory out-of-limit problem was caused by not releasing memory after training.
(2) add Record because MultiSlotType costs too much memory (80B); this fixes the memory out-of-limit problem.
(3) add Channel, Archive in paddle/fluid/framework
(4) change dataset from shared_ptr to unique_ptr in pybind
(5) move create/destroy readers from trainer to dataset
(6) move shuffle from datafeed to dataset. dataset holds memory; datafeed only loads data and feeds it to the network.
(7) fix thread num bug of Dataset when filelist size < thread num
(8) support set_queue_num in InMemoryDataset

3f8031e2

improve the doc of DataFeeder and default_main_program (#18241) · 5d54ed4a
Committed by liuwei1031 on Jun 21, 2019
```
* improve the doc of DataFeeder and default_main_program

* update API.spec, test=develop
```
5d54ed4a
fix BilinearInitializer doc (#18242) · 4f3acb39
Committed by AIFollowers on Jun 21, 2019

4f3acb39
fix bug in Class MultiSlotDataGenerator's function _gen_str, test=develop (#18222) · 432fda51
Committed by songhao on Jun 21, 2019

432fda51

Add multi gpu install check (#18229) · 61ed06b2

Committed by Jiabin Yang on Jun 21, 2019

* test=develop, add add_multi_gpu_install_check

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, support multi cpu

* test=develop, find right num of cuda device

* test=develop, find right num of cuda device

* test=develop, fix multigpu processing and fix type bug in dygraph

* test=develop, fix multigpu processing and fix type bug in dygraph

61ed06b2

set src_idx > 0 for bilinear_interp_op (#18238) · b58bb802
Committed by xiaoting on Jun 21, 2019
```
* set src_idx > 0, test=develop

* add unittest and cu, test=develop
```
b58bb802
add more print function for timeout issue, make timeout value larger (#18219) · 7d76e34e
Committed by guru4elephant on Jun 21, 2019
```
* add more print function for timeout issue, make timeout value larger
```
7d76e34e
fix errors in python3 (#18239) · cf15c3ff
Committed by hutuxian on Jun 21, 2019
```
* fix relative import error in python3
* fix debug string info
```
cf15c3ff

BaiXuePrincess / Paddle is in sync with its fork source project
