提交 · 745385732457656ab3517f9427315b98b13676cf · BaiXuePrincess / Paddle

04 7月, 2019 2 次提交
- C
  
  Make fuse_all_reduce_op_pass support mix_precision (#17652) · 74538573
  由 chengduo 提交于 7月 04, 2019
  
  74538573
- C
  Enhance execution error info (#18482) · 55baeced
  由 chengduo 提交于 7月 04, 2019
```
* enhance execution error info
test=develop
```
  55baeced
03 7月, 2019 8 次提交
- P
  Nan debugger init (#18401) · e9c7e218
  由 pkpk 提交于 7月 03, 2019
```
test=develop
```
  e9c7e218
- Z
  
  support Tensor input for edit_distance op (#18162) · 7c6f2350
  由 zhoukunsheng 提交于 7月 03, 2019
  
  7c6f2350
- Z
  support Tensor input for chunk_eval op (#18226) · 26318544
  由 zhoukunsheng 提交于 7月 03, 2019
```
* test=develop
support Tensor input for chunk_eval op

* test=develop
fix testcase for chunk_eval op

* test=develop
fix typos in nn.py
```
  26318544
- Z
  
  add unique kernel and op (#17557) · 206c44e2
  由 zhoukunsheng 提交于 7月 03, 2019
  
  206c44e2
- Z
  
  upgrade hash op to support Tensor and LoDTensor input (#17998) · 71af72b1
  由 zhoukunsheng 提交于 7月 03, 2019
  
  71af72b1
- Z
  
  add ones_like op (#17388) · d3b3443d
  由 zhoukunsheng 提交于 7月 03, 2019
  
  d3b3443d
- Z
  
  add size op (#17412) · 67b48d7f
  由 zhoukunsheng 提交于 7月 03, 2019
  
  67b48d7f
- H
  Refactor for Pipeline Thread Check (#18459) · 6e0df310
  由 hutuxian 提交于 7月 03, 2019
```
move the thread-check code from train_from_dataset to a single function
add UT for the thread check function
```
  6e0df310
02 7月, 2019 5 次提交

Z

add friendly error msg to py_reader (#18316) · 41ab76e5
由 Zeng Jinle 提交于 7月 02, 2019

41ab76e5
K

fix load attr error. test=develop (#18447) · 823ab5e8
由 Kaipeng Deng 提交于 7月 02, 2019

823ab5e8

supports collective training with programs (#18392) · a873fa84

由 Yi Liu 提交于 7月 02, 2019

1. Since allreduce op has 4 reduce types, We split these four reduce types into four ops
2. We also refined the collective op code, e.g. we separated the collective op kernel into CPUKernel and CUDAKernel, and remove the device specified DeviceContext parameter in template as we already knew the target DeviceContext
3. We remove the newly added Collective op role to reduce the complexity of program and graph analysis

a873fa84

G
make fleet support mpi job submit directly (#18441) · 357311fd
由 guru4elephant 提交于 7月 02, 2019
```
make fleet support mpi job submit directly.
```
357311fd
C
Add find_no_grad_vars in backward.py (#17942) · e0d8c6ac
由 chengduo 提交于 7月 02, 2019
```
* add not_been_used_vars to no_grad_set
test=develop
```
e0d8c6ac

01 7月, 2019 4 次提交
- L
  Make roi_perspective_transform op return mask and transform matrix (#18371) · 449c7a9f
  由 LielinJiang 提交于 7月 01, 2019
```
* modify roi_perspective_transform_op to output mask and transform matrix

* modify comment

* modify comment

* modify API.spec

* update API.spec

* remove no use header, test=develop

* resolve conflict
```
  449c7a9f
- T
  fix mac ci random fail (#18430) · a3bc804f
  由 tensor-tang 提交于 7月 01, 2019
```
* fix mac ci random fail
* use platform instead
```
  a3bc804f
- X
  replace mnist dataset url, test=develop (#18429) · dd3f9d19
  由 xiaoting 提交于 7月 01, 2019
```
replace mnist dataset url
```
  dd3f9d19
- X
  
  add "import paddle.fluid as fluid" to examples lack of it · 47e2ef38
  由 xsrobin 提交于 7月 01, 2019
  
  47e2ef38
30 6月, 2019 1 次提交
- H
  update api format (#18413) · 8a39e5c1
  由 hutuxian 提交于 6月 30, 2019
```
* update api format
test=develop

* update API.spec
test=develop
```
  8a39e5c1
29 6月, 2019 1 次提交
- T
  fix py-cpuinfo mac random fail (#18383) · ce7a024c
  由 tensor-tang 提交于 6月 29, 2019
```
* fix py-cpuinfo mac random fail
* differentiate version on windows
```
  ce7a024c
28 6月, 2019 5 次提交
- J
  init custom black white list (#18377) · 2b4ef509
  由 Jie Fang 提交于 6月 28, 2019
```
test=develop
```
  2b4ef509
- G
  add MultiSlotStringDataGenerator for speedup of string based user inp… (#18390) · e83f902b
  由 guru4elephant 提交于 6月 28, 2019
```
* add MultiSlotStringDataGenerator for speedup of string based user input data
```
  e83f902b
- J
  Fix/program doc (#17908) · 43f64a17
  由 Jiabin Yang 提交于 6月 28, 2019
```
* test=develop, add some comments for Program.clone

* test=develop, add API.spec

* test=develop, refine comments

* refine Program doc and clone doc

* test=develop, refine doc
```
  43f64a17
- C
  Add is_compiled_with_cuda (#18356) · 871cc15e
  由 chengduo 提交于 6月 28, 2019
```
*  add cuda_is_available
test=develop

* Fix api.spec
test=develop

* fix api doc
test=develop
```
  871cc15e
- W
  Call the test_slim_int8_* tests through absolute path (#18386) · 8ed819d8
  由 Wojciech Uss 提交于 6月 28, 2019
```
test=develop
```
  8ed819d8
27 6月, 2019 6 次提交

L
Fix dygraph show style (#18297) · fd6631ef
由 lujun 提交于 6月 27, 2019
```
Fix dygraph show style for FluidDoc.
```
fd6631ef
翟

Remove all the code, API and doc of MKL-DNN INT8v1 (#18347) · 19da59ed
由翟飞跃提交于 6月 27, 2019

19da59ed

Fix Bug-prone code of PE (#18354) · 8ed33bf9

由 chengduo 提交于 6月 27, 2019

* update pe reduce config
test=develop

*  drop the local_exe_scopes of the previous parallel_executor
test=develop

8ed33bf9

T
fix communicator with pyreader (#18350) · 999d9a59
由 tangwei12 提交于 6月 27, 2019
```
* add is_runnning in communicator, test=develop
```
999d9a59

add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad

由 kh2se2013 提交于 6月 27, 2019

* add WITH_COVERAGE option, default OFF

test=develop

* add coverage for python sdk

test=develop

* fix code style

* fix COVERAGE_FILE path

test=develop

* remove coverage package

test=develop

* test = develop, run coverage as module

27fb9cad

supports collective communicated training (#18175) · b7128bac

由 HaoRen 提交于 6月 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* fix comment
test=develop

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* marcro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dumple  py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

26 6月, 2019 6 次提交
- Q
  Simplify multi_box_head API in detection.py and remove assign op. (#18310) · 9047ac68
  由 qingqing01 提交于 6月 26, 2019
```
* Simplify multi_box_head API in detection.py and remove assign op.
```
  9047ac68
- H
  
  add ut for pipeline training (#18289) · e42057cd
  由 hutuxian 提交于 6月 26, 2019
  
  e42057cd
- J
  
  test=develop, recover ocr ut on dygraph (#18166) · bd61d899
  由 Jiabin Yang 提交于 6月 26, 2019
  
  bd61d899
- Y
  Update lamb optimizer (#18333) · 23941e43
  由 Yibing Liu 提交于 6月 26, 2019
```
* Update lamb optimizer

test=develop, test=document_preview

* Regenerate api spec

test=develop, test=document_preview
```
  23941e43
- W
  Fix checkpoint of Light-NAS (#18330) · 1bdfd2eb
  由 whs 提交于 6月 26, 2019
```
Socket can't be pickled.
test=develop
```
  1bdfd2eb
- J
  
  test=develop, disable basic gru related ut (#18329) · 79bcdbbf
  由 Jiabin Yang 提交于 6月 26, 2019
  
  79bcdbbf
25 6月, 2019 2 次提交
- J
  Add install check for multigpu (#18323) · 831a3e62
  由 Jiabin Yang 提交于 6月 25, 2019
```
* test=develop, add_install_check_for_multigpu

* test=develop, refine code to use cuda_devices
```
  831a3e62
- Z
  
  fix lod_tensor.py grammar error, test=develop (#18308) · f88e07a0
  由 Zeng Jinle 提交于 6月 25, 2019
  
  f88e07a0

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致