提交 · 2bb296dfe9fd16f14f16bf9d5822ff7384680619 · Crayon鑫 / Paddle

31 7月, 2019 1 次提交
- D
  make dist unit test exclusive run (#18865) · 2bb296df
  由 Dong Daxiang 提交于 7月 31, 2019
```
make dist unit test exclusive run
```
  2bb296df
30 7月, 2019 1 次提交
- C
  add CPUInplaceTestWithFuseOptimizationOps (#18867) · ecd2bdad
  由 chengduo 提交于 7月 30, 2019
```
test=develop
```
  ecd2bdad
28 7月, 2019 1 次提交
- Z
  
  fix affine_channel no_need buffer bug, test=develop (#18844) · 9a8a7a1d
  由 Zeng Jinle 提交于 7月 28, 2019
  
  9a8a7a1d
26 7月, 2019 1 次提交

Feature/mem opt pass refactor (#18735) · a802da65

由 Zeng Jinle 提交于 7月 26, 2019

* first version memory optimize pass, test=develop

* remove move_tensor_sharing_pass, test=develop

* refine code comments, add unittests, test=develop

* turn off memory_optimize by default, test=develop

* follow huihuang's comments, test=develop

* follow chengduoZH's comments, test=develop

* fix grammar error, add const qualifier, fix pass_test exception message, test=develop

* follow chengduoZH's comments 2nd, test=develop

a802da65

25 7月, 2019 1 次提交
- G
  split test_dist_se_resnext.py into 4 testcases (#18743) · 2efb282c
  由 guru4elephant 提交于 7月 25, 2019
```
* split test_dist_se_resnext.py into 4 testcases
```
  2efb282c
24 7月, 2019 1 次提交

Extend Matmul to support matrix multiplication with multiple heads (#18570) · 220eef60

由 Bob Zhu 提交于 7月 24, 2019

* extend matmul op to support multiple head multiplication

With the support of multiple head, the multiplication of two big matrixes is
split into multiplication of several (head_number) small matrixes. e.g. if
Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number
as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of
[6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].

220eef60

22 7月, 2019 1 次提交
- G
  split different comm method for mnist distributed training (#18715) · ebf9797e
  由 guru4elephant 提交于 7月 22, 2019
```
* split different comm method for mnist distributed training
```
  ebf9797e
18 7月, 2019 1 次提交

Feature/auto_growth_allocator (#18561) · ae58afc5

由 Zeng Jinle 提交于 7月 18, 2019

* feature/auto_growth_allocator, test=develop

* add unittest of AlignedAllocator, test=develop

* try to turn on auto_growth to test on CI, test=develop

* fix segmentation fault in mixed_vector.h, test=develop

* add unittests, test=develop

ae58afc5

15 7月, 2019 1 次提交
- G
  increase timeout again (#18628) · b71b4543
  由 guru4elephant 提交于 7月 15, 2019
```
test=develop
```
  b71b4543
12 7月, 2019 1 次提交
- K
  1）change to parallel mode on python coverage run (#18594) · 9ad57f2d
  由 kh2se2013 提交于 7月 12, 2019
```
2）add pip install coverage in Dockerfile.tmp
test=develop
```
  9ad57f2d
11 7月, 2019 1 次提交

Feature/buffer_shared_inplace (#17911) · d3003a16

由 Zeng Jinle 提交于 7月 11, 2019

* feature/buffer_shared_inplace, test=develop

* refine code, test=develop

* fix elementwise_add op cpu inplace and sum inplace bug, test=develop

* add unittest and debug log, test=develop

* fix parallel_executor scope bug, polish code, test=develop

* fix sum op, activation op, single_in_place_inference bug, test=develop

* remove kLocalExecScopeName, test=develop

* fix unittest,test=develop

* fix out_var first version bug, test=develop

* follow comments,test=develop

d3003a16

27 6月, 2019 2 次提交

add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad

由 kh2se2013 提交于 6月 27, 2019

* add WITH_COVERAGE option, default OFF

test=develop

* add coverage for python sdk

test=develop

* fix code style

* fix COVERAGE_FILE path

test=develop

* remove coverage package

test=develop

* test = develop, run coverage as module

27fb9cad

supports collective communicated training (#18175) · b7128bac

由 HaoRen 提交于 6月 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* fix comment
test=develop

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* marcro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dumple  py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

26 6月, 2019 3 次提交
- H
  
  add ut for pipeline training (#18289) · e42057cd
  由 hutuxian 提交于 6月 26, 2019
  
  e42057cd
- J
  
  test=develop, recover ocr ut on dygraph (#18166) · bd61d899
  由 Jiabin Yang 提交于 6月 26, 2019
  
  bd61d899
- J
  
  test=develop, disable basic gru related ut (#18329) · 79bcdbbf
  由 Jiabin Yang 提交于 6月 26, 2019
  
  79bcdbbf
25 6月, 2019 1 次提交

Sequence mask support tensor (#18249) · df2eee71

由 Hongyu Liu 提交于 6月 25, 2019

* sequnce mask support max length tensor input; test=develop

* add rnn_impl.py; test=develop

* add basic gru lstm unittest; test=develop

* fix api spec; test=develop

* fix sequence_mask op bug;
test=develop
test=document_preview

* change +-*x to elmentwise_op; test=develop

* add mkl flag; test=develop

* fix rnn impl bug; test=develop

* update api spec; test=develop

* fix doc bug; test=develop

* fix lstm bugs; test=develop

df2eee71

21 6月, 2019 1 次提交
- G
  add more print function for timeout issue, make timeout value larger (#18219) · 7d76e34e
  由 guru4elephant 提交于 6月 21, 2019
```
* add more print function for timeout issue, make timeout value larger
```
  7d76e34e
20 6月, 2019 1 次提交
- J
  test=develop, fix test_imperative_transformer and ocr (#18127) · b3cbc5be
  由 Jiabin Yang 提交于 6月 20, 2019
```
* test=develop, fix test_imperative_transformer and ocr

* test=develop, remove ocr recovery part
```
  b3cbc5be
19 6月, 2019 2 次提交
- Q
  
  disable test_async_ssa_graph_executor_mnist test=develop (#18165) · 778f6acf
  由 Qiao Longfei 提交于 6月 19, 2019
  
  778f6acf
- J
  test=develop, add add_multi_gpu_install_check (#18157) · 991c94f1
  由 Jiabin Yang 提交于 6月 19, 2019
```
* test=develop, add add_multi_gpu_install_check

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, refine warning doc

* test=develop, support multi cpu
```
  991c94f1
18 6月, 2019 1 次提交
- C
  Remove nccl dep when the number of GPU is 1 (#18158) · 4978db2c
  由 chengduo 提交于 6月 18, 2019
```
* remove nccl dep when the number of GPU is 1
test=develop
```
  4978db2c
12 6月, 2019 1 次提交
- T
  fix save/load in fleet (#17675) · 101f74cb
  由 tangwei12 提交于 6月 12, 2019
```
* fix save/load in Fleet
* add UT framework of Fleet
```
  101f74cb
06 6月, 2019 3 次提交
- G
  
  Add backward and optimizer operator dependency pass. (#17746) · fbbdc9cc
  由 gongweibao 提交于 6月 06, 2019
  
  fbbdc9cc
- H
  SERIAL flaky imperative unit tests for CI cuda9 (#17892) · 83e51ded
  由 Huihuang Zheng 提交于 6月 06, 2019
```
test=develop
```
  83e51ded
- G
  
  Fine tuning launch.py (#17223) · 6a1df469
  由 gongweibao 提交于 6月 06, 2019
  
  6a1df469
31 5月, 2019 1 次提交

Split the unittest test_dist_mmist into multiple unittests (test_dist_mnist,... · bfcc97d9

由 lilong12 提交于 5月 31, 2019

Split the unittest test_dist_mmist into multiple unittests (test_dist_mnist, test_dist_mnist_nccl and test_dist_mnist_lars) to avoid timeout (#17707)

bfcc97d9

30 5月, 2019 2 次提交
- H
  
  remove ocr unit test; test=develop (#17755) · 552f8395
  由 Hongyu Liu 提交于 5月 30, 2019
  
  552f8395
- H
  
  fix ocr; test=develop (#17751) · 0a02451e
  由 Hongyu Liu 提交于 5月 30, 2019
  
  0a02451e
29 5月, 2019 1 次提交

test=develop, add ocr in dygraph test (#17470) · 33a791dd

由 Jiabin Yang 提交于 5月 29, 2019

* test=develop, add ocr in dygraph test

* test=develop, add cudnn determinist

* test=develop, remove useless code

* test=develop, fix cmake error

33a791dd

23 5月, 2019 1 次提交
- J
  
  test=develop, fix test_imperative_resnet failed on CI (#17583) · 3ee3611a
  由 Jiabin Yang 提交于 5月 23, 2019
  
  3ee3611a
21 5月, 2019 1 次提交
- T
  remove unused SERIAL compiler option (#17500) · 3d19f44a
  由 Tao Luo 提交于 5月 21, 2019
```
test=develop
```
  3d19f44a
13 5月, 2019 1 次提交

test=develop, add gradient sort backward strategy (#17125) · 4624d7c6

由 Jiabin Yang 提交于 5月 13, 2019

* test=develop, add gradient sort backward strategy

* test=develop, fix test by add FLAGS_cudnn_deterministic on new tests

4624d7c6

07 5月, 2019 1 次提交
- T
  remove unused FLAGS_warpctc_dir (#17162) · ff1661f1
  由 Tao Luo 提交于 5月 07, 2019
```
* remove unused FLAGS_warpctc_dir

test=develop

* remove FLAGS_warpctc_dir

test=develop
```
  ff1661f1
05 5月, 2019 2 次提交
- W
  
  use two GPUs to run the exclusive test test=develop (#17187) · 83c4f772
  由 wopeizl 提交于 5月 05, 2019
  
  83c4f772
- T
  Modify test timeout (#17181) · 8092c405
  由 tianshuo78520a 提交于 5月 05, 2019
```
* test=develop

* test=deelop
```
  8092c405
25 4月, 2019 2 次提交
- Y
  ParallelDyGraph with GPU collective mode (#16827) · 0b07eef1
  由 Yan Xu 提交于 4月 25, 2019
```
implement dygraph.parallel.DataParallel to hook reduce op.
```
  0b07eef1
- T
  
  remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066) · e707119a
  由 tangwei12 提交于 4月 25, 2019
  
  e707119a
23 4月, 2019 1 次提交

Support backward of backward for Relu and add a new gradient checker by... · c1c2633a

由 qingqing01 提交于 4月 23, 2019

Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)

* Support backward of backward and a new gradient checker
* Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package.

1. Add ReluDoubleGradMaker when register relu_grad.
2. Add a new gradient checker by comparing theoretical and numerical Jacobian.  Check double gradients by double_grad_check.

c1c2633a

22 4月, 2019 1 次提交

Move gc test to each test of op (#16999) · f188b370

由 Zeng Jinle 提交于 4月 22, 2019

* move gc test to op_test
test=develop

* Revert "move gc test to op_test"

This reverts commit cf15da65.

* enable gc test in some ops
test=develop

f188b370

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致