提交 · 0313b98ae029a694ea7eb2c5b4f8195c6ffd830d · Crayon鑫 / Paddle

09 10月, 2019 1 次提交
- C
  temporally disable test_parallel_executor_fetch_feed in Windows CI (#20288) · 0313b98a
  由 chengduo 提交于 10月 09, 2019
```
test=develop
```
  0313b98a
30 9月, 2019 1 次提交
- C
  Add GEO-SGD distribute training algorithm (#20018) · 728ec1b4
  由 Chengmo 提交于 9月 30, 2019
```
* refector geo sgd & communicator
```
  728ec1b4
28 9月, 2019 1 次提交
- G
  
  change dist tests to serial test=develop (#20051) · d4bca811
  由 gongweibao 提交于 9月 28, 2019
  
  d4bca811
26 9月, 2019 1 次提交
- G
  
  Add `RUN_SERIAL` attribute to `exclusive` test. (#20026) · afc40a59
  由 gongweibao 提交于 9月 26, 2019
  
  afc40a59
25 9月, 2019 1 次提交
- S
  Avoid treating broadcast as initialization operation (#19857) · 5920d69d
  由 ShenLiang 提交于 9月 25, 2019
```
* treat broadcast as non-initial, test=develop

* rename the class name

* rename the class name, test=develop
```
  5920d69d
23 9月, 2019 1 次提交

Unify DataLoader APIs (#19305) · 0436efd6

由 Zeng Jinle 提交于 9月 23, 2019

* unify DataLoader APIs, test=develop

* integrate iterable CPU Dataset, test=develop
add GPU dataset supporting, test=develop

* add unittests for dataset, test=develop

* add more docs to dataloader apis, test=develop, test=document_preview

* refine doc, test=develop

* refine doc again, test=develop

* increase coverage, test=develop

0436efd6

20 9月, 2019 1 次提交
- Z
  
  fix readers bug, test=develop (#19868) · cee0079a
  由 Zeng Jinle 提交于 9月 20, 2019
  
  cee0079a
11 9月, 2019 1 次提交
- T
  
  remove trainer desc test in windows temporarily (#19753) · bda92434
  由 Thunderbrook 提交于 9月 11, 2019
  
  bda92434
10 9月, 2019 2 次提交
- Z
  
  add logs to left var memory size, test=develop (#19722) · bb4f8dee
  由 Zeng Jinle 提交于 9月 10, 2019
  
  bb4f8dee
- C
  increase timelimit test_pe_serexnext (#19702) · 2c30e64b
  由 chengduo 提交于 9月 10, 2019
```
test=develop
```
  2c30e64b
06 9月, 2019 1 次提交

Make test_pe_seresnext serial (#19634) · 5c4eb394

由 chengduo 提交于 9月 06, 2019

* make test_pe_seresnext serial
test=develop

* Increase test_pe_seresnext time limit on MAC
test=develop

5c4eb394

05 9月, 2019 1 次提交

Refactor dygraph (#19107) · e9233d1c

由 Jiabin Yang 提交于 9月 05, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* test=develop, refoctor name to make it easier to understand

* test=develop, refoctor name to make it easier to understand

* test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ

* test=develop, fix ut failed on parallel se-resnext

* test=develop, change one more PADDLE_ENFORCE

e9233d1c

03 9月, 2019 2 次提交

Update UT test_boxps (#19599) · 66ad68ed

由 hutuxian 提交于 9月 03, 2019

Disable test_boxps in win32.
Adjust filename to avoid latent multi-thread problem.

66ad68ed

replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586) · 49523ea1

由 Tao Luo 提交于 9月 03, 2019

* remove unused PADDLE_ASSERT(_IS_NOT_ERROR)

* replace PADDLE_ASSERT with PADDLE_ASSERT_MSG

test=develop

49523ea1

02 9月, 2019 1 次提交
- G
  
  Delete pserver complete file before executor running. (#19468) · 57f0f0f2
  由 gongweibao 提交于 9月 02, 2019
  
  57f0f0f2
30 8月, 2019 2 次提交
- S
  add gather_nd op and unit test (#19366) · 85914f7a
  由 ShenLiang 提交于 8月 30, 2019
```
* fixed the code for coverage

* fixed the document,test=document_preview test=develop
```
  85914f7a
- C
  Support feed single persistable variable to PE (#19417) · e340df01
  由 chengduo 提交于 8月 30, 2019
```
* update executor feed
```
  e340df01
27 8月, 2019 1 次提交

supports multiple NCCL communicators preserved in NCCLCommContext (#19407) · efb05ba2

由 Yi Liu 提交于 8月 27, 2019

* supports multiple NCCL communicators preserved in NCCLCommContext
test=develop

* add ut for c_comm_init_all operator and fix cuda resource release problem
test=develop

efb05ba2

22 8月, 2019 1 次提交

Split test_parallel_executor_seresnext to three unit test (#19239) · 6a163231

由 chengduo 提交于 8月 22, 2019

* increase test_parallel_executor_seresnext time limit
test=develop

* split test_parallel_executor_seresnext
test=develop

* temporally disable reduce_and_allreduce test because of the random failure.
test=develop

* split gpu and cpu
test=develop

6a163231

19 8月, 2019 2 次提交

Add match_matrix_tensor op (#18525) · 78a3d837

由 Aurelius84 提交于 8月 19, 2019

* add matrch_matrix_tensor op test=develop

* fix ignore unittest if with_mkl=off test=develop

* clean code and rm is_test param test=develop

* modify API.spec test=develop

* rm useless code in search_compute.h test=develop

* modify api.spec test=develop

* modify default_grad.spec test=develop

* Add API test code test=develop

* clean code in search_computer.h

* modify PADDLE_ENFORCE and clean search_compute.h test=develop

* fix code style test=develop

78a3d837

Z

merge develop to solve conflict, also fix API doc, test=develop (#18823) · 5b6673c4
由 Zeng Jinle 提交于 8月 19, 2019

5b6673c4

18 8月, 2019 1 次提交
- G
  Unset unittests http_proxy env to avoid timeout. (#19269) · fd4b15a2
  由 gongweibao 提交于 8月 18, 2019
```
Unset unittests http_proxy env to avoid timeout.
```
  fd4b15a2
12 8月, 2019 1 次提交
- G
  Polish fleet API to support cuda collective mode and nccl2 mode. (#18966) · 29d87812
  由 gongweibao 提交于 8月 12, 2019
```
Polish fleet API to support cuda collective mode and nccl2 mode
```
  29d87812
06 8月, 2019 1 次提交

Add var_conv_2d op (#18518) · e681d655

由 Kevin 提交于 8月 06, 2019

* fix overflow by int32 mul test=develop

* fix reference nullptr

* fix codestyle test=develop

* modify to point in ContextProjectFunctor test=develop

* modify to point in ContextProjectFunctor test=develop

* modify . to -> test=develop

* add var_conv_2d op test=develop

* edit api.spec test=develop

* ignore unittest if with_mkl=off test=develop

* fix python3 division test=develop

* fix ignore unittest bug test=develop

* remove useless code test=develop

* modify api.spec test=develop

* modify default_grad.spec test=develop

e681d655

04 8月, 2019 1 次提交
- D
  make listen and server as exclusive run (#18990) · c97ea53c
  由 Dong Daxiang 提交于 8月 04, 2019
```
make listen and server as exclusive run 
```
  c97ea53c
31 7月, 2019 1 次提交
- D
  make dist unit test exclusive run (#18865) · 2bb296df
  由 Dong Daxiang 提交于 7月 31, 2019
```
make dist unit test exclusive run
```
  2bb296df
30 7月, 2019 1 次提交
- C
  add CPUInplaceTestWithFuseOptimizationOps (#18867) · ecd2bdad
  由 chengduo 提交于 7月 30, 2019
```
test=develop
```
  ecd2bdad
28 7月, 2019 1 次提交
- Z
  
  fix affine_channel no_need buffer bug, test=develop (#18844) · 9a8a7a1d
  由 Zeng Jinle 提交于 7月 28, 2019
  
  9a8a7a1d
26 7月, 2019 1 次提交

Feature/mem opt pass refactor (#18735) · a802da65

由 Zeng Jinle 提交于 7月 26, 2019

* first version memory optimize pass, test=develop

* remove move_tensor_sharing_pass, test=develop

* refine code comments, add unittests, test=develop

* turn off memory_optimize by default, test=develop

* follow huihuang's comments, test=develop

* follow chengduoZH's comments, test=develop

* fix grammar error, add const qualifier, fix pass_test exception message, test=develop

* follow chengduoZH's comments 2nd, test=develop

a802da65

25 7月, 2019 1 次提交
- G
  split test_dist_se_resnext.py into 4 testcases (#18743) · 2efb282c
  由 guru4elephant 提交于 7月 25, 2019
```
* split test_dist_se_resnext.py into 4 testcases
```
  2efb282c
24 7月, 2019 1 次提交

Extend Matmul to support matrix multiplication with multiple heads (#18570) · 220eef60

由 Bob Zhu 提交于 7月 24, 2019

* extend matmul op to support multiple head multiplication

With the support of multiple head, the multiplication of two big matrixes is
split into multiplication of several (head_number) small matrixes. e.g. if
Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number
as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of
[6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].

220eef60

22 7月, 2019 1 次提交
- G
  split different comm method for mnist distributed training (#18715) · ebf9797e
  由 guru4elephant 提交于 7月 22, 2019
```
* split different comm method for mnist distributed training
```
  ebf9797e
18 7月, 2019 1 次提交

Feature/auto_growth_allocator (#18561) · ae58afc5

由 Zeng Jinle 提交于 7月 18, 2019

* feature/auto_growth_allocator, test=develop

* add unittest of AlignedAllocator, test=develop

* try to turn on auto_growth to test on CI, test=develop

* fix segmentation fault in mixed_vector.h, test=develop

* add unittests, test=develop

ae58afc5

15 7月, 2019 1 次提交
- G
  increase timeout again (#18628) · b71b4543
  由 guru4elephant 提交于 7月 15, 2019
```
test=develop
```
  b71b4543
12 7月, 2019 1 次提交
- K
  1）change to parallel mode on python coverage run (#18594) · 9ad57f2d
  由 kh2se2013 提交于 7月 12, 2019
```
2）add pip install coverage in Dockerfile.tmp
test=develop
```
  9ad57f2d
11 7月, 2019 1 次提交

Feature/buffer_shared_inplace (#17911) · d3003a16

由 Zeng Jinle 提交于 7月 11, 2019

* feature/buffer_shared_inplace, test=develop

* refine code, test=develop

* fix elementwise_add op cpu inplace and sum inplace bug, test=develop

* add unittest and debug log, test=develop

* fix parallel_executor scope bug, polish code, test=develop

* fix sum op, activation op, single_in_place_inference bug, test=develop

* remove kLocalExecScopeName, test=develop

* fix unittest,test=develop

* fix out_var first version bug, test=develop

* follow comments,test=develop

d3003a16

27 6月, 2019 2 次提交

add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad

由 kh2se2013 提交于 6月 27, 2019

* add WITH_COVERAGE option, default OFF

test=develop

* add coverage for python sdk

test=develop

* fix code style

* fix COVERAGE_FILE path

test=develop

* remove coverage package

test=develop

* test = develop, run coverage as module

27fb9cad

supports collective communicated training (#18175) · b7128bac

由 HaoRen 提交于 6月 27, 2019

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* fix comment
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* fix prepare context redundant code problem, optimize executor by caching create_varaiables
test=develop

* supports collective training in executor

* make fetch_list runable with variables, add more unittest for use_program_cache
test=develop

* use unique name for nccl_id

* supports output to stream in program_to_code

* insert sync_comm_stream before regularization; add skip_op_callstack capability in program_to_code

* set op role in collective training

* add collective op role

* fix comment
test=develop

* remove orig file

* add build optimizer by strategy

* add collective strategy

* refine collective strategy

* add multi-process role maker

* refine strategy building factory so that we can easily plugin more strategy

* scale loss grad in collective sgd transpiler

* add support for distributed fc

* code format

* revert some features for dist fc

* add support for distributed fc training

* test=develop
add collective op unittest standard

* test=develop
remove the test_collective directory

* test=develop
remove the test_collective directory

* remove slicegather test

* code format for reducescatter

* update attr of shard_index_op

* Modify macro nccl_helper

* remove test without distribute

* macro collective_helper

* marcro update

* test=develop
update support python3.5

* test=develop change gpu memory use to 0.1 when test

* test=develop
update ut equal func

* test=develop
set flags to 1.5

* test=develop fix pickle dumple  py35

* test=develop
fix divide in slice and add sync_comm_stream
update atol and rtol to 1e-05
rm shard_index op and test
modify read input from file to read from memory
remove origin_program in framework and add i/o in c_sync_calc_stream

* test=develop update unittest sync operator I/O

b7128bac

26 6月, 2019 2 次提交
- H
  
  add ut for pipeline training (#18289) · e42057cd
  由 hutuxian 提交于 6月 26, 2019
  
  e42057cd
- J
  
  test=develop, recover ocr ut on dygraph (#18166) · bd61d899
  由 Jiabin Yang 提交于 6月 26, 2019
  
  bd61d899

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致