提交 · 0ec3a42e9740a5f5066053bb49a923d538eba24a · PaddlePaddle / Paddle

19 5月, 2020 1 次提交

由 hutuxian 提交于 5月 19, 2020

* Refactor code for dump_field & dump_param: abstracting the common function in base class.
* Support dump randomly & random with lineid
* Support specify the random interval, which avoids printing too much logs.

0ec3a42e

01 4月, 2020 1 次提交
- X
  add fleet pslib pull and push sparse op and push dense op (#23139) · 3a45767d
  由 xujiaqi01 提交于 4月 01, 2020
```
* add fleet pslib pull and push sparse op and push dense op
* test=develop
```
  3a45767d
02 2月, 2020 1 次提交
- X
  add GeneralRoleMaker (#22295) · 371f377b
  由 xujiaqi01 提交于 2月 02, 2020
```
* add GeneralRoleMaker which is for general usage
* test=develop
```
  371f377b
18 12月, 2019 1 次提交
- X
  fix compiled error when with_pslib=on (#21769) · 0eb4d990
  由 xujiaqi01 提交于 12月 18, 2019
```
* fix compiled error of butil when with_pslib=on and with_testing=on
* test=develop
```
  0eb4d990
28 11月, 2019 1 次提交

remove -Wno-error=sign-compare, make warning as error (#21358) · c0656dcb

由 Tao Luo 提交于 11月 28, 2019

* remove -Wno-error=sign-compare, make warning as error

test=develop test=document_fix

* fix exist compile warning

test=develop

c0656dcb

15 10月, 2019 1 次提交

Fix communicator slow bug & fix communicator stop bug (#20366) · 940c6ff1

由 Chengmo 提交于 10月 15, 2019

* test=develop,Fix communicator slow bug

* test=develop, delete if() in stop_worker()

* test=develop

* fix UT, test=develop

* fix bug in fetch handler, test=develop

* fix bug in fetch handler, test=develop

* test=develop, fix fetch barrier bug

* test=develop, bug fix

* test=develop, bug fix

* test=develop, fix bug

940c6ff1

14 10月, 2019 1 次提交
- T
  dump fix dov vec file num (#20539) · f76a32df
  由 Thunderbrook 提交于 10月 14, 2019
```
* support dump multi file
test=develop

* dump fix num file
test=develop
```
  f76a32df
24 9月, 2019 1 次提交

support change shuffle and train thread num (#19841) · cedc0477

由 xujiaqi01 提交于 9月 24, 2019

* support change shuffle thread num
* support change train thread num
* fix receive shuffle data of each channel
* data norm stop gradient
* add check thread_tensor type and root_tensor type when merge metric
* remove sleep in shuffle, add config
* add config of pslib client to client communication
* fix xbox str
* add data norm op testcase
* add flush in trainer finalize

cedc0477

30 8月, 2019 1 次提交

add thread scope stat accurate metrics test=develop (#19480) · 10ca3f96

由 yaoxuefeng 提交于 8月 30, 2019

* add thread scope stat accurate metrics test=develop

* fix style

* fix style

* fix style

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix conflict

* fix style

* fix style test=develop

* fix error test=develop

* fix error test=develop

10ca3f96

29 8月, 2019 1 次提交

support debug each output of each ins (#19004) · 1fe468d3

由 Thunderbrook 提交于 8月 29, 2019

* dump slot

* test

* proto

* dump slot

* test

* proto

* code style

* code style

* code style

* style

* add delete after unseen days

* add unseen days

* code style

* conflict solve
test=develop

* add clear model

* code style
test=develop

* code style
test=develop

* support debug tensor of each ins
test=develop

* support debug tensor of each ins
test=develop

* learning rate

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style
test=develop

* code style
test=develop

* unitest

* style

* style

* multi phase

* add channel

* code style

* style

* style

* unitest

* style

* define

* define
test=develop

* style
test=develop

* rm define
test=develop

* linux

* linux
test=develop

* style
test=develop

* output format
test=develop

* windows ci
test=develop

1fe468d3

21 6月, 2019 1 次提交

dataset (#17973) · 3f8031e2

由 jiaqi 提交于 6月 21, 2019

(1) use channel instead of vector/BlockingQueue in Dataset，to keep same with existing implementation, and make code more readable and flexible (dataset single output channel or multi output channel). one previous memory out of limit problem is cause by not release memory after training.
(2) add Record because MultiSlotType costs too much memory (80B)，fix memory out of limit problem.
(3) add Channel, Archive in paddle/fluid/framework
(4) change dataset from shared_ptr to unique_ptr in pybind
(5) move create/destroy readers from trainer to dataset
(6) move shuffle from datafeed to dataset. dataset holds memory, datafeed is only for load data and feed data to network.
(7) fix thread num bug of Dataset when filelist size < thread num
(8) support set_queue_num in InMemoryDataset

3f8031e2

29 3月, 2019 15 次提交
- D
  move root_scope->DropKids() into Finalize() so that we do not have to drop all the kids · ba15d6b1
  由 dongdaxiang 提交于 3月 24, 2019
```
test=develop
```
  ba15d6b1
- X
  
  support multi dataset && add init model && fix bug · a5b1a0e1
  由 xujiaqi01 提交于 3月 20, 2019
  
  a5b1a0e1
- D
  
  add trainfileswithprofiler for downpour worker · 6af697ad
  由 dongdaxiang 提交于 3月 15, 2019
  
  6af697ad
- D
  add comment for MPI Symetric role maker · 2644b886
  由 dongdaxiang 提交于 3月 14, 2019
```
test=develop
```
  2644b886
- D
  
  add distributed optimizer factory · cf45c543
  由 dongdaxiang 提交于 3月 13, 2019
  
  cf45c543
- X
  
  fix bug && add DestroyReaders in trainer · 39449ba0
  由 xujiaqi01 提交于 3月 13, 2019
  
  39449ba0
- D
  refactor downpour optimization · 328f11b8
  由 dongdaxiang 提交于 3月 12, 2019
```
test=develop
```
  328f11b8
- D
  
  fix data reading bugs in api, add VLOG(3) log for setup · b66f0074
  由 dongdaxiang 提交于 3月 10, 2019
  
  b66f0074
- D
  
  make Dataset* as an argument · b415ec27
  由 dongdaxiang 提交于 3月 09, 2019
  
  b415ec27
- X
  
  modify c++ and python dataset related code & fix bug · dd67ad08
  由 xjqbest 提交于 3月 09, 2019
  
  dd67ad08
- D
  
  add RunFromDataset in executor · 24863897
  由 dongdaxiang 提交于 3月 08, 2019
  
  24863897
- X
  
  add DataSet and InMemoryDataFeed, support load data into memory and shuffle data · 824b84d1
  由 xjqbest 提交于 3月 06, 2019
  
  824b84d1
- D
  
  fix class register problem · 39014b9f
  由 dongdaxiang 提交于 2月 02, 2019
  
  39014b9f
- D
  refine device_worker and trainer code · c1650120
  由 dongdaxiang 提交于 2月 02, 2019
```
test=develop
```
  c1650120
- D
  add dist_multi_trainer for distributed training, add trainer_factory and... · 855bf579
  由 dongdaxiang 提交于 1月 28, 2019
```
add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
```
  855bf579

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功