Commits · 02c370c3dc5424941efa6e231a122e8ee80593d6 · BaiXuePrincess / Paddle

02 Aug, 2019 1 commit

support filelist size < trainer num && fix pull dense (#18956) · 02c370c3

Committed by jiaqi on Aug 02, 2019

* support filelist size < trainer num
* pull dense params when training stops, so that local dense params match those on the pserver; saving a Paddle model then saves the same dense model as the pserver
* enable QueueDataset to train on the same filelist several times
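
The first item can be pictured with a small sketch. The commit does not say how files are assigned, so the round-robin policy and the helper name `split_filelist` below are assumptions for illustration only, not Paddle's implementation. The point is that trainers beyond the number of files end up with an empty shard instead of an error.

```python
def split_filelist(filelist, trainer_num):
    """Round-robin files over trainers (hypothetical helper).

    When len(filelist) < trainer_num, the extra trainers get an empty list.
    """
    if trainer_num < 1:
        raise ValueError("trainer_num must be at least 1")
    shards = [[] for _ in range(trainer_num)]
    for index, path in enumerate(filelist):
        shards[index % trainer_num].append(path)
    return shards


# Two files, four trainers: trainers 2 and 3 receive no files.
shards = split_filelist(["part-0", "part-1"], trainer_num=4)
```

A trainer with an empty shard still has to take part in any collective steps such as barriers, which is the kind of case this commit appears to address.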

02c370c3

31 Jul, 2019 1 commit

set fleet_send_batch_num a default value according to trainer num · 233746d8

Committed by jiaqi on Jul 31, 2019

(1) set a default value for fleet_send_batch_num according to trainer num. The previous value was fixed at 80000; if trainer num is much smaller or larger than 100, global shuffle may hit a timeout error.
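
As a rough illustration of item (1): the commit message does not give the formula, so the proportional rule below is an assumption, chosen only because it reproduces the old fixed value of 80000 at 100 trainers. The function name and the `per_trainer` parameter are hypothetical.

```python
def default_send_batch_num(trainer_num, per_trainer=800):
    """Scale the send batch num with the trainer count (assumed rule).

    per_trainer=800 is an assumption: 800 * 100 trainers == 80000,
    the previous fixed value mentioned in the commit message.
    """
    if trainer_num < 1:
        raise ValueError("trainer_num must be at least 1")
    return per_trainer * trainer_num


small_cluster = default_send_batch_num(10)
large_cluster = default_send_batch_num(400)
```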

(2) fix load one table bug, add barrier

233746d8

23 Jul, 2019 1 commit

support patch data, add load_one_table, fix bug (#18509) · d18aabb4

Committed by jiaqi on Jul 23, 2019

(1) support patch data (merge slots of instances with the same line id, modify a dense layer whose size changes)
(2) add fleet load_one_table interface, supporting load from a Paddle model and from a pslib model
(3) fix a push sparse bug that made push sparse take more time (about 10% in my test case)
(4) when some slots are not in one of your networks (join/update, etc.), data feed, collect label info, and push/pull sparse skip these slots instead of throwing an error
(5) add more debug info in TrainFilesWithProfiler
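
The "merge slots of instances with the same line id" step in item (1) can be sketched as follows. The record layout (a line id paired with a dict of slot name to feature list) and the function name are assumptions for illustration; the actual C++ data structures in the commit are not shown here.

```python
from collections import OrderedDict


def merge_by_line_id(instances):
    """Merge slot values of instances that share a line id (sketch).

    instances: iterable of (line_id, {slot_name: [value, ...]}).
    Returns a list of (line_id, merged_slots) in first-seen order.
    """
    merged = OrderedDict()
    for line_id, slots in instances:
        target = merged.setdefault(line_id, {})
        for name, values in slots.items():
            # Append to the slot if it already exists for this line id.
            target.setdefault(name, []).extend(values)
    return list(merged.items())


rows = [
    ("a", {"s1": [1]}),
    ("b", {"s1": [2]}),
    ("a", {"s2": [3], "s1": [4]}),
]
merged_rows = merge_by_line_id(rows)
```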

d18aabb4

21 Jun, 2019 1 commit

dataset (#17973) · 3f8031e2

Committed by jiaqi on Jun 21, 2019

(1) use channel instead of vector/BlockingQueue in Dataset, to stay consistent with the existing implementation and make the code more readable and flexible (a dataset with a single output channel or multiple output channels). One previous memory-limit problem was caused by not releasing memory after training.
(2) add Record because MultiSlotType costs too much memory (80B); this fixes the memory-limit problem.
(3) add Channel, Archive in paddle/fluid/framework
(4) change dataset from shared_ptr to unique_ptr in pybind
(5) move create/destroy readers from trainer to dataset
(6) move shuffle from datafeed to dataset. The dataset holds the memory; datafeed only loads data and feeds it to the network.
(7) fix thread num bug of Dataset when filelist size < thread num
(8) support set_queue_num in InMemoryDataset
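
Items (1), (6) and (7) can be pictured with a toy model. The class below is a sketch under stated assumptions, not Paddle's `InMemoryDataset`: the class name, the `read_file` callback, and the `seed` parameter are hypothetical. It only shows the dataset owning the records, shuffling them itself, clamping the thread count to the number of files, and releasing memory after training.

```python
import random


class InMemoryDatasetSketch:
    """Toy model of a dataset that owns its records (illustrative only)."""

    def __init__(self, filelist, thread_num):
        self.filelist = list(filelist)
        # (7) never use more reader threads than there are files.
        self.thread_num = max(1, min(thread_num, len(self.filelist)))
        self.records = []

    def load_into_memory(self, read_file):
        # read_file is a hypothetical callback: path -> list of records.
        for path in self.filelist:
            self.records.extend(read_file(path))

    def local_shuffle(self, seed=None):
        # (6) shuffling lives in the dataset, not in the data feed.
        random.Random(seed).shuffle(self.records)

    def release_memory(self):
        # (1) drop the records after training to avoid holding memory.
        self.records = []


dataset = InMemoryDatasetSketch(["a", "b"], thread_num=8)
dataset.load_into_memory(lambda path: [path + "-0", path + "-1"])
dataset.local_shuffle(seed=0)
```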

3f8031e2

11 Jun, 2019 1 commit

Pipeline Concurrency (#17402) · 969e6378

Committed by hutuxian on Jun 11, 2019

Add Pipeline Concurrency Train Mode:
- Cpp: pipeline_trainer & section_worker
- Python: PipelineOptimizer
- Add a new data_feed type: PrivateInstantDataFeed
- Add a test demo of the pipeline trainer; the test model is gnn
- win32 is not supported for now
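
The general idea of pipeline concurrency, with sections running concurrently and handing work to each other through queues, can be sketched in plain Python. This is a generic illustration using the standard library and is not the C++ `pipeline_trainer` / `section_worker` code; `run_section`, `run_pipeline`, and `queue_size` are hypothetical names.

```python
import queue
import threading

_DONE = object()  # sentinel that tells a section to stop


def run_section(fn, inbox, outbox):
    """One pipeline section: read from inbox, apply fn, write to outbox."""
    while True:
        item = inbox.get()
        if item is _DONE:
            outbox.put(_DONE)  # pass the stop signal downstream
            return
        outbox.put(fn(item))


def run_pipeline(stages, inputs, queue_size=2):
    """Run each stage in its own thread, linked by bounded queues."""
    queues = [queue.Queue(maxsize=queue_size) for _ in range(len(stages) + 1)]

    def feed():
        for item in inputs:
            queues[0].put(item)
        queues[0].put(_DONE)

    threads = [threading.Thread(target=feed)]
    for i, fn in enumerate(stages):
        threads.append(threading.Thread(
            target=run_section, args=(fn, queues[i], queues[i + 1])))
    for thread in threads:
        thread.start()

    results = []
    while True:  # the caller drains the last queue
        item = queues[-1].get()
        if item is _DONE:
            break
        results.append(item)
    for thread in threads:
        thread.join()
    return results


outputs = run_pipeline([lambda x: x + 1, lambda x: x * 2], range(4))
```

With one worker per section, items leave the pipeline in the order they entered, while different sections work on different items at the same time.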

969e6378

18 May, 2019 1 commit

examples use code-block in dataset.py (#17451) · e32f4c4f

Committed by jiaqi on May 18, 2019

* examples use code-block in dataset.py
test=develop
test=document_preview

* add QueueDataset example
test=develop
test=document_preview

e32f4c4f

15 May, 2019 1 commit

add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118) · 66d51206

Committed by jiaqi on May 15, 2019

* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop

* fix global shuffle bug, fix pull dense bug, fix release memory bug, fix shrink error
add client flush, add get data size
test=develop

* fix global shuffle bug
test=develop

* fix global shuffle bug
test=develop

* fix code style
test=develop

* fix code style & modify pslib cmake
test=develop

* fix error of _role_maker
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix windows compile error of fleet
test=develop

* fix global shuffle bug

* add comment
test=develop

* update pslib.cmake
test=develop

* fix fill sparse bug
test=develop

* fix push sparse bug
test=develop

66d51206

25 Apr, 2019 1 commit

Fleet unify distributed training (#16791) · 1a4a51db

Committed by tangwei12 on Apr 25, 2019

* implement distributed transpiler with fleet

1a4a51db

11 Apr, 2019 2 commits

fix code style · 0ef19e2d

Committed by xjqbest on Apr 11, 2019

0ef19e2d

fix release_memory not found & fix doc string example error · efe4311a

Committed by xjqbest on Apr 11, 2019

test=develop

efe4311a

10 Apr, 2019 2 commits

Update dataset.py · ba98872d

Committed by guru4elephant on Apr 10, 2019

ba98872d

add gpu training for Executor.train_from_dataset · 05464e7c

Committed by dongdaxiang on Apr 10, 2019

test=develop

05464e7c

04 Apr, 2019 1 commit

remove trainer_id in datafeed and dataset · 6a57e807

Committed by xjqbest on Apr 04, 2019

test=develop

6a57e807

03 Apr, 2019 1 commit

fix dataset bug · 514d727a

Committed by xjqbest on Apr 03, 2019

test=develop

514d727a

01 Apr, 2019 1 commit

refine dataset API · 2c5839f7

Committed by dongdaxiang on Apr 01, 2019

test=develop

2c5839f7

29 Mar, 2019 12 commits

fix API spec about infer_from_dataset · 3829eac2

Committed by dongdaxiang on Mar 29, 2019

test=develop

3829eac2

fix code style & runtime error · a38b98cb

Committed by xjqbest on Mar 26, 2019

test=develop

a38b98cb

fix code style & fix register bug & add release_memory · be74de2c

Committed by xjqbest on Mar 24, 2019

test=develop

be74de2c

add comment for dataset · b8382076

Committed by dongdaxiang on Mar 23, 2019

test=develop

b8382076

fix bug of gen_worker_desc and set_filelist, add some doc · b7940c29

Committed by xjqbest on Mar 22, 2019

b7940c29

support multi dataset && add init model && fix bug · a5b1a0e1

Committed by xujiaqi01 on Mar 20, 2019

a5b1a0e1

fix dataset float32 type problem · f6c9232a

Committed by dongdaxiang on Mar 18, 2019

f6c9232a

add dataset factory && fix style · ecfc7df9

Committed by xujiaqi01 on Mar 13, 2019

ecfc7df9

store memory data in Dataset && fix bug · 3cea00bd

Committed by xujiaqi01 on Mar 12, 2019

3cea00bd

fix data reading bugs in api, add VLOG(3) log for setup · b66f0074

Committed by dongdaxiang on Mar 10, 2019

b66f0074

modify c++ and python dataset related code & fix bug · dd67ad08

Committed by xjqbest on Mar 09, 2019

dd67ad08

add dataset_generator.py · c28bbdf8

Committed by dongdaxiang on Feb 28, 2019

dataset_generator.py is a framework for generating data with Python
the generated data, in a fixed format, is fed into the C++ reader
test=develop

c28bbdf8
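
To make "generated data in a fixed format" concrete, here is a minimal sketch of a Python-side serializer. The commit message does not specify the format, so the length-prefixed layout below (each slot written as its value count followed by the values) and the function name `to_multislot_line` are assumptions for illustration, not the contents of dataset_generator.py.

```python
def to_multislot_line(sample):
    """Serialize one sample as a space-separated text line (assumed format).

    sample: list of (slot_name, values). Each slot is written as
    "<count> <v1> ... <vN>"; slot names are not written to the line.
    """
    parts = []
    for _name, values in sample:
        if not values:
            raise ValueError("each slot needs at least one value")
        parts.append(str(len(values)))
        parts.extend(str(value) for value in values)
    return " ".join(parts)


line = to_multislot_line([("words", [1926, 8, 17]), ("label", [1])])
```

A reader on the other side can parse such a line without a schema lookup per value, because every slot announces its own length.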
