Commits · 3f4c088ad8fe99286649194b438fef5a4784056c · 机器未来 / Paddle

08 Aug, 2019 1 commit

fix QueueDataset queue size (#19016) · fc038da7

Committed by jiaqi on Aug 08, 2019

* fix QueueDataset queue size: set queue size = batch size * 100 to avoid too many instances piling up in the channel when training is much slower than reading data.

fc038da7
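
The effect of bounding the queue can be illustrated with a small sketch. It uses Python's standard `queue.Queue` as a stand-in for Paddle's C++ channel, which this page does not show; `make_bounded_channel` and `QUEUE_SIZE_FACTOR` are hypothetical names, and only the `batch size * 100` rule comes from the commit message.

```python
import queue

QUEUE_SIZE_FACTOR = 100  # multiplier taken from the commit message


def make_bounded_channel(batch_size: int) -> queue.Queue:
    """Create a queue with capacity batch_size * 100 (hypothetical helper)."""
    return queue.Queue(maxsize=batch_size * QUEUE_SIZE_FACTOR)


channel = make_bounded_channel(batch_size=2)  # capacity is 200

# Simulate a fast reader while the trainer consumes nothing.
accepted = 0
for instance in range(1000):
    try:
        channel.put_nowait(instance)  # a real reader would block here instead
        accepted += 1
    except queue.Full:
        break  # the bound stops the reader from buffering more instances
```

With an unbounded queue the reader would have buffered all 1000 instances; with the bound it stops once the capacity is reached.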

23 Jul, 2019 1 commit

support patch data, add load_one_table, fix bug (#18509) · d18aabb4

Committed by jiaqi on Jul 23, 2019

(1) support patch data (merge the slots of instances that share the same line id, and modify dense layers whose size changes)
(2) add the fleet load_one_table interface, which supports loading from a paddle model and from a pslib model
(3) fix a push sparse bug that made push sparse take more time (about 10% in my test case)
(4) when some slots are not in one of your networks (join/update, etc.), data feed, label info collection, and push/pull sparse skip these slots instead of throwing an error
(5) add more debug info in TrainFilesWithProfiler

d18aabb4
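
Item (1) above, merging the slots of instances with the same line id, might look roughly like the following sketch. The record layout (a line id paired with a dict of slot names to feature ids), the function name `merge_by_line_id`, and the choice to concatenate values when a slot appears twice are all assumptions for illustration; the commit message does not describe the actual C++ implementation.

```python
from collections import defaultdict


def merge_by_line_id(instances):
    """Merge the slots of all instances that share a line id.

    `instances` is an iterable of (line_id, {slot_name: [feature ids]}) pairs.
    This layout is assumed for illustration only.
    """
    merged = defaultdict(lambda: defaultdict(list))
    for line_id, slots in instances:
        for slot_name, values in slots.items():
            # Assumed policy: values of a repeated slot are concatenated.
            merged[line_id][slot_name].extend(values)
    # Convert nested defaultdicts to plain dicts for easier inspection.
    return {line_id: dict(slots) for line_id, slots in merged.items()}


instances = [
    ("line-1", {"slot_a": [11]}),
    ("line-2", {"slot_a": [21]}),
    ("line-1", {"slot_b": [12, 13]}),  # patch data for line-1
]
merged = merge_by_line_id(instances)
```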

17 Jul, 2019 1 commit
- remove async executor and add data_feed.proto to the deps of train demo (#18659) · d714bf03
  Committed by guru4elephant on Jul 17, 2019
```
* remove async executor and add data_feed.proto to the deps of train demo
```
  d714bf03
21 Jun, 2019 1 commit

dataset (#17973) · 3f8031e2

Committed by jiaqi on Jun 21, 2019

(1) use channel instead of vector/BlockingQueue in Dataset, to stay consistent with the existing implementation and make the code more readable and flexible (dataset single output channel or multi output channel). One previous out-of-memory problem was caused by not releasing memory after training.
(2) add Record because MultiSlotType costs too much memory (80B), which fixes the out-of-memory problem.
(3) add Channel, Archive in paddle/fluid/framework
(4) change dataset from shared_ptr to unique_ptr in pybind
(5) move create/destroy readers from trainer to dataset
(6) move shuffle from datafeed to dataset. The dataset holds memory; datafeed only loads data and feeds it to the network.
(7) fix the thread num bug in Dataset when filelist size < thread num
(8) support set_queue_num in InMemoryDataset

3f8031e2
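
Item (7) above concerns the case where there are fewer files than reader threads. The commit message does not say how the bug was fixed; one common approach, sketched here as an assumption, is to clamp the number of readers to the number of files before splitting the file list. `assign_files` is a hypothetical helper, not a Paddle API, and it assumes `thread_num >= 1`.

```python
def assign_files(filelist, thread_num):
    """Split filelist across readers, never using more readers than files."""
    if not filelist:
        return []
    # Clamp so that no reader is created without a file to read.
    effective_threads = min(thread_num, len(filelist))
    # Round-robin assignment: file i goes to reader i % effective_threads.
    return [filelist[i::effective_threads] for i in range(effective_threads)]


# Three files but eight requested threads: only three readers get work.
shards = assign_files(["part-0", "part-1", "part-2"], thread_num=8)
```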

11 Jun, 2019 1 commit

Pipeline Concurrency (#17402) · 969e6378

Committed by hutuxian on Jun 11, 2019

Add Pipeline Concurrency Train Mode:
- Cpp: pipeline_trainer & section_worker
- Python: PipelineOptimizer
- Add a new data_feed type: PrivateInstantDataFeed
- Add a test demo of pipeline trainer and the test model is gnn
- win32 is not supported yet

969e6378

16 May, 2019 1 commit
- add inductive shape index (#17435) · 43c9561e
  Committed by guru4elephant on May 16, 2019
```
add inductive shape index
```
  43c9561e
15 May, 2019 1 commit

add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118) · 66d51206

Committed by jiaqi on May 15, 2019

* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop

* fix global shuffle bug, fix pull dense bug, fix release memory bug, fix shrink error
add client flush, add get data size
test=develop

* fix global shuffle bug
test=develop

* fix global shuffle bug
test=develop

* fix code style
test=develop

* fix code style & modify pslib cmake
test=develop

* fix error of _role_maker
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix windows compile error of fleet
test=develop

* fix global shuffle bug

* add comment
test=develop

* update pslib.cmake
test=develop

* fix fill sparse bug
test=develop

* fix push sparse bug
test=develop

66d51206

09 May, 2019 1 commit

fix infer_from_dataset and train_from_dataset (#17243) · 5d6a1fcf

Committed by guru4elephant on May 09, 2019

* fix train_from_dataset and infer_from_dataset example

* add inductive dim for data_reader; for example, with shape=[-1, 1], the -1 is inferred at run time from the number of elements read

5d6a1fcf
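
The inductive dimension described above can be illustrated with a short sketch: given a declared shape with one `-1` and the number of elements actually read, the `-1` is replaced by the implied size. `induce_shape` is a hypothetical function written for this illustration; it is not the data_reader code from the commit.

```python
def induce_shape(shape, num_elements):
    """Replace the single -1 in `shape` with the size implied by `num_elements`."""
    if shape.count(-1) != 1:
        raise ValueError("expected exactly one -1 dimension")
    # Product of all dimensions that are already known.
    known = 1
    for dim in shape:
        if dim != -1:
            known *= dim
    if known == 0 or num_elements % known != 0:
        raise ValueError("element count does not fit the declared shape")
    return [num_elements // known if dim == -1 else dim for dim in shape]


# Seven elements read for a slot declared as shape=[-1, 1].
resolved = induce_shape([-1, 1], num_elements=7)
```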

16 Apr, 2019 4 commits
- fix bug of num > INT_MAX · 10991e00
  Committed by xjqbest on Apr 16, 2019

  10991e00
- fix bug of num > INT_MAX · 241120d9
  Committed by xjqbest on Apr 16, 2019

  241120d9
- fix bug of num > INT_MAX · dac70ad4
  Committed by xjqbest on Apr 16, 2019

  dac70ad4
- fix bug of num > INT_MAX · 74471397
  Committed by xjqbest on Apr 16, 2019

  74471397
10 Apr, 2019 2 commits
- remove comment in data_feed.cc · ea07eb8c
  Committed by dongdaxiang on Apr 10, 2019
```
develop=test
```
  ea07eb8c
- add gpu training for Executor.train_from_dataset · 05464e7c
  Committed by dongdaxiang on Apr 10, 2019
```
test=develop
```
  05464e7c
04 Apr, 2019 1 commit
- remove trainer_id in datafeed and dataset · 6a57e807
  Committed by xjqbest on Apr 04, 2019
```
test=develop
```
  6a57e807
03 Apr, 2019 1 commit
- fix dataset bug · 271b7147
  Committed by xjqbest on Apr 03, 2019
```
test=develop
```
  271b7147
30 Mar, 2019 1 commit
- fix client to client communication bug · a99c8d0c
  Committed by xjqbest on Mar 30, 2019
```
test=develop
```
  a99c8d0c
29 Mar, 2019 23 commits
- add WIN32 for rand_r and usleep · f7e48138
  Committed by dongdaxiang on Mar 27, 2019
```
test=develop
```
  f7e48138
- add more _LINUX macro on data_feed.cc for mac and windows compile · cedbc161
  Committed by dongdaxiang on Mar 27, 2019
```
test=develop
```
  cedbc161
- add _LINUX macro · c5980c35
  Committed by dongdaxiang on Mar 27, 2019
```
test=develop
```
  c5980c35
- fix windows compile problem · e3107a6a
  Committed by dongdaxiang on Mar 26, 2019
```
test=develop
```
  e3107a6a
- remove local random engine in fleet with rand_r() · d4514949
  Committed by dongdaxiang on Mar 26, 2019
```
test=develop
```
  d4514949
- run pre-commit check files and fix code style problem · 45eb6f07
  Committed by dongdaxiang on Mar 26, 2019
```
test=develop
```
  45eb6f07
- fix code style & fix register bug & add release_memory · be74de2c
  Committed by xjqbest on Mar 24, 2019
```
test=develop
```
  be74de2c
- support multi dataset && add init model && fix bug · a5b1a0e1
  Committed by xujiaqi01 on Mar 20, 2019

  a5b1a0e1
- add some log && fix error · d25389fe
  Committed by xujiaqi01 on Mar 14, 2019

  d25389fe
- fix bug && add DestroyReaders in trainer · 39449ba0
  Committed by xujiaqi01 on Mar 13, 2019

  39449ba0
- add dataset factory && fix style · ecfc7df9
  Committed by xujiaqi01 on Mar 13, 2019

  ecfc7df9
- store memory data in Dataset && fix bug · 3cea00bd
  Committed by xujiaqi01 on Mar 12, 2019

  3cea00bd
- fix data reading bugs in api, add VLOG(3) log for setup · b66f0074
  Committed by dongdaxiang on Mar 10, 2019

  b66f0074
- modify c++ and python dataset related code & fix bug · dd67ad08
  Committed by xjqbest on Mar 09, 2019

  dd67ad08
- fix some conflict for compilation · cc4def6b
  Committed by dongdaxiang on Mar 08, 2019

  cc4def6b
- refactor & fix bug · 9bca1926
  Committed by heqiaozhi on Mar 08, 2019

  9bca1926
- add DataSet and InMemoryDataFeed, support load data into memory and shuffle data · 2e9a836c
  Committed by xjqbest on Mar 06, 2019

  2e9a836c
- add RunFromDataset in executor · 24863897
  Committed by dongdaxiang on Mar 08, 2019

  24863897
- add DataSet and InMemoryDataFeed, support load data into memory and shuffle data · 824b84d1
  Committed by xjqbest on Mar 06, 2019

  824b84d1
- add pipe command io interface · 687cb79d
  Committed by dongdaxiang on Feb 26, 2019

  687cb79d
- move fs.cc and shell.cc into paddle/fluid/framework/io · 1fe54416
  Committed by dongdaxiang on Feb 22, 2019
```
test=develop
```
  1fe54416
- add fs_local_open example · 53fbab5d
  Committed by dongdaxiang on Feb 22, 2019

  53fbab5d
- add fs_local_open example · afaf9370
  Committed by dongdaxiang on Feb 22, 2019

  afaf9370

机器未来 / Paddle is in sync with the fork's source project