提交 · 893ea7e01c61bb766eb343bb24afb12244feaabb · 机器未来 / Paddle

02 12月, 2019 1 次提交

[cherry-pick] find lookup table in order & support dump param (#21347) · 893ea7e0

由 Thunderbrook 提交于 12月 02, 2019

* support dump param of model into afs (#20302)

* support dump param to afs
test=develop

* code style
test=develop

* code style
test=develop

* dump param
test=develop

* dump param
test=develop

* dump param
test=develop

* dump param
test=develop

* find lookup table in order (#20932)

test=develop

* cherry-pick
test=develop

* solve pslib core in stop worker
test=develop

* print table stat info for pslib
test=develop

893ea7e0

28 11月, 2019 1 次提交

cherry-pick1.6 fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21339) · 072eb5b6

由 xujiaqi01 提交于 11月 28, 2019

* fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052)

* fix cache table bug
* add save_paddle_inference_model
* fix hdfs util bug
* test=develop

* fix several sparse table issuses (#20686)

* no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto.
* add find_distributed_lookup_table_grads instead of hard code GRAD
* support embedding stop gradient. push sparse has error before fix this.* 
* fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this.
* fix pull sparse, skip slots which do not have embedding.
* fix collect feasign label info, skip slots which do not have embedding.
* support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables.
* test=develop

* add copy table (#21086)

* copy some feasigns and corresponding embeddings from one sparse table to another
* copy all feasigns and corresponding embeddings from one sparse table to another
* copy all dense params from one table to another
* copy some local vars to other local vars

* fix fs_client_param bug (#21212)

* fix fs_client_param bug， user can set this config through fleet_desc_file or fleet config
* test=develop

* fix fleet util bug (#21254)

* fix fleet util bug in save paddle inference model
* test=develop

072eb5b6

01 11月, 2019 1 次提交
- X
  add check nan / inf in downpour worker (#20694) (#20925) · 5c3656bb
  由 xujiaqi01 提交于 11月 01, 2019
```
* add check nan / inf in downpour worker during training
* test=develop
```
  5c3656bb
17 9月, 2019 1 次提交
- T
  rm return in vfork (#19734) · 40c66f8d
  由 Thunderbrook 提交于 9月 17, 2019
```
* rm return in vfork

* rm return in vfork
test=develop
```
  40c66f8d
30 8月, 2019 1 次提交

add thread scope stat accurate metrics test=develop (#19480) · 10ca3f96

由 yaoxuefeng 提交于 8月 30, 2019

* add thread scope stat accurate metrics test=develop

* fix style

* fix style

* fix style

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix style test=develop

* fix conflict

* fix style

* fix style test=develop

* fix error test=develop

* fix error test=develop

10ca3f96

29 8月, 2019 1 次提交

support debug each output of each ins (#19004) · 1fe468d3

由 Thunderbrook 提交于 8月 29, 2019

* dump slot

* test

* proto

* dump slot

* test

* proto

* code style

* code style

* code style

* style

* add delete after unseen days

* add unseen days

* code style

* conflict solve
test=develop

* add clear model

* code style
test=develop

* code style
test=develop

* support debug tensor of each ins
test=develop

* support debug tensor of each ins
test=develop

* learning rate

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style

* code style
test=develop

* code style
test=develop

* unitest

* style

* style

* multi phase

* add channel

* code style

* style

* style

* unitest

* style

* define

* define
test=develop

* style
test=develop

* rm define
test=develop

* linux

* linux
test=develop

* style
test=develop

* output format
test=develop

* windows ci
test=develop

1fe468d3

01 8月, 2019 1 次提交
- J
  adjust ins weight according to nid slot (#18784) · 768059b3
  由 jiaqi 提交于 8月 01, 2019
```
adjust ins weight according to nid slot , user can specify adjust_ins_weight in strategy
```
  768059b3
25 7月, 2019 1 次提交

Fix shrink-dense and add scale-datanorm (#18746) · c167a4b4

由 fuyinno4 提交于 7月 25, 2019

Fix FleetWrapper:
1. fix shrink dense: just scale show
2. add datanorm scale: divide datanorm's gradient by batch_size

c167a4b4

24 7月, 2019 1 次提交

add slot to sparse table (#18686) · d8396281

由 Thunderbrook 提交于 7月 24, 2019

The change includes 2 things:

1. save delta model and shrink table are control by the same parameter before, now add delete_after_unseen_days to control shrink table.
2. value in sparse table has no slot before, now add slot in sparse table, and add DownpureCtrAccessor to support the new meta.
test=develop

d8396281

23 7月, 2019 1 次提交

support patch data, add load_one_table, fix bug (#18509) · d18aabb4

由 jiaqi 提交于 7月 23, 2019

（1）support patch data （merge slots of instances of same line id, modify dense layer which
changes its size）
（2）add fleet load_one_table interface, support load from paddle model and load from pslib model
（3）fix push sparse bug which cause push sparse cost more time（about 10% in my testcase）
（4）when some slots are not in one of your network (join/update, etc.)，data feed、collect label info、push/pull sparse will skip these slots， instead of throw error.
（5）add more debug info in TrainFilesWithProfiler

d18aabb4

15 5月, 2019 1 次提交

add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118) · 66d51206

由 jiaqi 提交于 5月 15, 2019

* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop

* fix global shuffle bug, fix pull dense bug, fix release memeory bug, fix shrink error
add client flush, add get data size
test=develop

* fix global shuffle bug
test=develop

* fix global shuffle bug
test=develop

* fix code style
test=develop

* fix code style & modify pslib cmake
test=develop

* fix error of _role_maker
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix windows compile error of fleet
test=develop

* fix global shuffle bug

* add comment
test=develop

* update pslib.cmake
test=develop

* fix fill sparse bug
test=develop

* fix push sparse bug
test=develop

66d51206

09 5月, 2019 1 次提交

fix infer_from_dataset and train_from_dataset (#17243) · 5d6a1fcf

由 guru4elephant 提交于 5月 09, 2019

* fix train_from_dataset and infer_from_dataset example

* add inductive dim for data_reader, example: shape=[-1, 1], then -1 will be inducted through run-time reading of number of elements

5d6a1fcf

11 4月, 2019 1 次提交
- D
  remove all warnings · 3c2d2368
  由 dongdaxiang 提交于 4月 11, 2019
```
test=develop
```
  3c2d2368
29 3月, 2019 18 次提交
- D
  fix pull sparse slow problem · 98dda08a
  由 dongdaxiang 提交于 3月 29, 2019
```
test=develop
```
  98dda08a
- X
  fix FillSparseValue error · 030c7e7e
  由 xjqbest 提交于 3月 28, 2019
```
test=develop
```
  030c7e7e
- D
  refine document of python API, make device_worker and trainer's API private · d87ba58c
  由 dongdaxiang 提交于 3月 26, 2019
```
test=develop
```
  d87ba58c
- D
  add doc string for executor and update API.spec · b95b80bc
  由 dongdaxiang 提交于 3月 25, 2019
```
test=develop
```
  b95b80bc
- D
  
  refine print fetch list · 6bf796df
  由 dongdaxiang 提交于 3月 21, 2019
  
  6bf796df
- D
  add fetch var function · 68d7bf3d
  由 dongdaxiang 提交于 3月 20, 2019
```
test=develop
```
  68d7bf3d
- D
  add data_generator into paddle.fluid.incubate.data_generator, add op run log... · 73b1f396
  由 dongdaxiang 提交于 3月 17, 2019
```
add data_generator into paddle.fluid.incubate.data_generator, add op run log in hogwild_device_worker and downpour_device_worker
test=develop
```
  73b1f396
- D
  
  add training speed log · 73544e8b
  由 dongdaxiang 提交于 3月 15, 2019
  
  73544e8b
- D
  
  add IO percent for multi_trainer · 9419de52
  由 dongdaxiang 提交于 3月 15, 2019
  
  9419de52
- D
  
  add trainfileswithprofiler for downpour worker · 6af697ad
  由 dongdaxiang 提交于 3月 15, 2019
  
  6af697ad
- D
  add comment for MPI Symetric role maker · 2644b886
  由 dongdaxiang 提交于 3月 14, 2019
```
test=develop
```
  2644b886
- H
  
  refactor & fix bug · 9bca1926
  由 heqiaozhi 提交于 3月 08, 2019
  
  9bca1926
- D
  
  add printer for fetch variable · cf136064
  由 dongdaxiang 提交于 2月 18, 2019
  
  cf136064
- D
  
  make pull dense worker work · 97d5cd30
  由 dongdaxiang 提交于 2月 02, 2019
  
  97d5cd30
- D
  
  fix class register problem · 39014b9f
  由 dongdaxiang 提交于 2月 02, 2019
  
  39014b9f
- D
  
  make s_instance_ private to ensure singleton · 378037c5
  由 dongdaxiang 提交于 2月 02, 2019
  
  378037c5
- D
  refine device_worker and trainer code · c1650120
  由 dongdaxiang 提交于 2月 02, 2019
```
test=develop
```
  c1650120
- D
  add dist_multi_trainer for distributed training, add trainer_factory and... · 855bf579
  由 dongdaxiang 提交于 1月 28, 2019
```
add dist_multi_trainer for distributed training, add trainer_factory and device_worker_factory so that we can easily extend new training mode, add pull dense worker which is a singleton for parameter fetching
```
  855bf579

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致