提交 · c35413bf5c42fe22821c7fbb9c9ac837dfa13865 · Crayon鑫 / Paddle

11 2月, 2020 1 次提交

cherry-pick 22509. test=develop test=release/1.7 (#22527) · 49a80b45

由 Wilber 提交于 2月 11, 2020

[cherry-pick] #22509

支持不依赖nccl进行编译。

多卡下，如果没有打开WITH_NCCL开关编译，多卡不能通信，则只能选择一张卡使用

49a80b45

05 2月, 2020 2 次提交
- W
  cherry-pick 22384 and 22371. test=develop test=release/1.7 (#22453) · fb98116c
  由 Wilber 提交于 2月 05, 2020
```
[cherry-pick] #22384 and #22371

22384增加了WITH_NCCL开关

22371修改了fluid依赖lite的commit id
```
  fb98116c
- X
  add GeneralRoleMaker (#22295) (#22446) · 7171b20e
  由 xujiaqi01 提交于 2月 05, 2020
```
* add GeneralRoleMaker which is for general usage
* test=develop
```
  7171b20e
04 2月, 2020 1 次提交
- X
  add collective communication library in fleet (#22211) (#22435) · be528bf2
  由 xujiaqi01 提交于 2月 04, 2020
```
* add collective communication library in fleet to replace mpi
* test=develop
```
  be528bf2
20 12月, 2019 1 次提交

add table id in cache shuffle (#21585) · c3cf42d0

由 Thunderbrook 提交于 12月 20, 2019

* general table

* add sparse table
test=develop

* no cvm
test=develop

* add no_cvm
test=develop

* add note
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* add key of optimizer
test=develop

* solve pslib stop core
test=develop

* barrier
test=develop

* add notes
test=develop

* add table id in cache shuffle
test=develop

* table id
test=develop

* code style
test=develop

c3cf42d0

10 12月, 2019 1 次提交
- X
  fix code style of fleet_wrapper (#21639) · c05706fe
  由 xujiaqi01 提交于 12月 10, 2019
```
* fix code style of fleet_wrapper
* test=develop
```
  c05706fe
25 11月, 2019 1 次提交
- T
  print table stat info for pslib (#21296) · 9a7832f8
  由 Thunderbrook 提交于 11月 25, 2019
```
* print table stat
test=develop

* notes
test=develop

* notes
test=develop
```
  9a7832f8
21 11月, 2019 1 次提交

solve pslib core in stop worker (#21263) · 0d17c1b8

由 Thunderbrook 提交于 11月 21, 2019

* general table

* add sparse table
test=develop

* no cvm
test=develop

* add no_cvm
test=develop

* add note
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* add key of optimizer
test=develop

* solve pslib stop core
test=develop

* barrier
test=develop

* add notes
test=develop

0d17c1b8

20 11月, 2019 1 次提交

support general embedding params (#21217) · 349e82d6

由 Thunderbrook 提交于 11月 20, 2019

* general table

* add sparse table
test=develop

* no cvm
test=develop

* add no_cvm
test=develop

* add note
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* code style
test=develop

* add key of optimizer
test=develop

349e82d6

15 11月, 2019 2 次提交

X
fix cache table bug, add save_paddle_inference_model, fix hdfs util bug (#21052) · 23876de5
由 xujiaqi01 提交于 11月 15, 2019
```
* fix cache table bug
* add save_paddle_inference_model
* fix hdfs util bug
* test=develop
```
23876de5

add copy table (#21086) · 9e045170

由 xujiaqi01 提交于 11月 15, 2019

* copy some feasigns and corresponding embeddings from one sparse table to another
* copy all feasigns and corresponding embeddings from one sparse table to another
* copy all dense params from one table to another
* copy some local vars to other local vars

9e045170

25 10月, 2019 1 次提交

fix several sparse table issuses (#20686) · 48669aa8

由 xujiaqi01 提交于 10月 25, 2019

* no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto.
* add find_distributed_lookup_table_grads instead of hard code GRAD
* support embedding stop gradient. push sparse has error before fix this.* 
* fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this.
* fix pull sparse, skip slots which do not have embedding.
* fix collect feasign label info, skip slots which do not have embedding.
* support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables.
* test=develop

48669aa8

24 9月, 2019 1 次提交

support change shuffle and train thread num (#19841) · cedc0477

由 xujiaqi01 提交于 9月 24, 2019

* support change shuffle thread num
* support change train thread num
* fix receive shuffle data of each channel
* data norm stop gradient
* add check thread_tensor type and root_tensor type when merge metric
* remove sleep in shuffle, add config
* add config of pslib client to client communication
* fix xbox str
* add data norm op testcase
* add flush in trainer finalize

cedc0477

17 9月, 2019 1 次提交
- X
  support preload thread, optimize hdfs log, fix master+patch bug (#19695) · 6bf298bf
  由 xujiaqi01 提交于 9月 17, 2019
```
* support preload thread
* sleep before fleet wrapper exit for pslib core dump
* optimize hdfs log
* fix master+patch bug
```
  6bf298bf
31 8月, 2019 1 次提交

Paddlebox Framework (#18982) · c756b5d2

由 hutuxian 提交于 8月 31, 2019

* Support looking up embeddings from BoxPS.
* Add a _pull_box_sparse op, for now this op is not exposed to users.
* Add a BoxHelper class, providing 'BeginPass', 'EndPass', 'FeedPass' functions and so on.
* Add 'BoxPSDataset' in python code.
* Add a compile options WITH_BOX_PS and a MACRO PADDLE_WITH_BOX_PS.
* Add UT.
* More concrete information pls refer to: https://github.com/PaddlePaddle/Paddle/pull/18982

c756b5d2

14 8月, 2019 1 次提交

add get_last_save_xbox_base/get_last_save_xbox (#19122) · b104ea06

由 jiaqi 提交于 8月 14, 2019

* add get_last_save_xbox_base/get_last_save_xbox
* fix fleet_util bug of load paddle model
* add doc string in fleet api

b104ea06

11 8月, 2019 1 次提交

add save cache model api in fleet& add slots shuffle in dataset module & add... · 9150cf50

由 yaoxuefeng 提交于 8月 11, 2019

add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871)

* add ctr related metric layer test=develop

* add save cache and slots shuffle test=develop

* add save cache and slots shuffle test=develop

* fix error

* fix error

* fix style for ci

* fix for comments

* change SlotsShuffle input to std::strinf for generality

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix stylr

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* fix style

* change non-const reference to pointer

* fix style

* fix style

* fix style test=develop

* fix style  test=develop

* add return ins num in ctr metric op

* change dtype to float in metric_op.py

* fix error test=develop

* fix style test=develop

* fix API spec

* fix API spec

* fix API spec test=develop

* add UT test=develop

9150cf50

29 7月, 2019 1 次提交

add clear_model interface in fleetwrapper (#18815) · 52c1431e

由 Thunderbrook 提交于 7月 29, 2019

* dump slot

* test

* proto

* dump slot

* test

* proto

* code style

* code style

* code style

* style

* add delete after unseen days

* add unseen days

* code style

* conflict solve
test=develop

* add clear model

* code style
test=develop

* code style
test=develop

52c1431e

25 7月, 2019 1 次提交

Fix shrink-dense and add scale-datanorm (#18746) · c167a4b4

由 fuyinno4 提交于 7月 25, 2019

Fix FleetWrapper:
1. fix shrink dense: just scale show
2. add datanorm scale: divide datanorm's gradient by batch_size

c167a4b4

24 7月, 2019 1 次提交

add slot to sparse table (#18686) · d8396281

由 Thunderbrook 提交于 7月 24, 2019

The change includes 2 things:

1. save delta model and shrink table are control by the same parameter before, now add delete_after_unseen_days to control shrink table.
2. value in sparse table has no slot before, now add slot in sparse table, and add DownpureCtrAccessor to support the new meta.
test=develop

d8396281

23 7月, 2019 1 次提交

support patch data, add load_one_table, fix bug (#18509) · d18aabb4

由 jiaqi 提交于 7月 23, 2019

（1）support patch data （merge slots of instances of same line id, modify dense layer which
changes its size）
（2）add fleet load_one_table interface, support load from paddle model and load from pslib model
（3）fix push sparse bug which cause push sparse cost more time（about 10% in my testcase）
（4）when some slots are not in one of your network (join/update, etc.)，data feed、collect label info、push/pull sparse will skip these slots， instead of throw error.
（5）add more debug info in TrainFilesWithProfiler

d18aabb4

15 5月, 2019 1 次提交

add save/load model, shrink table, cvm, config file & fix pull dense bug (#17118) · 66d51206

由 jiaqi 提交于 5月 15, 2019

* add save/load model, shrink table, cvm, config file & fix pull dense bug
test=develop

* fix global shuffle bug, fix pull dense bug, fix release memeory bug, fix shrink error
add client flush, add get data size
test=develop

* fix global shuffle bug
test=develop

* fix global shuffle bug
test=develop

* fix code style
test=develop

* fix code style & modify pslib cmake
test=develop

* fix error of _role_maker
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix code style
test=develop

* fix windows compile error of fleet
test=develop

* fix global shuffle bug

* add comment
test=develop

* update pslib.cmake
test=develop

* fix fill sparse bug
test=develop

* fix push sparse bug
test=develop

66d51206

22 4月, 2019 1 次提交
- W
  fix nccl wrapper on windows · 51a0243a
  由 wopeizl 提交于 4月 22, 2019
```
test=develop
```
  51a0243a
17 4月, 2019 2 次提交
- D
  
  fix GPU compile error problem · 2ab2869c
  由 dongdaxiang 提交于 4月 17, 2019
  
  2ab2869c
- D
  add pybind dependency · 466d177d
  由 dongdaxiang 提交于 4月 16, 2019
```
test=develop
```
  466d177d
16 4月, 2019 1 次提交
- D
  
  add nccl wrapper for python API · b0911390
  由 dongdaxiang 提交于 4月 16, 2019
  
  b0911390
15 4月, 2019 1 次提交
- D
  
  add nccl_wrapper · fff795e5
  由 dongdaxiang 提交于 4月 15, 2019
  
  fff795e5
04 4月, 2019 1 次提交
- X
  fix runtime error · 5e513928
  由 xjqbest 提交于 4月 04, 2019
```
test=develop
```
  5e513928
30 3月, 2019 3 次提交
- D
  fix fleet code style · 718ea6db
  由 dongdaxiang 提交于 3月 30, 2019
```
test=develop
```
  718ea6db
- X
  add some doc · 782ab2e2
  由 xjqbest 提交于 3月 30, 2019
```
test=develop
```
  782ab2e2
- X
  fix client to client communication bug · a99c8d0c
  由 xjqbest 提交于 3月 30, 2019
```
test=develop
```
  a99c8d0c
29 3月, 2019 9 次提交
- D
  fix pull sparse slow problem · 98dda08a
  由 dongdaxiang 提交于 3月 29, 2019
```
test=develop
```
  98dda08a
- D
  fix fleet_wrapper compile on windows · 2708108a
  由 dongdaxiang 提交于 3月 27, 2019
```
test=develop
```
  2708108a
- D
  remove local random engine in fleet with rand_r() · d4514949
  由 dongdaxiang 提交于 3月 26, 2019
```
test=develop
```
  d4514949
- D
  
  fix code style · a0b59773
  由 dongdaxiang 提交于 3月 23, 2019
  
  a0b59773
- D
  support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper,... · 365be5d5
  由 dongdaxiang 提交于 3月 23, 2019
```
support win32 flag in io.cc shell.cc, fix code style problem in fleet_wrapper, fix lodtensor_printer_test problem
test=develop
```
  365be5d5
- X
  
  fix bug of gen_worker_desc and set_filelist, add some doc · b7940c29
  由 xjqbest 提交于 3月 22, 2019
  
  b7940c29
- X
  
  add some doc · a34fe624
  由 xjqbest 提交于 3月 20, 2019
  
  a34fe624
- X
  
  fix runtime error · f5c6a14b
  由 xujiaqi01 提交于 3月 20, 2019
  
  f5c6a14b
- X
  
  support multi dataset && add init model && fix bug · a5b1a0e1
  由 xujiaqi01 提交于 3月 20, 2019
  
  a5b1a0e1

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致