提交 · 1c10dac4f2903524f801061ef4eb684c4ad636d3 · PaddlePaddle / Paddle

07 5月, 2019 1 次提交

由 zhaoyuchen2018 提交于 5月 07, 2019

* optimize sum op

fuse multi eigen kernel calls into one cuda kernel.
refine code

test=develop.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine code according to comments.

test=develop

* refine code

delete sum_op_gpu.h
test=develop

* Fix test error.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code in format.

test=develop.

* refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

32b62c25

19 12月, 2018 1 次提交
- S
  rewrite variable type · ae6f46a1
  由 sneaxiy 提交于 12月 19, 2018
```
test=develop
```
  ae6f46a1
26 11月, 2018 1 次提交

Fix save and load lookup table/optimizer vars (#14301) · 3639d99f

由 tangwei12 提交于 11月 26, 2018

*  fix mkdir conflict

*  fix load/save lookup tables

 test=develop

* add lookup_table_utils

* fix load optimize vars on pserver

* delete lookup table utils

* fix save and load lookup tables

* fix load optimizer var

* fix load optimizer var, test=develop

* fix python 3 style, test=develop

* move lookup_table_utils to contrib utils

3639d99f

07 11月, 2018 1 次提交

Add fp16 backward support (#14202) · a9b5d42d

由 chengduo 提交于 11月 07, 2018

* add fp16 backward support
test=develop

* add sum_op fp16 test

* disable test_dist_save_load
test=develop

* add check_grad for sum

* add unit test for softmax_grad fp16
test=develop

* add scale_op unit test

* add mul_grad_op unit test for fp16

* add cross_entropy_grad and eman_grad unit test for fp16
test=develop

* fix cross_entropy unit test

* add pool2d fp16 unit test

* refine conv2d fp16 unit test
test=develop

* refine activation unit test
test=develop

* fix ci
test=develop

* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
test=develop

a9b5d42d

28 10月, 2018 1 次提交
- Q
  
  sum selected rows check empty · 72aef6b1
  由 Qiao Longfei 提交于 10月 28, 2018
  
  72aef6b1
27 10月, 2018 3 次提交
- Q
  optimize code · 575f2271
  由 Qiao Longfei 提交于 10月 27, 2018
```
test=develop
```
  575f2271
- Q
  
  optimize code · 96d55009
  由 Qiao Longfei 提交于 10月 27, 2018
  
  96d55009
- Q
  
  sum op handle empty input · dd78b5df
  由 Qiao Longfei 提交于 10月 27, 2018
  
  dd78b5df
18 10月, 2018 1 次提交
- Q
  
  Small changes for sum_op to avoid zero setting. (#13923) · 5dbb2e99
  由 qingqing01 提交于 10月 18, 2018
  
  5dbb2e99
17 10月, 2018 1 次提交
- Q
  
  sum_op support inplace · bd2b6d7f
  由 Qiao Longfei 提交于 10月 17, 2018
  
  bd2b6d7f
08 10月, 2018 1 次提交
- Q
  
  update test_sum_op · 1a598800
  由 qiaolongfei 提交于 10月 08, 2018
  
  1a598800
27 9月, 2018 1 次提交

Add distributed unit tests about text_classification/simnet-bow/ctr (#12812) · 97cf1eb6

由 tangwei12 提交于 9月 27, 2018

* add dist ut for text_classification

* add dist ut for text_classification

* add simnet bow unittest

* add dist ut for simnet bow

* add trainning data url for simnet bow

* add trainning data url for simnet bow

* modify simnet test_reader to train reader

* add test_dist_ctr

* test_dist_ctr can run now

* dense update is good

* add unit test for selected rows

* debug unit test

* fix dist sparse update problem

* Constant args at init

* optimize code

* simnet optimize

* fix DebugStringEx

* optimize sum_op.h

* add ScaleOpVarTypeInference

* clean code

* fix test_dist_transpiler.py

* code optimize

* modify delta

* fix sparse update bug

* dist test use one cpu

* update some data

* remove unused code

* add use cuda config

* unit test fix

* unit test fix

* unit test fix

* unit test fix

* dist_word2vec use CPU

* unit test fix

* unit test fix

* code clean

* code clean

* merge develop

* api spec update

* Revert: api spec update

* replace simnet data with fake

* replace simnet data with fake

* update dim

* add batch auc

* code clean

* code clean

* modify print to stderr

* update simnet delta -> 1e-5

* update RUN_STEP

* add use_reader_alloc

* add use_reader_alloc

* add use_reader_alloc

* modify delta

* add use_reader_alloc

* fix stderr write

* python3 compatibility

test=develop

* python3 compatibility, test=develop

* Update dist_text_classification.py

* test=develop

97cf1eb6

20 9月, 2018 2 次提交
- Y
  Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod"" · 6d2c6f96
  由 Yu Yang 提交于 9月 20, 2018
```
This reverts commit a6c8d6b9.
```
  6d2c6f96
- Y
  Revert "Merge pull request #13431 from chengduoZH/refine_lod" · a6c8d6b9
  由 Yu Yang 提交于 9月 20, 2018
```
This reverts commit bd79e046, reversing
changes made to 6b4d290c.
```
  a6c8d6b9
17 9月, 2018 1 次提交
- C
  
  refine · cdb9605b
  由 chengduoZH 提交于 9月 17, 2018
  
  cdb9605b
18 8月, 2018 1 次提交
- T
  
  bug fix when all inputs are empty · b4f52b01
  由 tangwei12 提交于 8月 18, 2018
  
  b4f52b01
17 8月, 2018 2 次提交
- T
  
  code fix · ac9ae970
  由 tangwei12 提交于 8月 17, 2018
  
  ac9ae970
- D
  Revert ""cherry picked operators changes" (#12184)" (#12747) · 4069262f
  由 dzhwinter 提交于 8月 17, 2018
```
This reverts commit bf3c3496.
```
  4069262f
16 8月, 2018 2 次提交
- T
  
  remove assignment and add vlog · 26b228e4
  由 tangwei12 提交于 8月 16, 2018
  
  26b228e4
- D
  "cherry picked operators changes" (#12184) · bf3c3496
  由 dzhwinter 提交于 8月 16, 2018
```
* "cherry picked operators changes"

* "remove duplicated code"

* "add constant setter"

* "add get expected kernel"

* "fix ci"

* "add fill constant"
```
  bf3c3496
01 8月, 2018 1 次提交
- T
  
  sum_op selectedRows dim bug fix · 766ac488
  由 tangwei12 提交于 8月 01, 2018
  
  766ac488
31 7月, 2018 1 次提交
- T
  
  sum_op selectedRows dim bug fix · c4c8f60b
  由 tangwei12 提交于 7月 31, 2018
  
  c4c8f60b
09 4月, 2018 1 次提交
- A
  
  Fix CPPLint issues in spp_op, sum_op, topk_op, transpose_op, unpool_op and warpctc_op · 981d7d01
  由 Abhinav Arora 提交于 4月 08, 2018
  
  981d7d01
09 3月, 2018 1 次提交
- Y
  Fix sparse update memory error for distributed training (#8837) · 84680379
  由 Yancey 提交于 3月 09, 2018
```
Fix sparse update memory error for distributed training
```
  84680379
15 2月, 2018 1 次提交

Update tensor_util.h (#8422) · cfffb1a3

由 Yi Wang 提交于 2月 14, 2018

* Update tensor_util.h

* Update with moved TensorDesc

* Fix tensur_utils.cu

* Update

* Update

* Update

* Update

* Make tensor_util.cu a symbolic link

cfffb1a3

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
11 2月, 2018 1 次提交
- Y
  Merge selected rows with dynamic variable count (#8023) · caf9a09d
  由 Yancey 提交于 2月 11, 2018
```
* dynamic send/recv selected rows

* update by comment

* fix by comment
```
  caf9a09d
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
30 1月, 2018 1 次提交
- Y
  Test word2vec for parallel.do · 7c0cc113
  由 Yang Yu 提交于 1月 30, 2018
```
* Polish sum_op support SelectedRows in_place
```
  7c0cc113
09 1月, 2018 2 次提交
- Y
  Test dist word2vec (#7334) · e249ad12
  由 Yancey 提交于 1月 09, 2018
```
* test dist word2vec

* multiple trainers work
```
  e249ad12
- Y
  Rename CopyFrom to Copy for tensors (#7292) · ce6dad3b
  由 Yu Yang 提交于 1月 09, 2018
```
* Rename Tensor::CopyFrom to Tensor::Copy

* Fix CI

* Fix compile
```
  ce6dad3b
28 12月, 2017 2 次提交
- Y
  
  Update · 96bc3352
  由 Yang Yu 提交于 12月 28, 2017
  
  96bc3352
- Y
  
  Refine · f5c2d175
  由 Yang Yu 提交于 12月 28, 2017
  
  f5c2d175
26 12月, 2017 2 次提交
- Y
  
  Revert debug code · 87288850
  由 Yang Yu 提交于 12月 26, 2017
  
  87288850
- Y
  
  Update code · 82a22d32
  由 Yang Yu 提交于 12月 26, 2017
  
  82a22d32
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

04 12月, 2017 1 次提交

While op forward for sentimental analysis (#6140) · d5e32794

由 Yu Yang 提交于 12月 04, 2017

* Add DataFeeder

A v2 API like data feeder for book demos.
We can feed data directly from reader.

* Fix CI

* Add an unittest for while/rnn op forward

* Add unittest for raw while op backward

* Fix CI

d5e32794

01 12月, 2017 1 次提交
- Y
  Fix the proformance problem of enforce (#6085) · 8ac02279
  由 Yu Yang 提交于 12月 01, 2017
```
* Fix Proformance problem of enforce

* Fix missing `;` in code

* Fix CI
```
  8ac02279
26 11月, 2017 1 次提交

Feature/copytensor (#5455) · 45062fe5

由 dzhwinter 提交于 11月 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功