Commits · 866851903c36943c76de0b438a8b6baff738114f · Crayon鑫 / Paddle

09 Jul, 2021 1 commit

Use CBLAS for SelectedRows elementwise add operation. (#34008) · 1412d3bc

Committed by arlesniak on Jul 09, 2021

* Use CBLAS for SelectedRows elementwise add operation. It's faster.

* template compilation fix

* reverted template compilation fix

* slimmed template compilation fix

Co-authored-by: Adam Osewski <adam.osewski@intel.com>

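To make the commit above concrete, here is a minimal sketch of how a sparse set of rows can be accumulated into a dense matrix with one AXPY call (`y += alpha * x`) per row. It is not Paddle's implementation: `SparseRows`, `Axpy`, and `AddSparseToDense` are hypothetical names introduced for illustration, and the plain loop in `Axpy` stands in for the `cblas_saxpy` call that a CBLAS-backed build would make.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a SelectedRows value: rows[i] is the index of the
// dense row that the i-th block of `width` values in `value` belongs to.
struct SparseRows {
  std::vector<int64_t> rows;
  std::vector<float> value;  // rows.size() * width elements, row-major
  std::size_t width;
};

// Reference AXPY: y[i] += alpha * x[i] for i in [0, n).
// A CBLAS build would call cblas_saxpy(n, alpha, x, 1, y, 1) here instead.
inline void Axpy(std::size_t n, float alpha, const float* x, float* y) {
  for (std::size_t i = 0; i < n; ++i) y[i] += alpha * x[i];
}

// Adds every sparse row into the matching row of a dense row-major matrix.
// Duplicate row indices are allowed and simply accumulate.
inline void AddSparseToDense(const SparseRows& in, std::vector<float>* dense) {
  assert(in.value.size() == in.rows.size() * in.width);
  for (std::size_t i = 0; i < in.rows.size(); ++i) {
    const std::size_t row = static_cast<std::size_t>(in.rows[i]);
    assert((row + 1) * in.width <= dense->size());
    Axpy(in.width, 1.0f, in.value.data() + i * in.width,
         dense->data() + row * in.width);
  }
}
```

Routing each row through a single AXPY call is what lets an optimized BLAS library vectorize the inner loop, which is the likely source of the speed-up the commit message mentions.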

21 Jun, 2021 1 commit

Add AXPY oneDNN handler (#33632) · 773aabc7

Committed by lidanqing on Jun 21, 2021

* Add oneDNN AXPY handler.

* Add fallback for small tensors.

* Fix ifdefs

* Remove unnecessary namespace prefixes and add missing headers.

* Guard handler_axpy with proper ifdefs.

* Compilation of this function is possible only when Paddle is built with neither CUDA nor HIP.

* Move AXPY handler code to separate files.

* Use oneDNN AXPY handler in SGD op.

* Use axpy handler only when Paddle is built with oneDNN.

* Add test for SUM BF16 with big rows.

* Fix SFINAE rules for elementwise_add_to.

* Add test case for SGD with big rows.

* update

* update

Co-authored-by: Adam Osewski <adam.osewski@intel.com>


26 May, 2021 1 commit

modify matmul Op to complex template types (#33130) · 6c07cd7e

Committed by chentianyu03 on May 26, 2021

* modify matmul Op to complex template types

* remove complex64/128 head file

06 May, 2021 1 commit

Sum kernel for CPU supporting BF16 and SelectedRows (#32631) · 9599c3b3

Committed by Adam Osewski on May 06, 2021

04 Feb, 2021 1 commit

use iwyu clean include second time, test=develop (#30829) · 35c5b23f

Committed by wanghuancoder on Feb 04, 2021

* use iwyu clean include second time, test=develop

25 Dec, 2020 1 commit

[Complex] Add support for complex grad accumulated (#29889) · 1a304e6c

Committed by Chen Weihang on Dec 25, 2020

* add support for complex grad accumulated

* add unittest for coverage

* update test dtype

* remove useless blank line


10 Sep, 2020 1 commit

update error info for selected_rows_functor · 50e60e87

Committed by Steffy-zxf on Sep 10, 2020

update error info for selected_rows_functor

11 May, 2020 1 commit

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

Committed by Chen Weihang on May 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop


30 Oct, 2019 1 commit

fix select_rows mergeadd bug, test=develop (#20876) · d4289125

Committed by zhang wenhui on Oct 30, 2019

05 Sep, 2019 1 commit

fix the diff between async mode and async_half mode (#19535) · 2f037c31

Committed by 123malin on Sep 05, 2019

* test=develop, communicator merge add => merge average

12 Apr, 2019 1 commit

optimize merge add if input rows of all selected rows is not duplicated · 920a9609

Committed by Qiao Longfei on Apr 12, 2019

09 Jan, 2019 1 commit

follow comment test=develop · c3b9edf9

Committed by Qiao Longfei on Jan 09, 2019

28 Dec, 2018 1 commit

sum op support empty selected rows as input · 25d44d40

Committed by Qiao Longfei on Dec 28, 2018

14 Dec, 2018 2 commits

Add sorted_result parameter to SelectedRows Functor · 5fea8cd4

Committed by minqiyang on Dec 14, 2018

test=develop

Remove BinarySearch from Adam Op · da796dfe

Committed by minqiyang on Dec 14, 2018

test=develop

26 Nov, 2018 1 commit

Revert the changes of VLOG · 53433d7f

Committed by minqiyang on Nov 26, 2018

test=develop

14 Nov, 2018 1 commit

fix some compiler warning · e0d4e04b

Committed by Tao Luo on Nov 14, 2018

test=develop

08 Nov, 2018 1 commit

Change the origin VLOG level to 10 times · 0c3227a5

Committed by minqiyang on Nov 08, 2018

Fix code to support cpplint syntax check

test=develop

27 Oct, 2018 2 commits

optimize code · 96d55009

Committed by Qiao Longfei on Oct 27, 2018

sum op handle empty input · dd78b5df

Committed by Qiao Longfei on Oct 27, 2018

17 Oct, 2018 1 commit

change elementwise_add to elementwise_add_to test=develop · 02259575

Committed by Qiao Longfei on Oct 17, 2018

15 Oct, 2018 5 commits

code optimize · 936926aa

Committed by Qiao Longfei on Oct 15, 2018

test=develop

clean code · c52ccbc1

Committed by Qiao Longfei on Oct 15, 2018

optimize blas call · 6056d043

Committed by Qiao Longfei on Oct 15, 2018

optimize code · 5db75513

Committed by Qiao Longfei on Oct 15, 2018

change map to unordered_map · d5c64af2

Committed by Qiao Longfei on Oct 15, 2018

12 Oct, 2018 1 commit

Polish code · 3f6ec900

Committed by minqiyang on Oct 12, 2018

test=develop

11 Oct, 2018 2 commits

Accelerate SelectedRows Functors: · 8ec748cf

Committed by minqiyang on Oct 11, 2018

1. Accelerate SelectedRows MergeAdd functor

2. Add SelectedRowsSumTo functor to support MergeAdd multiple SelectedRows into one

test=develop
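
The idea behind merging several sparse inputs into one can be sketched as follows. This is an illustrative approximation, not the functor from the Paddle source: `SparseRows` and `MergeAdd` here are hypothetical, and the hash-map lookup is just one plausible way to collapse duplicate row indices while skipping empty inputs.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a SelectedRows value (row ids plus row-major data).
struct SparseRows {
  std::vector<int64_t> rows;
  std::vector<float> value;  // rows.size() * width elements
  std::size_t width;
};

// Merges any number of inputs into one output in which every row id appears
// exactly once; values that share a row id are summed. Output rows keep the
// order in which each id was first seen.
inline SparseRows MergeAdd(const std::vector<const SparseRows*>& inputs,
                           std::size_t width) {
  SparseRows out{{}, {}, width};
  std::unordered_map<int64_t, std::size_t> slot;  // row id -> index in out.rows
  for (const SparseRows* in : inputs) {
    if (in == nullptr || in->rows.empty()) continue;  // tolerate empty inputs
    assert(in->width == width);
    for (std::size_t i = 0; i < in->rows.size(); ++i) {
      const int64_t id = in->rows[i];
      std::size_t pos;
      auto it = slot.find(id);
      if (it == slot.end()) {
        pos = out.rows.size();
        slot.emplace(id, pos);
        out.rows.push_back(id);
        out.value.resize(out.value.size() + width, 0.0f);
      } else {
        pos = it->second;
      }
      for (std::size_t j = 0; j < width; ++j) {
        out.value[pos * width + j] += in->value[i * width + j];
      }
    }
  }
  return out;
}
```

For example, merging an input with rows `{3, 1, 3}` and a second input with row `{1}` yields an output with rows `{3, 1}`, each holding the sum of the contributions for that id.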

optimize code · 38568519

Committed by Qiao Longfei on Oct 11, 2018

08 Oct, 2018 1 commit

selected rows merge add support multi input · 40d3bd4e

Committed by qiaolongfei on Oct 08, 2018

18 Sep, 2018 1 commit

fix adam · b6f61faf

Committed by sneaxiy on Sep 18, 2018

28 Apr, 2018 1 commit

Fix more CPPlint issues in fluid/operators/math (#10249) · e7353596

Committed by Abhinav Arora on Apr 27, 2018

* Fix CPPLint errors

* Fix CPPLint errors in sequence2batch

* Fix compilation

* Fix LSTM op and GRU op

* Fix LSTMP op

* Fix more cpplint errors in operators/math

* Address Code review feedback


12 Feb, 2018 1 commit

Fix the grammar in copyright. (#8403) · 24509f4a

Committed by qingqing01 on Feb 12, 2018

10 Feb, 2018 2 commits

Correct #include path · fc374821

Committed by Yi Wang on Feb 09, 2018

Move file to fluid/; Edit CMakeLists.txt · 90648f33

Committed by Yi Wang on Feb 09, 2018

08 Feb, 2018 1 commit

Rewrite mixed_vector.h · ef1aba39

Committed by Yu Yang on Feb 08, 2018

29 Dec, 2017 2 commits

scatter optimizers · 1039c1e3

Committed by typhoonzero on Dec 29, 2017

wip · 641b4c0f

Committed by typhoonzero on Dec 29, 2017

27 Dec, 2017 1 commit

WIP: adding generic scattor functors · d48a0e4e

Committed by typhoonzero on Dec 27, 2017

12 Dec, 2017 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

The main changes are:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

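The first item in the list above, taking a device context type rather than a place as the template parameter of a math functor, can be illustrated with a small sketch. Everything here is hypothetical and simplified: `CPUDeviceContext` and `ScaleFunctor` are stand-ins written for this example and do not reproduce Paddle's actual class definitions.

```cpp
#include <cassert>
#include <vector>

// Hypothetical, heavily simplified device context. A real one would own
// device handles, streams, or thread pools.
struct CPUDeviceContext {
  int num_threads = 1;
};

// Primary template: a math functor selected by the device context type.
// Only the specializations below are defined.
template <typename DeviceContext, typename T>
struct ScaleFunctor;

// CPU specialization. The functor receives the context object itself, so it
// can reach device resources directly instead of looking them up by place.
template <typename T>
struct ScaleFunctor<CPUDeviceContext, T> {
  void operator()(const CPUDeviceContext& ctx, T alpha,
                  std::vector<T>* x) const {
    (void)ctx;  // unused in this single-threaded sketch
    for (T& v : *x) v *= alpha;
  }
};
```

A caller would then write something like `ScaleFunctor<CPUDeviceContext, float>()(ctx, 3.0f, &x);`, and a GPU build would add a second specialization keyed on its own context type.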
