提交 · bc9f9f43733b25f5d7650f191f15de0454618813 · Crayon鑫 / Paddle

25 11月, 2021 1 次提交

Added GradTensorHolder to Eager Dygraph (#37458) · bc9f9f43

由 Zhanlue Yang 提交于 11月 25, 2021

* Added GradTensorHolder to Eager Dygraph

* Added accumulation codes to Eager Dygraph

* Fix windows-ci issue

* Fix NPU-CI issue

* Fixed CI-Coverage issue

bc9f9f43

03 2月, 2021 1 次提交
- W
  
  【kunlun】dygraph supports multi xpu card training (#30671) · b1026f64
  由 WangXi 提交于 2月 03, 2021
  
  b1026f64
14 10月, 2020 1 次提交
- W
  
  xpu support for fill_constant Op (#27675) · c5fcc96d
  由 wangchaochaohu 提交于 10月 14, 2020
  
  c5fcc96d
17 9月, 2020 1 次提交
- J
  enhance reduce op which can reduce tensor with arbitrary rank · 63203c4a
  由 Jack Zhou 提交于 9月 17, 2020
```
enhance reduce op which can reduce tensor with arbitrary rank 
```
  63203c4a
03 6月, 2020 1 次提交

Support gradient accumulation of fp16 in imperative mode (#24823) · b67ded04

由 Leo Chen 提交于 6月 03, 2020

* support gradient accumulation of fp16 in imperative mode, test=develop

* enhance coverage test, test=develop

* follow comments, test=develop

b67ded04

30 9月, 2018 1 次提交

"fix compile error" (#13579) · 26771f41

由 dzhwinter 提交于 9月 30, 2018

* "fix compile error"

* "fix ci"

* rerun ci
test=develop

* test=develop

rerun ci

26771f41

03 9月, 2018 1 次提交
- D
  
  squash commit · 379b471e
  由 dzhwinter 提交于 9月 03, 2018
  
  379b471e
25 8月, 2018 1 次提交
- D
  
  operators module (#12938) · eca4563e
  由 dzhwinter 提交于 8月 25, 2018
  
  eca4563e
24 8月, 2018 3 次提交
- D
  
  fix math_function compile · a94d4f51
  由 dzhwinter 提交于 8月 24, 2018
  
  a94d4f51
- D
  
  pre-commit · c1ad52f7
  由 dzhwinter 提交于 8月 24, 2018
  
  c1ad52f7
- D
  
  windows port · 34f8c9b6
  由 dzhwinter 提交于 8月 24, 2018
  
  34f8c9b6
05 7月, 2018 1 次提交
- D
  
  "remove lapack" (#11966) · 99a99ec7
  由 dzhwinter 提交于 7月 05, 2018
  
  99a99ec7
20 6月, 2018 1 次提交
- T
  
  enable dynamic load mklml lib on fluid · f503f129
  由 tensor-tang 提交于 6月 20, 2018
  
  f503f129
21 5月, 2018 1 次提交
- L
  Add an interface to set the number of threads for math function, and set the... · 39eb871d
  由 Liu Yiqun 提交于 5月 21, 2018
```
Add an interface to set the number of threads for math function, and set the default value to 1 for inference.
```
  39eb871d
04 5月, 2018 1 次提交
- Y
  
  Clean and extract blas · ef6ea790
  由 Yu Yang 提交于 5月 04, 2018
  
  ef6ea790
03 5月, 2018 1 次提交
- Y
  
  Clean MatMul · 815d8884
  由 Yu Yang 提交于 5月 03, 2018
  
  815d8884
28 4月, 2018 1 次提交
- Y
  
  Refactor GEMM in blas · c888e016
  由 Yu Yang 提交于 4月 28, 2018
  
  c888e016
25 4月, 2018 1 次提交
- Y
  Fix batch_gemm bugs · 2a06e307
  由 Yu Yang 提交于 4月 25, 2018
```
stride should be int64_t, not int
```
  2a06e307
08 3月, 2018 1 次提交
- L
  
  remove PADDLE_USE_ATLAS · bc0cfb22
  由 Luo Tao 提交于 3月 08, 2018
  
  bc0cfb22
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
03 2月, 2018 1 次提交
- C
  
  Add layer norm [GPU] · 76e188e5
  由 chengduoZH 提交于 2月 02, 2018
  
  76e188e5
25 12月, 2017 1 次提交
- Y
  
  make forward work · fb9c08f0
  由 Yancey1989 提交于 12月 25, 2017
  
  fb9c08f0
12 12月, 2017 2 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

T

unify MKL macro definition · 69b44f2f
由 tensor-tang 提交于 12月 12, 2017

69b44f2f

27 11月, 2017 1 次提交
- Y
  
  implement forward · 1abd3b3a
  由 Yancey1989 提交于 11月 27, 2017
  
  1abd3b3a
26 11月, 2017 1 次提交

Feature/copytensor (#5455) · 45062fe5

由 dzhwinter 提交于 11月 26, 2017

* "make global tensor function independently"

* "replace functor"

* "fix inline template error"

* "fix tensor array with CopyFrom"

* "fix other case use CopyFrom"

* "move the op interface hardly"

* "fix operators"

* "fix typo"

* "delete dynamic recurrent rnn and fix gru_unit in debugmode"

* "fix unique_ptr copy"

* "fix cuda copy"

* "fix namespace error"

* "removed nccl python test"

* "fix include error"

* "fix typo"

* fix copy util test

45062fe5

14 11月, 2017 1 次提交
- D
  
  Move RowwiseAdd functor to math_funcion and Add ColwiseSum functor. · 26736576
  由 dangqingqing 提交于 11月 14, 2017
  
  26736576
11 11月, 2017 1 次提交
- D
  
  Use G++ to compile some cu operators. · f5e36765
  由 dangqingqing 提交于 11月 11, 2017
  
  f5e36765
09 11月, 2017 1 次提交
- L
  
  remove PADDLE_USE_MKL · 7835d493
  由 Luo Tao 提交于 11月 09, 2017
  
  7835d493
08 11月, 2017 1 次提交
- Y
  
  Add `op::math::set_constant` without template · aadb0981
  由 Yu Yang 提交于 11月 07, 2017
  
  aadb0981
26 10月, 2017 1 次提交
- D
  
  Add gradient check unit testing and fix bug. · cd382866
  由 dangqingqing 提交于 10月 26, 2017
  
  cd382866
18 10月, 2017 1 次提交

MatMul operator (#4856) · 16489827

由 Markus Kliegl 提交于 10月 17, 2017

* initial matmul operator

Similar to np.matmul, but also has transpose_X and transpose_Y flags,
and only supports tensors from rank 1 to 3 inclusive.

For GPU, uses cublas?gemmStridedBatched. For CPU, uses
cblas_?gemm_batch if available via MKL; otherwise a simple serial
implementation that loops over the batch dimension is employed for now.

16489827

16 10月, 2017 1 次提交
- Q
  
  remove SelectedRows functors to selected_rows_functor.h · ab5dc9fe
  由 qijun 提交于 10月 15, 2017
  
  ab5dc9fe
14 10月, 2017 2 次提交
- Q
  
  SelectedRowsAddTensor method · 931572e2
  由 qijun 提交于 10月 13, 2017
  
  931572e2
- Q
  
  add selected_rows add cpu functor · 5be10872
  由 qijun 提交于 10月 13, 2017
  
  5be10872
29 9月, 2017 1 次提交
- Q
  
  add SetConstant method in math_function.h · c634a848
  由 qijun 提交于 9月 28, 2017
  
  c634a848
21 9月, 2017 1 次提交
- G
  
  Add gemm with stride · 9ffa79cd
  由 guosheng 提交于 9月 20, 2017
  
  9ffa79cd
19 9月, 2017 1 次提交

Remove lazy-initialization in device_context · 81d56ca8

由 Yu Yang 提交于 9月 18, 2017

* Also use `const DeviceContext&` all the time, to prevent `const_cast`

Fix #4169
Fix #3468
Fix #3475

81d56ca8

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致