提交 · 6bfc57215fbe0f9f876dfce45ca67ec10c4f7e2b · 机器未来 / Paddle

08 12月, 2020 1 次提交

[2.0 rc1/cherrypick] cherry-pick kunlun PR:29234/29229/29293/29367/29280/29448 (#29466) · 6bfc5721

由 liuyuhui 提交于 12月 08, 2020

* add deformable_conv op on xpu (#29234)

* rebase develop

* update deformable_conv op on xpu

* update deformable_conv op on xpu

* update kunlun conv2d/softmax/elementwise implemetation (#29229)

* update conv2d & softmax to new xpu api
* test=kunlun

* remove useless comments
* test=kunlun

* remote softmax xpu op
* test=kunlun

* update kunlun softmax
* test=kunlun

* update xpu unitest
* test=kunlun

* fix elementwise_grad bug for kunlun
*test=kunlun

* support global pooling for kunlun (#29293)

* test=kunlun

* update reduce_sum op on xpu (#29367)

* update reduce_sum op on xpu

* update reduce_sum op on xpu

* support running on xpu

* fix expand/uniform_random && concat/transpose to new api on xpu (#29280)

* fix expand && concat/transpose to new api

* update uniform_random_op

* update xpu_header

* 1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>
Co-authored-by: N卖鱼的哲学 <tangzhiyi11@users.noreply.github.com>
Co-authored-by: NQingshuChen <qingshu.chen714@gmail.com>
Co-authored-by: Ntaixiurong <taixiurong@126.com>
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>

6bfc5721

25 11月, 2020 1 次提交
- T
  
  add xpu elementwise ops (#29031) · a5aa4dc7
  由 taixiurong 提交于 11月 25, 2020
  
  a5aa4dc7
14 10月, 2020 1 次提交
- J
  Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast · c791df09
  由 Jack Zhou 提交于 10月 14, 2020
```
Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast
```
  c791df09
06 4月, 2020 1 次提交
- S
  
  fix conflict, test=develop (#23298) · c706ff20
  由 ShenLiang 提交于 4月 06, 2020
  
  c706ff20
19 12月, 2018 1 次提交
- H
  data_norm · 39f4e927
  由 heqiaozhi 提交于 12月 19, 2018
```
test=develop
```
  39f4e927
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
12 12月, 2017 2 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

S
Add row conv operator (#6013) · 4ff6bc17
由 Siddharth Goyal 提交于 12月 11, 2017
```
* Fix documentation

* Address review comments
```
4ff6bc17

11 10月, 2017 1 次提交

Conv Shift Operator (#4591) · a281b383

由 Markus Kliegl 提交于 10月 10, 2017

* conv_shift_op: initial implementation using Eigen

Limitations:
- both gradient outputs must be specified and are always computed
- explicit for loops => could be optimized in various ways
  (e.g., different memory layout)

* conv shift - gradient fixes

fix case when not all output gradients desired

* conv shift: minor cleanup

* conv shift - more minor cleanup

* conv shift: clean up & initial GPU implementation

* fix rebase issue

a281b383

09 8月, 2017 1 次提交
- Y
  Use ostream << operator to get to_string · a573dd4c
  由 Yu Yang 提交于 8月 09, 2017
```
* Make `PADDLE_ENFORCE_EQ` supports custom class, like DDim
```
  a573dd4c
01 8月, 2017 1 次提交
- Y
  
  Follow comments and merge develop · e2fd2bd0
  由 Yu Yang 提交于 8月 01, 2017
  
  e2fd2bd0
26 7月, 2017 2 次提交
- Y
  
  Update Interface · b1b13f8f
  由 Yu Yang 提交于 7月 26, 2017
  
  b1b13f8f
- Y
  
  Update Backward · ecf23ce5
  由 Yu Yang 提交于 7月 26, 2017
  
  ecf23ce5
25 7月, 2017 1 次提交
- L
  
  ENH: Refine Tensor and And CopyFrom · de8a8fee
  由 liaogang 提交于 7月 25, 2017
  
  de8a8fee
17 7月, 2017 2 次提交
- Y
  
  Refine CMake dependencies graph · 38310f93
  由 Yu Yang 提交于 7月 17, 2017
  
  38310f93
- Y
  Add enforce switch for convient develop (#2850) · cdec5634
  由 Yan Chunwei 提交于 7月 17, 2017
```
* add NDEBUG switch to PADDLE_ENFORCE
```
  cdec5634
11 7月, 2017 2 次提交
- D
  
  "support net_proto header" · 18e65b0c
  由 dongzhihong 提交于 7月 11, 2017
  
  18e65b0c
- D
  
  "move opContext to DeviceContext" · bc021d77
  由 dongzhihong 提交于 7月 11, 2017
  
  bc021d77
06 7月, 2017 2 次提交
- L
  
  FIX: explicit construct pool element · a669bf48
  由 liaogang 提交于 7月 06, 2017
  
  a669bf48
- L
  
  ENH: add memory unit test · 74691789
  由 liaogang 提交于 7月 06, 2017
  
  74691789
05 7月, 2017 1 次提交
- L
  
  FIX: Buddy Allocator Free with Merge feature · ada1c20b
  由 liaogang 提交于 7月 05, 2017
  
  ada1c20b
04 7月, 2017 4 次提交
- L
  
  ENH: Add paddle_memory for external usage · 4dc3c9e0
  由 liaogang 提交于 7月 04, 2017
  
  4dc3c9e0
- L
  
  ENH: Add buddy allocator Free · 0ba63475
  由 liaogang 提交于 7月 04, 2017
  
  0ba63475
- L
  
  ENH: code style · ff363894
  由 liaogang 提交于 7月 04, 2017
  
  ff363894
- L
  
  ENH: add buddy alloctor Free · 4e1617d0
  由 liaogang 提交于 7月 04, 2017
  
  4e1617d0
03 7月, 2017 1 次提交
- L
  ENH: Add Alloc for buddy Allocator · bbd3eab7
  由 liaogang 提交于 7月 03, 2017
```
* Free will be added soon
```
  bbd3eab7
28 6月, 2017 2 次提交
- L
  
  FIX: Pass CI · 3e9aa7fd
  由 liaogang 提交于 6月 28, 2017
  
  3e9aa7fd
- Y
  
  Add buddy_allocator.cc and system_allocator.cc · 3e087f76
  由 Yi Wang 提交于 6月 27, 2017
  
  3e087f76

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致