Commits · e4d475eabd83e7a6fa1e88c64c28747450f87d66 · 机器未来 / Paddle

06 Feb, 2022 1 commit
- [PTEN] Add Gpu context (#39305) · a821c4a9
  Committed by Wilber on Feb 06, 2022
24 Dec, 2021 1 commit

[pten] combine reduce_cuda codes (#38328) · 08941eda

Committed by chentianyu03 on Dec 24, 2021

* combine reduce_cuda codes

* support float16 in pten reduce_mean

* replace ReduceCudaKernel impl with pten reduce impl

* mv reduce funcs into reduce_cuda_impl

* rm unused codes and headers

* mv GetReduceDim into reduce_cuda_impl

* recover GetReduceDim in reduce_op.h

* add new dispatch macro

* fix pool op output not inited and cause transform to pten::denseTensor error

* fix output tensor not initialized error

* rename new dispatch macro and format code style

* rm reduce_functor_op.h file

16 Dec, 2021 1 commit
- Add the transformop parameter in TensorReduceFunctorImpl (#38135) · 524389ee
  Committed by niuliling123 on Dec 16, 2021
```
* Add the transformop parameter in TensorReduceFunctorImpl
```
03 Dec, 2021 1 commit
- refine structure for cuda and rocm (#37202) · a6d2fddb
  Committed by ronnywang on Dec 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
17 Nov, 2021 1 commit
- Modify reduce_op.op.h for xpu2 with kernel primitive api (#36904) · 9c5d5665
  Committed by niuliling123 on Nov 17, 2021
```
* Modify reduce_op.op.h for xpu2 with kernel primitive api
```
08 Sep, 2021 2 commits
- Modify the reduce op according to the kernel primitive api (#35282) · 82b33be3
  Committed by niuliling123 on Sep 08, 2021
- fix bug (#35482) · e133d8ef
  Committed by Guoxia Wang on Sep 08, 2021
16 Aug, 2021 1 commit
- support margin loss (arcface, cosface, sphereface) for single GPU and cross GPUs (#34247) · b0cb4148
  Committed by Guoxia Wang on Aug 16, 2021
```
* support margin loss (arcface, cosface, sphereface)
```

机器未来 / Paddle is in sync with the upstream project it was forked from
