提交 · c7b32fe1bdb3819ce1eb76affd28462d1201cd0c · 机器未来 / Paddle

25 2月, 2021 1 次提交
- L
  Add cublas_handle() to expose cublas_handle to ops (#31157) (#31190) · c7b32fe1
  由 liu zhengxi 提交于 2月 25, 2021
```
* add get_cublas_handle() api

* update format

* add unittests

* alter function name
```
  c7b32fe1
23 2月, 2021 1 次提交
- Z
  [cherry-pick] Fix softmax cross entropy integer overflow. (#30590) (#31134) · 30a2e7f0
  由 Zhong Hui 提交于 2月 23, 2021
```
[BUG FIX] Fix softmax cross entropy overflow problem.
```
  30a2e7f0
20 1月, 2021 1 次提交
- A
  [cherry-pick]Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732) (#30612) · fd9d6fda
  由 AshburnLee 提交于 1月 20, 2021
```
* Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732)

* Fixed an error

* Fixed an error
```
  fd9d6fda
28 12月, 2020 1 次提交

[Cherry-pick] Cherry-pick of PR#29579 and PR#29617 (#29904) · 63939597

由 Huihuang Zheng 提交于 12月 28, 2020

* [Dy2stat] Enable jit.save to Save Without Running (#29579)

Enable jit.save to Save Without Running.

* Modify CublasHandleHolder to Fix Random Unittest Failure. test=develop (#29617)

Modify CublasHandleHolder from using PADDLE_ENFORCE_CUDA_SUCCESS to PADDLE_RETRY_CUDA_SUCCESS to fix random unittest failure. We checked that the unittest log showed CUDA allocation error at this file, which may due to GPU not enough. We fixed similar failure in the past, so we applied PADDLE_RETRY_CUDA_SUCCESS here.

63939597

11 7月, 2020 1 次提交

Fix index overflow bug of the CUDA kernel loop increment (#25435) · 0b54d54f

由 Chen Weihang 提交于 7月 11, 2020

* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop

* replace old macro & for condition, test=develop

* polish details, test=develop

0b54d54f

20 4月, 2020 1 次提交

Optimize the error messages of paddle CUDA API (#23816) · 78170037

由 Zhou Wei 提交于 4月 20, 2020

* Optimize the error messages of paddle CUDA API, test=develop

* fix the error messages of paddle CUDA API, test=develop

* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop

* remove build_ex_string,test=develop

* merge conflict,test=develop

78170037

30 12月, 2019 1 次提交
- C
  
  Add error message for cublas inItizalize failed (#21995) · 35ff1568
  由 Chen Weihang 提交于 12月 30, 2019
  
  35ff1568
18 11月, 2019 1 次提交

Fix warn of gcc8 (#21205) · cdb3d279

由 Zeng Jinle 提交于 11月 18, 2019

* fix warnings oof gcc 8 compilation, test=develop

* fix boost::bad_get, test=develop

* refine PADDLE_ENFORCE, test=develop

cdb3d279

03 9月, 2019 1 次提交
- T
  refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) · 75d15719
  由 Tao Luo 提交于 9月 03, 2019
```
test=develop
```
  75d15719
08 1月, 2019 2 次提交
- S
  Revert "Revert "Remove op handle lock"" · ed409ac9
  由 sneaxiy 提交于 1月 08, 2019
```
test=develop
```
  ed409ac9
- Z
  Revert "Remove op handle lock" · dacfaaa9
  由 Zeng Jinle 提交于 1月 08, 2019
```
test=develop
```
  dacfaaa9
02 1月, 2019 1 次提交
- S
  remove_op_handle_lock · d0a8a1e9
  由 sneaxiy 提交于 1月 02, 2019
```
test=develop
```
  d0a8a1e9
30 4月, 2018 1 次提交
- D
  Feature/cuda9 cudnn7 (#10140) · eb6f9dd5
  由 dzhwinter 提交于 4月 30, 2018
```
* "re-commit "

* "picked up"

* "fix ci"

* "fix pdb hang up issue in cuda 9"
```
  eb6f9dd5
10 4月, 2018 2 次提交
- Y
  
  Make cuda_helper.h Pass cpplint · 40e3fe17
  由 Yu Yang 提交于 4月 10, 2018
  
  40e3fe17
- C
  Move reduceSum to elementwise_op_function.h (#9773) · b1224da8
  由 chengduo 提交于 4月 10, 2018
```
* add cuda_device_functions.h

* move reduceSum to elementwise_op_function.h
```
  b1224da8
28 2月, 2018 1 次提交
- C
  
  Add todo for reduceSum · 90dc33b5
  由 chengduoZH 提交于 2月 28, 2018
  
  90dc33b5
26 2月, 2018 1 次提交
- C
  
  refine Sum · b8938b44
  由 chengduoZH 提交于 2月 24, 2018
  
  b8938b44
24 2月, 2018 1 次提交
- C
  
  follow comments · a8288392
  由 chengduoZH 提交于 2月 24, 2018
  
  a8288392
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 1 次提交
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
23 11月, 2017 1 次提交
- Y
  Feature/support int64 for sum (#5832) · c077a6d5
  由 Yu Yang 提交于 11月 23, 2017
```
* Support int64 for sum op

* Refine code
```
  c077a6d5
18 9月, 2017 1 次提交
- 武
  Refine accuracy_op CUDA kernel (#4097) · 8580dce3
  由武毅提交于 9月 18, 2017
```
* refind accuracy_op

* follow comments

* follow comments
```
  8580dce3
23 8月, 2017 1 次提交
- D
  
  Remove set functor and add comapre_grad test · f188e22b
  由 dangqingqing 提交于 8月 23, 2017
  
  f188e22b
22 8月, 2017 2 次提交
- D
  
  fix cuda_helper.h · 9bc1a1a1
  由 dangqingqing 提交于 8月 22, 2017
  
  9bc1a1a1
- D
  lookup table op, cuda helper and set functor · 0f3b9e41
  由 dangqingqing 提交于 8月 22, 2017
```
1. finish lookup table CPU and GPU kernel
2. Add some cuda helper
3. Add some math funtor
```
  0f3b9e41

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致