提交 · 2c687df042f456b2b16cb3d8519e9e54fdb68503 · BaiXuePrincess / Paddle

22 9月, 2022 1 次提交

Optimize topk's performance when k is small and input_width is large (#45312) · 2c687df0

由 carryyu 提交于 9月 22, 2022

* Optimize topk's performance when k is small and input_width is large

* 修改blockdim设置逻辑

* Update top_k_function_cuda.h

2c687df0

01 8月, 2022 1 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

26 6月, 2022 1 次提交
- S
  
  format all files in fluid using new config (#43776) · 576236a0
  由 Sing_chan 提交于 6月 26, 2022
  
  576236a0
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

04 3月, 2021 1 次提交
- Q
  
  [ROCM] update fluid platform for rocm (part5), test=develop (#31315) · 4d647ec1
  由 Qi Li 提交于 3月 04, 2021
  
  4d647ec1
14 10月, 2020 1 次提交
- C
  Polish some error message in opeators (#27876) · 4ba977c7
  由 Chen Weihang 提交于 10月 14, 2020
```
* polish some error message

* add white list

* revert shell script change
```
  4ba977c7
29 8月, 2020 1 次提交

Adadelta Optimizer (#26590) · a1b99fae

由 Jiawei Wang 提交于 8月 29, 2020

* add doc; notest

* fix doc; notest

* update doc; notest

* refine optimizer && adam

* refine optimizer; notest

* add adam

* fix doc

* fix doc && add adamw; notest

* add error message

* bug fix

* refine rmsprop && adamax

* fix ci

* buf fix

* update comment

* unify arguments place; notest

* fix ut, test=develop

* bug fix

* fix conflicts, test=develop

* add examples code

* bug fix

* fix comments

* fix sample code

* add sample code for Optimizer

* add adamax ut, test=develop

* fix rmsprop ut, test=develop

* add ut for optimizer.py and adamw.py

* first commit of adadelta optimizer

* fix learning rate

* fix adadelta doc and add sgd momentum

* remove unused fluid

* fix codestyle

* Update test_adam_op.py

* Update test_adam_op.py

* fix SGD in 2 unittests

* fix SGD in 2 unittests

* fix ci

* fix ut
Co-authored-by: NMRXLT <xlt2024@gmail.com>
Co-authored-by: Nmapingshuo <mps2012@yeah.net>

a1b99fae

25 8月, 2020 1 次提交
- W
  update the code for the topk v2 · 286eca2d
  由 wawltor 提交于 8月 25, 2020
```
add the top v2 for the paddlepaddle api 2.0
```
  286eca2d
17 2月, 2020 1 次提交

Add TopK Op Grad CPU&GPU Kernel test=develop (#22628) · 8f035fb6

由 Jiawei Wang 提交于 2月 17, 2020

* Add TopK Op Grad CPU&GPU Kernel test=develop

* Add TopK Op Grad, modify grad op maker test=develop

* Add TopK Op Grad, modify grad op maker test=develop

* Add TopK Op Grad, modify PADDLE_ENFORCE test=develop

* Add TopK Op Grad, modify PADDLE_THROW test=develop

* Add TopK Op Grad, modify unittest test=develop

* fix ngraph top k op unittest test=develop

8f035fb6

25 12月, 2019 1 次提交

add register op_data_type of pad/expand_as et.al (#21718) · 5cb2c741

由 Aurelius84 提交于 12月 25, 2019

* add register op_data_type test=develop

* fix register bug in isfinite op test=develop

* rm int int64_t in pad2d gradKernel  test=develop

5cb2c741

20 11月, 2019 1 次提交
- Z
  Fix topk compile failed on windows (#21243) · 3ff5cc2d
  由 zhaoyuchen2018 提交于 11月 20, 2019
```
* Fix topk compile failed on windows
* Use explicit cast for assign data
```
  3ff5cc2d
14 11月, 2019 1 次提交

Improve topk performance. (#21087) · b93870e6

由 zhaoyuchen2018 提交于 11月 13, 2019

* Improve topk performance.

give 200000 data to compute topk,
before opt: cost 1s
after opt: cost 0.0028s.

* Refine return value.
* Add cuda util funtions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

b93870e6

30 8月, 2019 1 次提交
- T
  remove unused assert.h (#19529) · 02270b3e
  由 Tao Luo 提交于 8月 30, 2019
```
test=develop
```
  02270b3e
26 12月, 2018 2 次提交

W
Make topk op support variable k. (#15044) · 2314f2eb
由 whs 提交于 12月 26, 2018
```
* Make topk op support variable k.
test=develop

* Fix tensor type.
test=develop
```
2314f2eb

Fp16 training (#14992) · 856f0da0

由 Wu Yi 提交于 12月 26, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

* make fp16 lr schedule simple test=develop

* fix ut test=develop

* fix tests test=develop

* remove fp16 learning rate cast test=develop

856f0da0

20 12月, 2018 2 次提交

T
Revert "[Feature] Fp16 training for resnet50 (#14850)" · da87f7a6
由 typhoonzero 提交于 12月 20, 2018
```
This reverts commit 3d750f9c.
```
da87f7a6

[Feature] Fp16 training for resnet50 (#14850) · 3d750f9c

由 Wu Yi 提交于 12月 20, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

3d750f9c

29 10月, 2018 1 次提交
- D
  
  cudnn version. staged. · c8adc2c6
  由 dzhwinter 提交于 10月 29, 2018
  
  c8adc2c6
26 10月, 2018 2 次提交
- D
  
  add cudnn back. staged. · 7141debe
  由 dzhwinter 提交于 10月 26, 2018
  
  7141debe
- D
  
  staged. test speed=49ms in 1080. · 09409bad
  由 dzhwinter 提交于 10月 26, 2018
  
  09409bad
24 10月, 2018 1 次提交

Fix top_k op (#14034) · c7379a73

由 qingqing01 提交于 10月 24, 2018

1. Fix CUDA kernel when height is large than 2048.
2. Support input with more than 2D.
3. Fix unit test when k is large than 1.
4. Enhence unit testing.

test=develop

c7379a73

08 10月, 2018 1 次提交
- Q
  
  Optimize Topk when height is large. (#13710) · 41e4f7ea
  由 qingqing01 提交于 10月 08, 2018
  
  41e4f7ea
17 8月, 2018 1 次提交
- D
  Revert ""cherry picked operators changes" (#12184)" (#12747) · 4069262f
  由 dzhwinter 提交于 8月 17, 2018
```
This reverts commit bf3c3496.
```
  4069262f
16 8月, 2018 1 次提交

"cherry picked operators changes" (#12184) · bf3c3496

由 dzhwinter 提交于 8月 16, 2018

* "cherry picked operators changes"

* "remove duplicated code"

* "add constant setter"

* "add get expected kernel"

* "fix ci"

* "add fill constant"

bf3c3496

04 5月, 2018 1 次提交
- C
  
  wrap_shfl_x_sync · d36af62c
  由 chengduoZH 提交于 5月 03, 2018
  
  d36af62c
03 5月, 2018 1 次提交

Fix/fp64 (#10346) · f63ff90b

由 dzhwinter 提交于 5月 03, 2018

* "fix double type error"

* "fix ci"

* "softmax fp64"

* "fix momentum"

* "fix ci"

f63ff90b

02 5月, 2018 1 次提交
- C
  
  replace __shfl with __shfl_sync · b8f7fa97
  由 chengduoZH 提交于 5月 02, 2018
  
  b8f7fa97
13 4月, 2018 1 次提交

Fix CPPLint errors in operators (#9828) · c241959e

由 Abhinav Arora 提交于 4月 12, 2018

* Fix CPPLint errors in operators

* Fix prior box op

* Fix Prior Box op

* Fix top_k_op.cu

* Fix pool mkmldnn

* Fix pool mkmldnn

c241959e

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
25 12月, 2017 1 次提交
- D
  
  GPUPlace to CUDAPlace (#6960) · 0d2235aa
  由 dzhwinter 提交于 12月 25, 2017
  
  0d2235aa
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

31 10月, 2017 1 次提交
- F
  Fix top k op GPU code (#5221) · d3cc7ac3
  由 fengjiayi 提交于 10月 30, 2017
```
* Fix Type error

* Fix error

* Fix top_k_op GPU code data type
```
  d3cc7ac3
28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
23 9月, 2017 1 次提交
- C
  
  fix cpu kernel with soft labels. · 6735585b
  由 caoying03 提交于 9月 22, 2017
  
  6735585b
07 9月, 2017 1 次提交

武

Add topk op (#3760) · 3fbb692d

由武毅提交于 9月 07, 2017

* init add

* add topk op

* someupdate

* fix style check

* add test py file

* update top k cuda kernel

* follow comments

* remove debug print

* fix casting error

* fix casting error

* fix casting error

* fix rename bug...

* fix travis

3fbb692d

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致