提交 · de003ceea78a114f0cd6dac5c7d8b4283f96d530 · BaiXuePrincess / Paddle

18 1月, 2021 1 次提交

[cherry-pick] improve perfomance of cast and tril op (#30498) · de003cee

由 Zhang Ting 提交于 1月 18, 2021

* add fp16 support for tril_triu op (#30186)

* add VecCastCUDAKernel (#30296)
Co-authored-by: Nfurnace <34057289+windstamp@users.noreply.github.com>

de003cee

05 4月, 2020 1 次提交
- W
  add tril op and triu op (#23469) · c4d03052
  由 WuHaobo 提交于 4月 05, 2020
```
add tril op and  triu op
```
  c4d03052
24 10月, 2019 1 次提交

All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756) · 5a8d885d

由 Zhang Ting 提交于 10月 24, 2019

* All elements in attr(shape) of crop_tensor can be -1, test=develop, test=document_preview

* fix the bug that attr(offsets) should be initialized, test=develop

5a8d885d

20 9月, 2019 1 次提交

add crop_tensor_op, test=develop, test=document_preview (#19314) · b3888941

由 Zhang Ting 提交于 9月 20, 2019

add crop_tensor op. The main difference with crop is :

1. If the argument shape is a list, each element is an integer or a tensor variable with shape: [1]. This way is suitable for the case that the shape may be changed each iteration.

2. If the argument shape is a variable. Its rank must be 1. In crop op, the rank of shape must be the same as x

offsets can be a list, in which each element is an integer or a tensor variavle with shape: [1].

b3888941

03 4月, 2019 1 次提交

Add Pixel shuffle OP (#15782) · 229dc932

由 ruri 提交于 4月 03, 2019

* add pixel_shuffle op

* add pixel_shuffle op, test=develop

* rewrite code, test=develop

* delete useless comment, test=develop

* Refine pixel_shuffle_op and unit testing

* refine code,test=develop

* refine .cu,test=develop

* fix unittest,test=develop

* Fix unit testing
test=develop

* resolve conflict, test=develop

* fix test, test=develop

* fix API, test=develop

* fix test datatype bug,test=develop

* polish comments,test=develop

* add API,test=develop

* test=develop

* Add Pixel_Shuffle OP,test=develop

* support python3,test=develop

* add include memory to travis CI bug,test=develop

229dc932

21 3月, 2019 2 次提交
- P
  
  fix time; test=develop · 5dc9b519
  由 phlrain 提交于 3月 21, 2019
  
  5dc9b519
- P
  
  add elementwise floordiv, mod; test=develop · 56c2d384
  由 phlrain 提交于 3月 21, 2019
  
  56c2d384
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

03 11月, 2017 1 次提交
- W
  
  fix doc and code style · 34d68f24
  由 wwhu 提交于 11月 03, 2017
  
  34d68f24
02 11月, 2017 1 次提交
- W
  
  add cliy_by_norm op · 65451b5c
  由 wwhu 提交于 11月 02, 2017
  
  65451b5c
13 10月, 2017 1 次提交

Adding the Adam Optimizer operator (#4733) · 11680037

由 Abhinav Arora 提交于 10月 12, 2017

* add adam op

moment1_out = beta1 * moment1 + (1 − beta1) * grad
moment2_out = beta2 * moment2 + (1 − beta2) * grad * grad
moment1_hat =  moment1_out / (1 - beta1^t)
moment2_hat =  moment2_out / (1 - beta2^t)
param_out = param - learning_rate * moment1_hat / (sqrt(moment2_hat) +
epsilon)

* fix moment 2

* Adding the Adam optimization operator

* Adding more tests for Adam op

11680037

07 8月, 2017 1 次提交
- D
  
  "remove alias to more operators" · 6b23b91c
  由 dongzhihong 提交于 8月 07, 2017
  
  6b23b91c
04 8月, 2017 1 次提交
- L
  
  Add cpplint for *.h and cuda *.cu · b58725bd
  由 liaogang 提交于 8月 04, 2017
  
  b58725bd
31 7月, 2017 1 次提交
- Q
  
  add EIGEN_USE_GPU macro to op.cu file · 61f94f00
  由 qijun 提交于 7月 31, 2017
  
  61f94f00
25 7月, 2017 1 次提交
- Y
  Add type_alias to import framework into ops · efc119b4
  由 Yu Yang 提交于 7月 25, 2017
```
Make implement an operator less noisy.
```
  efc119b4
19 7月, 2017 1 次提交
- Q
  Add sgd op (#2950) · e3b27d19
  由 Qiao Longfei 提交于 7月 19, 2017
```
* a simplest SGD op
```
  e3b27d19

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致