Commits · 128adf53cb4517f2a4f123044c1ffffd6a3fa74d · PaddlePaddle / Paddle

15 Mar, 2018 · 1 commit

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

Committed by dzhwinter on Mar 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision'

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"


12 Feb, 2018 · 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
10 Feb, 2018 · 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
26 Dec, 2017 · 1 commit
- Change softmax · 874cac0c
  Committed by fengjiayi on Dec 26, 2017
12 Dec, 2017 · 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`


11 Nov, 2017 · 2 commits
- Fix bug. · 5f217099
  Committed by dangqingqing on Nov 11, 2017
- Use G++ to compile some cu operators. · f5e36765
  Committed by dangqingqing on Nov 11, 2017
09 Nov, 2017 · 1 commit
- remove header file paddle/framework/eigen.h · cceed081
  Committed by dangqingqing on Nov 09, 2017
08 Nov, 2017 · 1 commit
- Remove fill_constant_batch_size_like_op.h and clean some operator codes. · e5791dd1
  Committed by dangqingqing on Nov 08, 2017
29 Sep, 2017 · 1 commit
- refine SoftmaxFunctor · 84ff7e97
  Committed by qijun on Sep 28, 2017
28 Sep, 2017 · 2 commits
- Add SoftmaxGradFunctor, and use SoftmaxGradFunctor in softmax_op instead. · 05ed8ee8
  Committed by Liu Yiqun on Sep 28, 2017
- Add Skeleton of Double support · 3a5693e0
  Committed by Yu Yang on Sep 27, 2017
26 Sep, 2017 · 1 commit
- fix implementations of supporting soft labels. · 8b8ad6b1
  Committed by caoying03 on Sep 25, 2017
22 Sep, 2017 · 1 commit
- support soft labels. · f1d5fb3b
  Committed by caoying03 on Sep 21, 2017
15 Sep, 2017 · 1 commit
- finish implementation and fix unittest. · efa4526c
  Committed by caoying03 on Sep 13, 2017
13 Sep, 2017 · 1 commit
- softmax as functor. · c6366c81
  Committed by caoying03 on Sep 12, 2017
12 Sep, 2017 · 1 commit
- softmax as function. · c0cef849
  Committed by caoying03 on Sep 12, 2017
07 Sep, 2017 · 1 commit
- rename input and output of softmax_op. · 5b4526fa
  Committed by caoying03 on Sep 07, 2017
06 Sep, 2017 · 1 commit
- refine softmax operator. · 7d16fe87
  Committed by caoying03 on Sep 06, 2017
08 Aug, 2017 · 1 commit
- clang format · cf924728
  Committed by dongzhihong on Aug 08, 2017
07 Aug, 2017 · 2 commits
- "remove type alias header file" · bd369c35
  Committed by dongzhihong on Aug 07, 2017
- "remove a lot alias" · 610801b5
  Committed by dongzhihong on Aug 07, 2017
05 Aug, 2017 · 1 commit
- Reformat paddle/operators/* strictly following Google Style Guide · 9620df44
  Committed by Yi Wang on Aug 04, 2017
04 Aug, 2017 · 1 commit
- Move constants from framework::OperatorBase to framework:: · ddb29b6c
  Committed by Yi Wang on Aug 03, 2017
03 Aug, 2017 · 1 commit

Softmax grad op (#3164) · d953611e

Committed by Qiao Longfei on Aug 03, 2017

* init softmax grad op

* add compute code

* export Backward to python

* update test ,export op.type to python

* update python test, fix compute bug

* update unit test

* use eigen

* optimize eigen code

* add gpu test

* register softmax_grad GPU kernel and fix test bug

* typo

* follow comments


02 Aug, 2017 · 1 commit
- Return Reference Instead Pointer to GetEigenDevice · 02655a22
  Committed by Yu Yang on Aug 02, 2017
01 Aug, 2017 · 1 commit

use operator context and infer context (#3024) · 61ebacbc

Committed by Qiao Longfei on Aug 01, 2017

* use operator context

* optimize code

* update net infershape

* update InferShape

* disable override InferShape(scope) in OperatorBase

* change InferShapeImpl to InferShape

* add template to OperatorContext Input/Output

* merge Input InputVar, Output OutputVar

* change Inputs to MultiInput

* fix conflict

* fix MultiInput bugs and add unit test

* rename KernelContext to ExecutionContext

* clean code

* change InferShape to protected

* fix template bug

* refine code

* use InputVar instead of Input<Variable>

* typo

* optimize code


25 Jul, 2017 · 1 commit
- Add type_alias to import framework into ops · efc119b4
  Committed by Yu Yang on Jul 25, 2017
```
Make implement an operator less noisy.
```
19 Jul, 2017 · 2 commits
- replace Tensor::tensor to EigenTensor::From · 736d078c
  Committed by qijun on Jul 19, 2017
- fix gpu build error · 14cfb8c2
  Committed by qijun on Jul 19, 2017
18 Jul, 2017 · 1 commit
- implement some basic OpKernel · b6c07552
  Committed by qijun on Jul 18, 2017
17 Jul, 2017 · 2 commits
- Merge develop · 73a9f0f2
  Committed by Yu Yang on Jul 17, 2017
- Add skeletons of `mul`, `rowwise_add`, `sigmoid`, `softmax` ops · 1ed237c1
  Committed by Yu Yang on Jul 17, 2017
```
* Implement InferShape and register them, give a stub Kernel method
  by LOG(INFO)
```
11 Jul, 2017 · 2 commits
- "support net_proto header" · 18e65b0c
  Committed by dongzhihong on Jul 11, 2017
- "move opContext to DeviceContext" · bc021d77
  Committed by dongzhihong on Jul 11, 2017
06 Jul, 2017 · 2 commits
- FIX: explicit construct pool element · a669bf48
  Committed by liaogang on Jul 06, 2017
- ENH: add memory unit test · 74691789
  Committed by liaogang on Jul 06, 2017
05 Jul, 2017 · 1 commit
- FIX: Buddy Allocator Free with Merge feature · ada1c20b
  Committed by liaogang on Jul 05, 2017
04 Jul, 2017 · 1 commit
- ENH: Add paddle_memory for external usage · 4dc3c9e0
  Committed by liaogang on Jul 04, 2017

PaddlePaddle / Paddle · synced successfully 11 months ago
