Commits · 3c375751f8a8983257ea7f7e6086ab3a5fb555e0 · Crayon鑫 / Paddle

20 Apr, 2019 · 1 commit

Support seq len equal to 0 in sequence ops (#16935) · 3c375751

Committed by Yibing Liu on Apr 20, 2019

* Support seq len equal to 0 in sequence ops

test=develop

* Add more test cases

* Fix some comments

test=develop

* Fix py3 error

test=develop


08 Mar, 2019 · 1 commit

simplify the jitkernel templates and tests · 14a764c9

Committed by tensor-tang on Mar 08, 2019

test=develop

07 Mar, 2019 · 1 commit

unify the kernelfuncs cache and add unit test · 802f362a

Committed by tensor-tang on Mar 07, 2019

test=develop

20 Dec, 2018 · 1 commit

fix enum style · 1aaec571

Committed by tensor-tang on Dec 20, 2018

test=develop

18 Dec, 2018 · 1 commit

fix build · 6648995f

Committed by tensor-tang on Dec 17, 2018

17 Dec, 2018 · 1 commit

enable crf decoding and layer norm refer code · 720b55cb

Committed by tensor-tang on Dec 17, 2018

26 Oct, 2018 · 1 commit

add crf decode jit kernel · 21487d78

Committed by tensor-tang on Oct 23, 2018

20 Aug, 2018 · 1 commit

Optimize CRF Decoding with AVX/AVX2/AVX512F instruction (#12767) · 084d4a9e

Committed by Yihua Xu on Aug 20, 2018

* Optimize CRF decoding with AVX/AVX2 instruction

* Enable the AVX2 flags for compiling

* Clean the code and decrease the count of multiply calculation

* Add the support of AVX512 instruction to optimize CRF Decoding

* Clean the code

* Enable the AVX512f flags for compiling

* Clean the code for the invaluable switch

* Fixed the issue to check AVX512F status

* Clean the code

* Add some explanation of the key points


11 Apr, 2018 · 1 commit

Fix cpplint errors (#9800) · cea39121

Committed by Siddharth Goyal on Apr 10, 2018

12 Feb, 2018 · 1 commit

Fix the grammar in copyright. (#8403) · 24509f4a

Committed by qingqing01 on Feb 12, 2018

10 Feb, 2018 · 2 commits

Correct #include path · fc374821

Committed by Yi Wang on Feb 09, 2018

Move file to fluid/; Edit CMakeLists.txt · 90648f33

Committed by Yi Wang on Feb 09, 2018

08 Jan, 2018 · 1 commit

cpu gpu transform function (#7191) · 0f353ab4

Committed by Qiao Longfei on Jan 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean


12 Dec, 2017 · 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

The main changes are as follows:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`


05 Dec, 2017 · 1 commit

add crf_decoding layer (#6274) · 45c8a88a

Committed by Qiao Longfei on Dec 05, 2017

* add crf_decoding layer

* fix some typo

* fix test_crf_decoding_op

04 Nov, 2017 · 1 commit

Add the crf_decoding operator. (#5352) · 45eabb8c

Committed by Cao Ying on Nov 03, 2017

* proj init.

* add unittest and implementation.
