Commits · 2ffacdebc2cf0917807094c79580aacf95f16869 · 机器未来 / Paddle

Feb 26, 2019 · 1 commit
- Update ngraph version to v0.14 test=develop · 2ffacdeb
  Committed by baojun-nervana on Feb 25, 2019
Feb 25, 2019 · 2 commits

[MKL-DNN] MKL-DNN specific Tensor modification (#15429) · dec9cf53

Committed by Jacek Czaja on Feb 25, 2019

* - Implemented draft of primitive desc keeping in Tensor

test=develop

- TransposeMKLDNNHandler::AcquireSrcMemory was reimplemented

- Added nchw and nc formats setting for the sake of compatibility

Fixed unit tests

- Workaround for problem with 5D data in conv

- Added 3D and 1D MKL-DNN formats for name handles for tensor

test=develop

- Fix to UTs

test=develop

- Conv fp32 op was updated

Cosmetic fixes

test=develop

- tensor mkldnn cosmetics

test=develop

- Moved most of mkl-dnn specific code from Tensor to mkl-dnn utils

* - Lint fixes

test=develop

* - Setting prim desc in Tensor also sets layout to kMKLDNN

test=develop

* - Moved creation of prim desc totally out of Tensor

test=develop

* - Cosmetic fixes after review

test=develop

polish · 5dd281f7
Committed by Xin Pan on Feb 25, 2019
```
test=develop
```

Feb 24, 2019 · 2 commits
- use kernel size in global_pooling. test=develop · 373cfb0c
  Committed by dengkaipeng on Feb 24, 2019
- fix spelling mistakes. test=develop · 60305196
  Committed by dengkaipeng on Feb 24, 2019
Feb 22, 2019 · 11 commits
- fix spelling error. test=develop · 14df92fe
  Committed by dengkaipeng on Feb 22, 2019
- fix adaptive_pool and yolov3_loss. test=develop · 144016fc
  Committed by dengkaipeng on Feb 22, 2019
- Change *(smart_ptr.get()) -> *smart_ptr · 74672d1a
  Committed by Sylwester Fraczek on Feb 07, 2019
```
reason: dereferencing smart pointer is the same as the underlying pointer
test=develop
```
- Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
  Committed by tensor-tang on Feb 22, 2019
```
* Revert "Optimize Gelu with MKL Erf function (#15770)"

This reverts commit 676995c8.

* test=develop
```
- \frac -> \frac. test=develop · eb65b4e4
  Committed by dengkaipeng on Feb 22, 2019
- add blank after math::. test=develop · 8167588f
  Committed by dengkaipeng on Feb 22, 2019
- use math:: instead of 29. test=develop · d9ec6058
  Committed by dengkaipeng on Feb 22, 2019
- fix adaptive pool doc. test=develop · 19292ac6
  Committed by dengkaipeng on Feb 22, 2019
- Initialize the benchmark tester for operator. (#15772) · 7d96c74a
  Committed by Yiqun Liu on Feb 22, 2019
```
* Initialize the benchmark tester for operator.
test=develop

* Rearrange the code.
test=develop
```
- Optimize Gelu with MKL Erf function (#15770) · 676995c8
  Committed by Yihua Xu on Feb 22, 2019
```
* Optimize for gelu operator

* Set up the low accuracy mode of MKL ERF function.

test=develop

* Only enable MKLML ERF when OS is linux

* Use the special mklml version that includes the vmsErf function to verify gelu mkl kernel.

test=develop

* Add the CUDA macro to avoid NVCC's compile issue.

test=develop

* Add the TODO comments for mklml library modification.

test=develop

* Clean Code

test=develop

* Add a comment on the macro for the NVCC compiler.

test=develop
```
- Auto-cmake generator, auto-fill map (#15402) · 5d132ecf
  Committed by mozga-intel on Feb 22, 2019
```
test=develop
```
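Note on commit 74672d1a above (`*(smart_ptr.get())` -> `*smart_ptr`): the stated reason is that dereferencing a smart pointer yields the same object as dereferencing its underlying raw pointer. A minimal C++ sketch of that equivalence is given here; `DereferencesSameObject` and `CheckDereferenceEquivalence` are hypothetical names chosen for illustration and are not taken from the Paddle code base.

```cpp
#include <cassert>
#include <memory>

// Hypothetical helper (not from Paddle): true when both spellings of the
// dereference refer to the same object. Requires a non-null smart pointer.
template <typename SmartPtr>
bool DereferencesSameObject(const SmartPtr& ptr) {
  return &*(ptr.get()) == &*ptr;
}

// Exercises the helper with the two standard owning smart pointers.
void CheckDereferenceEquivalence() {
  std::unique_ptr<int> unique_value(new int(42));
  std::shared_ptr<int> shared_value = std::make_shared<int>(7);
  assert(DereferencesSameObject(unique_value));
  assert(DereferencesSameObject(shared_value));
  *unique_value = 1;                   // write through operator*
  assert(*(unique_value.get()) == 1);  // read back through the raw pointer
}
```

Both forms assume the pointer is non-null; the shorter `*smart_ptr` spelling changes readability only, not behavior.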
Feb 21, 2019 · 3 commits

Add new ut and remove unnecessary code · 1578c60b
Committed by Krzysztof Binias on Feb 21, 2019
```
test=develop
```

add per kernel config and remove const_cast. · 5eb87506
Committed by Xin Pan on Feb 21, 2019
```
test=develop
```

Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704

Committed by Dun on Feb 21, 2019

* refine profiler && add runtime tracer

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* fix bug && test=develop

* add thread id map && test=develop

* test=develop

* testing

* bug fix

* remove cuda event && refine code && test=develop

* test=develop

* test=develop

* test=develop

* fix windows temp file && test=develop

* test=develop

* fix windows bug && test=develop

* fix start up issue && test=develop

* code polish &&  test=develop

* remove unused code && test=develop

* add some cupti cbid && test=develop

* add FLAGS_multiple_of_cupti_buffer_size && test=develop

* fix compile error && test=develop

* add keyword && test=develop

* fix && test=develop

* code polish && test=develop

Feb 20, 2019 · 2 commits
- Enable momentum operator for a ngraph engine (#15673) · 13ec2d33
  Committed by mozga-intel on Feb 20, 2019
```
* Enable momentum operator for a ngraph engine
test=develop

* Update tests
test=develop

* Unnecessary line of the code as intended was removed
test=develop
```
- remove non-ascii character · eb7bc3e7
  Committed by xuezhong on Feb 20, 2019
```
test=develop
```
Feb 19, 2019 · 6 commits

fix warnings (#15790) · e1c707fe
Committed by tensor-tang on Feb 19, 2019
```
* fix warnings

test=develop

* fix enforce test

test=develop
```

update comment · f2262d73
Committed by xuezhong on Feb 19, 2019
```
test=develop
```

refine code · c5360a3f
Committed by xuezhong on Feb 19, 2019

Enable cross_entropy operator for a ngraph engine (#15674) · df23a6f8

Committed by mozga-intel on Feb 19, 2019

* Enable cross_entropy operator for a ngraph engine
test=develop

* Update tests
test=develop

* Added PADDLE_ENFORCE for the batch_norm operator
test=develop

* Update the message about which formats are currently supported
test=develop

Correct the doc in Python API (#15725) · 56a5039e

Committed by Yiqun Liu on Feb 19, 2019

* Correct the comment in control_flow.py.

* Correct the argument list of ops.
test=develop

* Update API.spec.
test=develop

* Skip op_callstack attr for all op apis.
test=develop

* Remove use_mkldnn and is_test from python api.
test=develop

* Remove use_mkldnn and is_test from op_proto_maker and hard-code them in python when generating doc string.
test=develop

Add ngraph op coverage (#15721) · 72061b0a
Committed by baojun on Feb 18, 2019

Feb 18, 2019 · 5 commits
- Add JIT CRF_decoding and Layer_norm unit-test (#15699) · 685a20ef
  Committed by Yihua Xu on Feb 18, 2019
```
* Add the CRFDecoding and LayerNorm's test case

test=develop

* Fix the size checking issue

test=develop

* Remove the remnant code

test=develop

* Add TestAllImpls and double support

test=develop

* Clean Code

test=develop

* Add benchmark test for LayerNorm & CRFDecoding

test=develop
```
- fix when table width larger than 64 · 75fc792d
  Committed by tensor-tang on Feb 18, 2019
```
test=develop
```
- add emb seqpool jitcode · 40402d5e
  Committed by tensor-tang on Feb 15, 2019
```
test=develop
```
- fix shape api doc · 3ce12b1b
  Committed by chengduozh on Feb 18, 2019
```
test=develop
```
- inplace group_norm (#15754) · 5e6834d8
  Committed by Dun on Feb 18, 2019
```
* inplace group

* test=develop
```
Feb 15, 2019 · 3 commits
- Stricter check for load_combine_op. (#15479) · e4b9fcdb
  Committed by Dun on Feb 15, 2019
```
* fix && test=develop

* fix && test=develop

* test=develop
```
- Fix debug mode in prior_box_op (#15702) · 48a5cccb
  Committed by qingqing01 on Feb 15, 2019
```
* Fix debug mode in prior_box_op
* Refine code
```
- Fix row_conv doc · 28682325
  Committed by Dang Qingqing on Feb 15, 2019
```
test=develop
```
Feb 14, 2019 · 5 commits
- add embseqpool jitkernel mkl impl and use it · a3a3d3d8
  Committed by tensor-tang on Feb 14, 2019
```
test=develop
```
- add embseqpool jitkernel refer code, test and benchmark · 15da2f9a
  Committed by tensor-tang on Feb 13, 2019
```
test=develop
```
- Fix debug mode in fake_quantize_op (#15693) · abcefe72
  Committed by qingqing01 on Feb 14, 2019
```
* Fix debug mode in fake_quantize_op
* Remove template specialization
```
- fix lstmp bug; test=develop · 029be5fd
  Committed by liuhongyu on Feb 14, 2019
- set lstm lstmp unused pointer to nullptr; test=develop · 393fa602
  Committed by liuhongyu on Feb 14, 2019
