Commits · dc8cf36e4b88bca5571f351852804ff57b8e2731 · 机器未来 / Paddle

29 Mar, 2019 3 commits
- D
  add more example on datagenerator · dc8cf36e
  Committed by dongdaxiang on Mar 23, 2019
```
test=develop
```
  dc8cf36e
- D
  
  refine print fetch list · 6bf796df
  Committed by dongdaxiang on Mar 21, 2019
  
  6bf796df
- D
  
  add printer for fetch variable · cf136064
  Committed by dongdaxiang on Feb 18, 2019
  
  cf136064
28 Mar, 2019 4 commits

[MKL-DNN] Tensor modifications revert (#16462) · 26323274

Committed by Jacek Czaja on Mar 28, 2019

* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)"

This reverts commit 13816dd4.
Apart from enabling transformer for MKL-DNN

* Revert "- MKL-DNN pooling updated to set_prim_desc"

This reverts commit c63f6b20.

Conflicts:
	paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc

* Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)"

test=develop

This reverts commit dec9cf53.

* - concat compilation fix

- lint

test=develop

- Lint fixes

test=develop

- Lint fixes

test=develop

- Fix Transpose MKLDNN op

test=develop

26323274

S
fix travis ci · 5656fa9f
Committed by sneaxiy on Mar 28, 2019
```
test=develop
```
5656fa9f
Z
Revert "Fix allocator bug" · 174d0d0b
Committed by Zeng Jinle on Mar 28, 2019
```
add include headers to fix travis-ci
test=develop
```
174d0d0b
G

Add DGC(Deep Gradient Compression) interface. (#15841) · eb83abea
Committed by gongweibao on Mar 28, 2019

eb83abea
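
The commit message above only states that a DGC interface was added, so the following is a minimal sketch of the general idea behind Deep Gradient Compression, not this commit's implementation. It illustrates top-k gradient sparsification with local accumulation of the unsent remainder. The function name `sparsify_top_k` and its parameters are hypothetical, and details of the full method (such as momentum correction) are omitted.

```python
from typing import Dict, List, Tuple


def sparsify_top_k(
    grad: List[float], residual: List[float], k: int
) -> Tuple[Dict[int, float], List[float]]:
    """Keep the k largest-magnitude accumulated entries; defer the rest.

    Hypothetical helper for illustration only (not a Paddle API).
    """
    # Add what was held back in earlier steps to the fresh gradient.
    acc = [g + r for g, r in zip(grad, residual)]
    # Indices of the k entries with the largest absolute value.
    order = sorted(range(len(acc)), key=lambda i: abs(acc[i]), reverse=True)
    keep = set(order[:k])
    # Sparse update that would be communicated to other workers.
    sparse = {i: acc[i] for i in keep}
    # Entries that were not sent stay in the local residual.
    new_residual = [0.0 if i in keep else v for i, v in enumerate(acc)]
    return sparse, new_residual
```

Calling the helper repeatedly with the returned residual shows that small gradient entries are not dropped: they accumulate locally until they are large enough to be selected.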

25 Mar, 2019 1 commit
- N
  fix ci bug: cudnn handler in multi card · a1d11bb1
  Committed by nhzlx on Mar 25, 2019
```
test=develop
```
  a1d11bb1
21 Mar, 2019 2 commits
- S
  add more unittest · 953214ad
  Committed by sneaxiy on Mar 19, 2019
```
modify allocator strategy
remove changes of legacy buddy_allocator
test=develop
```
  953214ad
- W
  
  fix win gpu build test=develop (#16334) · b7baeed7
  Committed by Wu Yi on Mar 21, 2019
  
  b7baeed7
20 Mar, 2019 2 commits

N

git cherry-pick from feature/anakin-engine: update anakin subgraph #16278 · 07dcf285
Committed by nhzlx on Mar 20, 2019

07dcf285

Collective ops (#15572) · 6382b62f

Committed by Wu Yi on Mar 20, 2019

* wip allreduce in op

* wip

* wip

* wip

* wip adding test

* wip for conflict with mp mode

* fix tests test=develop

* fix cpu build test=develop

* fix travis clang format test=develop

* fix cpu build test=develop

* update api.spec test=develop

* delete comment test=develop

* fix cpplint test=develop

* fix test=develop

* follow comment test=develop

* add file test=develop

* fix build test=develop

* update test=develop

* to be compatible with sync_bn, and fix mp mode in develop test=develop

6382b62f

19 Mar, 2019 1 commit
- Z
  add allocator flags · 22715487
  Committed by zhhsplendid on Mar 19, 2019
```
test=develop
```
  22715487
16 Mar, 2019 1 commit
- Q
  Fix windows compiling (#16230) · 86e912c5
  Committed by qingqing01 on Mar 16, 2019
```
test=develop
```
  86e912c5
15 Mar, 2019 1 commit

Support sync batch norm. (#16121) · 8ad672a2

Committed by qingqing01 on Mar 15, 2019

* Support Sync Batch Norm.
* Note, do not enable it in one device.

Usage:

build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)

8ad672a2
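
The note about not enabling the option on a single device can be understood from what synchronized batch normalization generally computes: statistics over the samples on all devices rather than per device. The sketch below is an assumption about that general technique and is not taken from this commit's implementation. `sync_batch_stats` is a hypothetical helper, and plain Python lists stand in for per-device activations.

```python
from typing import List, Sequence, Tuple


def sync_batch_stats(per_device_batches: Sequence[List[float]]) -> Tuple[float, float]:
    """Return (mean, variance) over the samples of all devices combined.

    Hypothetical helper for illustration only (not a Paddle API).
    """
    # Each device would contribute its local count, sum, and sum of squares;
    # summing them here plays the role of the cross-device reduction.
    count = sum(len(batch) for batch in per_device_batches)
    if count == 0:
        raise ValueError("at least one sample is required")
    total = sum(sum(batch) for batch in per_device_batches)
    total_sq = sum(sum(v * v for v in batch) for batch in per_device_batches)
    mean = total / count
    # Biased (population) variance: E[x^2] - (E[x])^2.
    var = total_sq / count - mean * mean
    return mean, var
```

With a single device the combined statistics equal the local ones, so under this reading synchronization would add communication without changing the result.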

13 Mar, 2019 1 commit
- C
  Add memory profiler (#16137) · 09799566
  Committed by chengduo on Mar 12, 2019
```
test=develop
```
  09799566
11 Mar, 2019 1 commit

Revert "Revert "Add Event for TensorCopy"" (#16035) · ad80bde8

Committed by chengduo on Mar 11, 2019

* Revert "Revert "Add Event for TensorCopy" (#16022)"

This reverts commit e2da3a5b.

* use default stream
test=develop

ad80bde8

06 Mar, 2019 1 commit
- S
  add allocator chain to fix bug · 2a639d5c
  Committed by sneaxiy on Mar 06, 2019
```
test=develop
```
  2a639d5c
04 Mar, 2019 5 commits
- C
  Revert "Add Event for TensorCopy" (#16022) · 92438f61
  Committed by chengduo on Mar 03, 2019
```
* Revert "Add Event for TensorCopy (#15953)"

This reverts commit 7235fd66.
test=develop

* fix CI
test=develop
```
  92438f61
- C
  Add Event for TensorCopy (#15953) · 06f3c857
  Committed by chengduo on Mar 01, 2019
```
Add Event for TensorCopy 
```
  06f3c857
- D
  polish cudnn related code and fix bug. (#15164) · 4449e855
  Committed by dzhwinter on Feb 27, 2019
```
* staged.

* polish code

* polish code. test=develop

* polish code. test=develop

* api change. test=develop

* fix default value. test=develop

* fix default value. test=develop
```
  4449e855
- Y
  Optimize gelu operation with mkl erf. · b48d56e8
  Committed by Yihua Xu on Feb 26, 2019
```
test=develop
```
  b48d56e8
- C
  Revert "Add Event for TensorCopy" (#16022) · e2da3a5b
  Committed by chengduo on Mar 03, 2019
```
* Revert "Add Event for TensorCopy (#15953)"

This reverts commit 7235fd66.
test=develop

* fix CI
test=develop
```
  e2da3a5b
01 Mar, 2019 1 commit
- C
  Add Event for TensorCopy (#15953) · 7235fd66
  Committed by chengduo on Mar 01, 2019
```
Add Event for TensorCopy 
```
  7235fd66
27 Feb, 2019 2 commits

Committed by dzhwinter on Feb 27, 2019

* staged.

* polish code

* polish code. test=develop

* polish code. test=develop

* api change. test=develop

* fix default value. test=develop

* fix default value. test=develop

225c11a9

INT8 Pool kernel Key Creation Optimization. (#15883) · 6724be2b

Committed by xiaolil1 on Feb 27, 2019

* Optimize key creation of INT8 pool kernel to improve the performance of ResNet-50 and MobileNet, especially for latency.
test=develop

* Optimize key creation of pool fp32 grad.
test=develop

6724be2b

26 Feb, 2019 1 commit
- Y
  Optimize gelu operation with mkl erf. · 73967886
  Committed by Yihua Xu on Feb 26, 2019
```
test=develop
```
  73967886
25 Feb, 2019 7 commits

P

test=develop · c6472579
Committed by peizhilin on Feb 25, 2019

c6472579
P
fix build issue for cudaEvent_t · b5d6e38b
Committed by peizhilin on Feb 25, 2019
```
test=develop
```
b5d6e38b
C
Remove unnecessary dependence for profiler (#15899) · 8e904d32
Committed by chengduo on Feb 25, 2019
```
* refile profiler
test=develop

* follow comment
test=develop
```
8e904d32
Z

update with develop. test=develop · 9261cf39
Committed by Zhen Wang on Feb 25, 2019

9261cf39
Z

add set_attr for IrOpNode. test=develop · 0bf809c9
Committed by Zhen Wang on Feb 25, 2019

0bf809c9

[MKL-DNN] MKL-DNN specific Tensor modification (#15429) · dec9cf53

Committed by Jacek Czaja on Feb 25, 2019

* - Implemented draft of primitive desc keeping in Tensor

test=develop

- TransposeMKLDNNHandler::AcquireSrcMemory was reimplemented

- Added nchw and nc formats setting for the sake of compatibility

Fixed unit tests

- Workaround for problem with 5D data in conv

- Added 3D and 1D MKL-DNN formats for name handles for tensor

test=develop

- Fix to UTs

test=develop

- Conv fp32 op was updated

Cosmetic fixes

test=develop

- tensor mkldnn cosmetics

test=develop

- Moved most of mkl-dnn specific code from Tensor to mkl-dnn utils

* - Lint fixes

test=develop

* - setting prim desc in Tensor also sets layout to kMKLDNN

test=develop

* - Moved creation of prim desc totally out of Tensor

test=develop

* - Cosmetic fixes after review

test=develop

dec9cf53

P
fix build issue on windows for sample prop op · 6ccdb1b9
Committed by peizhilin on Feb 25, 2019
```
test=develop
```
6ccdb1b9

24 Feb, 2019 1 commit
- D
  
  add memset CUPTI && test=develop (#15868) · c6bd434f
  Committed by Dun on Feb 24, 2019
  
  c6bd434f
22 Feb, 2019 4 commits

S
Change *(smart_ptr.get()) -> *smart_ptr · 74672d1a
Committed by Sylwester Fraczek on Feb 07, 2019
```
reason: dereferencing smart pointer is the same as the underlying pointer
test=develop
```
74672d1a
T
Revert 15770 develop a6910f90 gelu mkl opt (#15872) · ee2321de
Committed by tensor-tang on Feb 22, 2019
```
* Revert "Optimze Gelu with MKL Erf function (#15770)"

This reverts commit 676995c8.

* test=develop
```
ee2321de
C
enhance profiler (#15842) · 3b08c9ab
Committed by chengduo on Feb 22, 2019
```
test=develop
```
3b08c9ab

Optimze Gelu with MKL Erf function (#15770) · 676995c8

Committed by Yihua Xu on Feb 22, 2019

* Optimize for gelu operator

* Set up the low accuracy mode of MKL ERF function.

test=develop

* Only enable MKLML ERF when OS is linux

* Use the special mklml version that includes the vmsErf function to verify gelu mkl kernel.

test=develop

* Add the CUDA macro to avoid NVCC's compile issue.

test=develop

* Add the TODO comments for mklml library modification.

test=develop

* Clean Code

test=develop

* Add the comment of macro for NVCC compiler.

test=develop

676995c8
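
The gelu commits in this log concern how the erf term of the activation is evaluated (the messages mention the MKL `vmsErf` function and a low accuracy mode). As a reference point, the sketch below evaluates the standard GELU definition with Python's `math.erf` and compares it with the widely used tanh approximation. It is a scalar illustration under the assumption that the operator follows the standard definition; it does not reproduce the MKL kernel, and the function names are hypothetical.

```python
import math


def gelu_exact(x: float) -> float:
    """Standard GELU definition: 0.5 * x * (1 + erf(x / sqrt(2)))."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Common tanh approximation of GELU, shown for comparison only."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


if __name__ == "__main__":
    # Print both variants side by side for a few sample inputs.
    for x in (-2.0, -0.5, 0.0, 0.5, 2.0):
        print(x, gelu_exact(x), gelu_tanh(x))
```

A reference like this can be useful when checking a reduced-accuracy erf implementation: the difference against the exact form should stay within the tolerance the tests allow.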

21 Feb, 2019 1 commit
- T
  disable dam temporarily (#15860) · e3dd6970
  Committed by Tao Luo on Feb 21, 2019
```
test=develop
```
  e3dd6970
