提交 · d303270a0e3a640da1abc75936179c75250ba3e9 · PaddlePaddle / Paddle

26 1月, 2019 5 次提交
- G
  
  revert test=develop (#15535) · d303270a
  由 gongweibao 提交于 1月 26, 2019
  
  d303270a
- T
  Merge pull request #15532 from hshen14/calibration_api_refine · 0548aac2
  由 Tao Luo 提交于 1月 26, 2019
```
Refine INT8 calibration API
```
  0548aac2
- T
  Merge pull request #15538 from baojun-nervana/mv_ng_bridge_file · 8e2dea57
  由 Tao Luo 提交于 1月 26, 2019
```
move ngraph_bridge to ngraph directory 
```
  8e2dea57
- Y
  
  add dynamic memory optim (#15457) · e2818c86
  由 Yan Chunwei 提交于 1月 26, 2019
  
  e2818c86
- B
  
  mv ngraph_bridge to ngraph directory test=develop · 8e9308a5
  由 baojun-nervana 提交于 1月 25, 2019
  
  8e9308a5
25 1月, 2019 12 次提交
- R
  Merge pull request #15027 from shippingwang/shufflechannel · 88bd7e1a
  由 ruri 提交于 1月 25, 2019
```
Add Shuffle Channel Operator
```
  88bd7e1a
- H
  
  Refine INT8 calibration API; shorten the iteration number to reduce test time; test=develop · 2a82c565
  由 Haihao Shen 提交于 1月 25, 2019
  
  2a82c565
- T
  Merge pull request #15515 from tensor-tang/jit/benchmark · e043ea96
  由 tensor-tang 提交于 1月 25, 2019
```
jit benchmark use tensor with alignment
```
  e043ea96
- 乔
  Merge pull request #14731 from jacquesqiao/optimize-cpp-reader · c5855506
  由乔龙飞 Qiao Longfei 提交于 1月 25, 2019
```
Optimize cpp reader
```
  c5855506
- G
  
  cleanup test=develop (#15347) · d54494ba
  由 gongweibao 提交于 1月 25, 2019
  
  d54494ba
- G
  
  Add GetVariableNoBarrier on brpc. (#15488) · fe8f28c9
  由 gongweibao 提交于 1月 25, 2019
  
  fe8f28c9
- T
  fix bug in merge_ids (#15503) · 981fc2bd
  由 tangwei12 提交于 1月 25, 2019
```
* fix mistakes in merge_ids, test=develop
```
  981fc2bd
- Z
  Merge pull request #15504 from NHZlX/fix_conv2d_fusion · a7ba07d7
  由 Zhaolong Xing 提交于 1月 25, 2019
```
Add check: conv_fusion op runs with cudnn version > 7100 .
```
  a7ba07d7
- B
  Adding ngraph_engine_op (#14948) · efce2567
  由 baojun 提交于 1月 24, 2019
```
* enable ngraph_engine_op
test=develop

* merge develop test=develop

* avoid const_cast test=develop

* rm ngraph_operator test=develop

* Added TODO to move EnableNgraph test=develop

* Add TODO to remove const_cast test=develop
```
  efce2567
- C
  add limit_of_tmp_allocation for CI (#15513) · 7166b52a
  由 chengduo 提交于 1月 24, 2019
```
test=develop
```
  7166b52a
- C
  Revert conv transpose cudnn (#15514) · f8f91fb4
  由 chengduo 提交于 1月 24, 2019
```
* Revert "set constant for loss"

This reverts commit 167933f6.

* Revert "remove workspace_handle"
test=develop
This reverts commit b4aca8ed.
```
  f8f91fb4
- T
  jit benchmark use tensor · b67584a6
  由 tensor-tang 提交于 1月 24, 2019
```
test=develop
```
  b67584a6
24 1月, 2019 15 次提交
- Y
  Add the CUDA kernel for beam_search op (#15020) · 3008fa12
  由 Yiqun Liu 提交于 1月 24, 2019
```
* Refine the beam_search op and test.

* A basic CUDA implementation of beam_search for small batch_size.

* Implement CUDA kernel for beam_search_op.

* Use multiple CUDA threads in the same block to select the top beam.

* Update the python api of beam_search op.

* Enable extend function in CPU kernel of beam_search op.

* Unify the CUDA codes.
test=develop

* Unify the CPU kernel of beam_search op.

* Ensure the seletced items of beam_search_op's CPU kernel sorted by scores.

* Update the description of beam_search in API.spec.

* Enable the use of CUDA kernel in beam_search op.

* Exclude the beam_search's CUDA unittest when there is no CUDA gpu, and delete some debuging statements.
test=develop

* Follow comments.
test=develop

* Call the CPU kernel for beam_search op when batch_size > 4.
test=develop

* Remove the except of is_empty op in PrepareData.
test=develop
```
  3008fa12
- Z
  Merge pull request #15501 from sneaxiy/disable_eager_deletion_mnist · ed1726ea
  由 Zeng Jinle 提交于 1月 24, 2019
```
Disable eager deletion unittest temporarily since random failure.
```
  ed1726ea
- Z
  Merge pull request #15496 from sneaxiy/lazy_allocator2 · 2480a3df
  由 Zeng Jinle 提交于 1月 24, 2019
```
Fix bug when user set CUDA_VISIBLE_DEVICES be empty and run CPU-only models
```
  2480a3df
- W
  
  fix tangwei merge issue test=develop (#15506) · 22db82c0
  由 Wu Yi 提交于 1月 24, 2019
  
  22db82c0
- Z
  Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock · dec89bd7
  由 Zeng Jinle 提交于 1月 24, 2019
```
Turn on remove_unnecessary_lock by default
```
  dec89bd7
- C
  Clean elementwise_op_function (#15502) · bf91d11e
  由 chengduo 提交于 1月 24, 2019
```
test=develop
```
  bf91d11e
- T
  nce add check sample lables, test=develop (#15463) · 5cfc40de
  由 tangwei12 提交于 1月 24, 2019
```
* nce add check sample lables, test=develop
```
  5cfc40de
- S
  
  test=develop · 9c360cc7
  由 sneaxiy 提交于 1月 24, 2019
  
  9c360cc7
- N
  fix comments · 96413249
  由 nhzlx 提交于 1月 24, 2019
```
test=develop
```
  96413249
- N
  When cudnn version < 7100, there is problem with conv_fusion. · 484b3bc8
  由 nhzlx 提交于 1月 24, 2019
```
Add check for it.
test=develop
```
  484b3bc8
- T
  Merge pull request #15486 from tensor-tang/fix/pass/debug · af07118d
  由 tensor-tang 提交于 1月 24, 2019
```
fix debug compile issue of analysis pass
```
  af07118d
- L
  Gpu memory monitoring (#15436) · 5d026a88
  由 liuwei1031 提交于 1月 24, 2019
```
* fix github issue 15267 test=develop

* fix github issue 15267 test=develop

* monitor the GPU usage during runtime

* revert allocator_facade.cc change

* comments update test=develop
```
  5d026a88
- X
  Merge pull request #15322 from velconia/imperative_resnet · 58cb18d9
  由 Xin Pan 提交于 1月 24, 2019
```
Imperative Resnet
```
  58cb18d9
- S
  disable eager deletion unittest · eed4a638
  由 sneaxiy 提交于 1月 24, 2019
```
test=develop
```
  eed4a638
- S
  lazy_allocator · 51227bd4
  由 sneaxiy 提交于 1月 23, 2019
```
test=develop
```
  51227bd4
23 1月, 2019 8 次提交

M
Polish code · c8965dc1
由 minqiyang 提交于 1月 23, 2019
```
test=develop
```
c8965dc1
T
fix debug compile of analysis pass fail · 5c68dee7
由 tensor-tang 提交于 1月 23, 2019
```
test=develop
```
5c68dee7
乔
Merge pull request #15080 from jacquesqiao/optimize-assign · d243e555
由乔龙飞 Qiao Longfei 提交于 1月 23, 2019
```
Optimize assign
```
d243e555

[V1.3] Add the calibration tool code for int8 inference and focus test. (#15062) · dbdaf15c

由 guomingz 提交于 1月 23, 2019

* Add the calibration tool code for int8 inference and focus test.

* Fix the calibration tool per the review comments.

test=develop

* Update the calibrator doc and remove extra line.

* Fix the invalid is_negative_input attr set on Mobilenet.

* Add the comments and fix the format issue.

test=develop

* Update the CMakelist.txt for Calibration PR.Disable the Calibration UT if not enable MKLDNN.

test=develop

* Update the CMakeList.txt.

test=develop

* Disable the test_calibration case on WIN and MAC.

test=develop

* Add the missing brackets.

test=develop

* Remove the outdated map operator which not supported on Python3.

test=develop

* Fix the style issue.

test=develop

* 1.Update the CMakeList.txt to disable calibration tool ut when the WITH_MKL is not set;
2.Add the workaround to enable the FLAGS_use_mkldnn for PR_CI(PADDLE).

test=develop

* Fix the typo and format the License header.

test=develop

* 1.Add and Update TODOs per review comments.
2.Code clean.

test=develop

dbdaf15c

Z
Merge pull request #15461 from NHZlX/fix_trt_stream_bug · b7b68f2a
由 Zhaolong Xing 提交于 1月 23, 2019
```
fix trt stream bug.
```
b7b68f2a
T
checkpoint at distributed training (#14854) · 8b50ad80
由 tangwei12 提交于 1月 23, 2019
```
checkpoint for distributed training.
```
8b50ad80
Q

update comment test=develop · 119a3d4d
由 Qiao Longfei 提交于 1月 23, 2019

119a3d4d
N
change the input to a smaller value · e6218c1d
由 nhzlx 提交于 1月 23, 2019
```
test=develop
```
e6218c1d

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功