提交 · 02dab46ab8101873663a63614f88931ead7846d9 · BaiXuePrincess / Paddle

28 1月, 2019 2 次提交
- Q
  
  add some debug info · 02dab46a
  由 Qiao Longfei 提交于 1月 28, 2019
  
  02dab46a
- Q
  
  optimize test_async_ssa_graph_executor_mnist · 7e145b7c
  由 Qiao Longfei 提交于 1月 28, 2019
  
  7e145b7c
27 1月, 2019 3 次提交
- Q
  
  clean code of test_async_ssa_graph_executor_mnist · 9da96aba
  由 Qiao Longfei 提交于 1月 27, 2019
  
  9da96aba
- Q
  
  add some debug infor · be738a64
  由 Qiao Longfei 提交于 1月 27, 2019
  
  be738a64
- Q
  
  add GenParentScopeTreeDebugInfo · 62549e07
  由 Qiao Longfei 提交于 1月 27, 2019
  
  62549e07
26 1月, 2019 7 次提交
- Q
  Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor · a66115be
  由 Qiao Longfei 提交于 1月 26, 2019
```
test=develop
```
  a66115be
- Q
  
  code optimize · fab8457e
  由 Qiao Longfei 提交于 1月 26, 2019
  
  fab8457e
- G
  
  revert test=develop (#15535) · d303270a
  由 gongweibao 提交于 1月 26, 2019
  
  d303270a
- T
  Merge pull request #15532 from hshen14/calibration_api_refine · 0548aac2
  由 Tao Luo 提交于 1月 26, 2019
```
Refine INT8 calibration API
```
  0548aac2
- T
  Merge pull request #15538 from baojun-nervana/mv_ng_bridge_file · 8e2dea57
  由 Tao Luo 提交于 1月 26, 2019
```
move ngraph_bridge to ngraph directory 
```
  8e2dea57
- Y
  
  add dynamic memory optim (#15457) · e2818c86
  由 Yan Chunwei 提交于 1月 26, 2019
  
  e2818c86
- B
  
  mv ngraph_bridge to ngraph directory test=develop · 8e9308a5
  由 baojun-nervana 提交于 1月 25, 2019
  
  8e9308a5
25 1月, 2019 13 次提交
- Q
  Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor · ada43e89
  由 Qiao Longfei 提交于 1月 25, 2019
```
test=develop
```
  ada43e89
- R
  Merge pull request #15027 from shippingwang/shufflechannel · 88bd7e1a
  由 ruri 提交于 1月 25, 2019
```
Add Shuffle Channel Operator
```
  88bd7e1a
- H
  
  Refine INT8 calibration API; shorten the iteration number to reduce test time; test=develop · 2a82c565
  由 Haihao Shen 提交于 1月 25, 2019
  
  2a82c565
- T
  Merge pull request #15515 from tensor-tang/jit/benchmark · e043ea96
  由 tensor-tang 提交于 1月 25, 2019
```
jit benchmark use tensor with alignment
```
  e043ea96
- 乔
  Merge pull request #14731 from jacquesqiao/optimize-cpp-reader · c5855506
  由乔龙飞 Qiao Longfei 提交于 1月 25, 2019
```
Optimize cpp reader
```
  c5855506
- G
  
  cleanup test=develop (#15347) · d54494ba
  由 gongweibao 提交于 1月 25, 2019
  
  d54494ba
- G
  
  Add GetVariableNoBarrier on brpc. (#15488) · fe8f28c9
  由 gongweibao 提交于 1月 25, 2019
  
  fe8f28c9
- T
  fix bug in merge_ids (#15503) · 981fc2bd
  由 tangwei12 提交于 1月 25, 2019
```
* fix mistakes in merge_ids, test=develop
```
  981fc2bd
- Z
  Merge pull request #15504 from NHZlX/fix_conv2d_fusion · a7ba07d7
  由 Zhaolong Xing 提交于 1月 25, 2019
```
Add check: conv_fusion op runs with cudnn version > 7100 .
```
  a7ba07d7
- B
  Adding ngraph_engine_op (#14948) · efce2567
  由 baojun 提交于 1月 24, 2019
```
* enable ngraph_engine_op
test=develop

* merge develop test=develop

* avoid const_cast test=develop

* rm ngraph_operator test=develop

* Added TODO to move EnableNgraph test=develop

* Add TODO to remove const_cast test=develop
```
  efce2567
- C
  add limit_of_tmp_allocation for CI (#15513) · 7166b52a
  由 chengduo 提交于 1月 24, 2019
```
test=develop
```
  7166b52a
- C
  Revert conv transpose cudnn (#15514) · f8f91fb4
  由 chengduo 提交于 1月 24, 2019
```
* Revert "set constant for loss"

This reverts commit 167933f678ccbb3563e949710279efe004a27731.

* Revert "remove workspace_handle"
test=develop
This reverts commit b4aca8ede9e685bce1dfb1c59e63919f33432572.
```
  f8f91fb4
- T
  jit benchmark use tensor · b67584a6
  由 tensor-tang 提交于 1月 24, 2019
```
test=develop
```
  b67584a6
24 1月, 2019 15 次提交
- Y
  Add the CUDA kernel for beam_search op (#15020) · 3008fa12
  由 Yiqun Liu 提交于 1月 24, 2019
```
* Refine the beam_search op and test.

* A basic CUDA implementation of beam_search for small batch_size.

* Implement CUDA kernel for beam_search_op.

* Use multiple CUDA threads in the same block to select the top beam.

* Update the python api of beam_search op.

* Enable extend function in CPU kernel of beam_search op.

* Unify the CUDA codes.
test=develop

* Unify the CPU kernel of beam_search op.

* Ensure the seletced items of beam_search_op's CPU kernel sorted by scores.

* Update the description of beam_search in API.spec.

* Enable the use of CUDA kernel in beam_search op.

* Exclude the beam_search's CUDA unittest when there is no CUDA gpu, and delete some debuging statements.
test=develop

* Follow comments.
test=develop

* Call the CPU kernel for beam_search op when batch_size > 4.
test=develop

* Remove the except of is_empty op in PrepareData.
test=develop
```
  3008fa12
- Z
  Merge pull request #15501 from sneaxiy/disable_eager_deletion_mnist · ed1726ea
  由 Zeng Jinle 提交于 1月 24, 2019
```
Disable eager deletion unittest temporarily since random failure.
```
  ed1726ea
- Z
  Merge pull request #15496 from sneaxiy/lazy_allocator2 · 2480a3df
  由 Zeng Jinle 提交于 1月 24, 2019
```
Fix bug when user set CUDA_VISIBLE_DEVICES be empty and run CPU-only models
```
  2480a3df
- W
  
  fix tangwei merge issue test=develop (#15506) · 22db82c0
  由 Wu Yi 提交于 1月 24, 2019
  
  22db82c0
- Z
  Merge pull request #15460 from sneaxiy/try_to_turn_on_remove_unnecessary_lock · dec89bd7
  由 Zeng Jinle 提交于 1月 24, 2019
```
Turn on remove_unnecessary_lock by default
```
  dec89bd7
- C
  Clean elementwise_op_function (#15502) · bf91d11e
  由 chengduo 提交于 1月 24, 2019
```
test=develop
```
  bf91d11e
- T
  nce add check sample lables, test=develop (#15463) · 5cfc40de
  由 tangwei12 提交于 1月 24, 2019
```
* nce add check sample lables, test=develop
```
  5cfc40de
- S
  
  test=develop · 9c360cc7
  由 sneaxiy 提交于 1月 24, 2019
  
  9c360cc7
- N
  fix comments · 96413249
  由 nhzlx 提交于 1月 24, 2019
```
test=develop
```
  96413249
- N
  When cudnn version < 7100, there is problem with conv_fusion. · 484b3bc8
  由 nhzlx 提交于 1月 24, 2019
```
Add check for it.
test=develop
```
  484b3bc8
- T
  Merge pull request #15486 from tensor-tang/fix/pass/debug · af07118d
  由 tensor-tang 提交于 1月 24, 2019
```
fix debug compile issue of analysis pass
```
  af07118d
- L
  Gpu memory monitoring (#15436) · 5d026a88
  由 liuwei1031 提交于 1月 24, 2019
```
* fix github issue 15267 test=develop

* fix github issue 15267 test=develop

* monitor the GPU usage during runtime

* revert allocator_facade.cc change

* comments update test=develop
```
  5d026a88
- X
  Merge pull request #15322 from velconia/imperative_resnet · 58cb18d9
  由 Xin Pan 提交于 1月 24, 2019
```
Imperative Resnet
```
  58cb18d9
- S
  disable eager deletion unittest · eed4a638
  由 sneaxiy 提交于 1月 24, 2019
```
test=develop
```
  eed4a638
- S
  lazy_allocator · 51227bd4
  由 sneaxiy 提交于 1月 23, 2019
```
test=develop
```
  51227bd4

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致