1. 23 9月, 2019 10 次提交
    • Z
    • M
      Forward recompute3 (#19913) · 9901f696
      mapingshuo 提交于
      * add recompute based checkpoints methods for large batch training
      test=develop
      
      * add append_backward_with_forward_recomputation
      test=develop
      
      * refine optimizer
      test=develop
      
      * update backward and optimizer
      test=develop
      
      * make Variable usable
      test=develop
      
      * add recompute code
      
      * refine optimizer
      test=develop
      
      * refine addup _append_backward_ops_with_checkpoints_
      1) for recompute part, just cache the grad_op_desc without appending to block
      2) before appending grad_op_desc to backward part, addup_repetitive_vars, remove unused branch
      test=develop
      
      * make method private
      
      * add recompute strategy into DistributedStrategy
      test=develop
      
      * checkpoint version3
      test=develop
      
      * remove some print information
      test=develop
      
      * remove unused sumop
      test=develop
      
      * try to fix recompute with graph building modules
      
      * add input names to vars should be held
      
      * add memory debug tool
      
      * backup backward
      
      * Fix bugs
      
      * add backward desc for op not in any segments
      
      * add exception info for sub_block
      
      test=develop
      
      * modify code style
      
      test=develop
      
      * modify code style
      
      test=develop
      
      * remove print functions
      
      test=develop
      
      * add API spec
      
      test=develop
      test=document_preview
      
      * make Recompute a child class of Optimizer
      
      test=develop
      test=document_preview
      
      * add API spec
      
      test=develop
      test=document_preview
      
      * modify API spec
      
      test=develop
      test=document_preview
      
      * add document for Recompute
      
      test=develop
      test=document_preview
      
      * change API doc of Rcompute
      
      test=develop
      test=document_preview
      
      * code cleaning
      
      test=develop
      test=document_preview
      
      * modify API spec
      
      * fix bugs when segments hold no element
      
      * add testcase for Recompute Optimizer
      
      test=develop
      test=document_preview
      
      * add test for apply_gradient, and code cleaning
      
      test=develop
      test=document_preview
      
      * add test case for load function
      
      * enable CI
      
      test=develop
      test=document
      
      * add test case
      
      test=develop
      test=document_preview
      
      * add sample code for 4 function of recompute optimizer
      
      test=develop
      test=document_preview
      9901f696
    • C
      Delete local execution scopes (#19749) · d7251a8e
      chengduo 提交于
      * Add RecordHistoryLocalExecScopes
      test=develop
      d7251a8e
    • W
      remove the useless warning for user to avoid confuse test=develop (#19871) · 5452b6a1
      wopeizl 提交于
      * remove the useless warning for user to avoid confuse test=develop
      5452b6a1
    • R
      add mse_loss (#19759) · d31c92a2
      ruri 提交于
      * add mse_loss op
      d31c92a2
    • H
      Add op compatible information (#19910) · 85b398f1
      hong 提交于
      * add op compatible infomation; test=develop
      
      * add enum type
      
      * add enum type; test=develop
      85b398f1
    • K
      fix softmax CE time limit check failed (#19846) · 3f021781
      Kaipeng Deng 提交于
      * fix softmax ce time limit check failed. test=develop
      
      * refine softmax calc. test=develop
      3f021781
    • T
      move tree_conv to fluid.contrib.layers (#19918) · a4919d36
      Tao Luo 提交于
      * move tree_conv to fluid.contrib.layers
      
      test=develop
      
      * update API.spec for tree_conv
      
      test=develop
      
      * update tree_conv api to increase unit coverage
      
      test=develop
      a4919d36
    • tensor_array_to_tensor_op.cc, test=develop (#19289) · 30adea0a
      石晓伟 提交于
      30adea0a
    • Z
      Unify DataLoader APIs (#19305) · 0436efd6
      Zeng Jinle 提交于
      * unify DataLoader APIs, test=develop
      
      * integrate iterable CPU Dataset, test=develop
      add GPU dataset supporting, test=develop
      
      * add unittests for dataset, test=develop
      
      * add more docs to dataloader apis, test=develop, test=document_preview
      
      * refine doc, test=develop
      
      * refine doc again, test=develop
      
      * increase coverage, test=develop
      0436efd6
  2. 22 9月, 2019 2 次提交
  3. 21 9月, 2019 6 次提交
  4. 20 9月, 2019 11 次提交
  5. 19 9月, 2019 11 次提交
    • L
      Refactor conv computeINT8 (#19574) · 2c32c2d6
      lidanqing 提交于
      * fix conflicts
      test=develop
      
      * change mask_bias_reorder
      test=develop
      
      * add ComputeMask function to make code clear
      test=develop
      
      * change according to reviews
      test=develop
      
      * change according to reviews
      test=develop
      2c32c2d6
    • J
      Fix conv2d+dequantize squash for residual fusion (#19545) · 3f1d0234
      joanna.wozna.intel 提交于
      * Fix conv2d+dequantize squash for residual fusion
      
      test=develop
      
      * Change condition
      
      test=develop
      3f1d0234
    • H
      Fix deps of prune (#19876) · a35557d8
      Huihuang Zheng 提交于
      Add boost as dependency of prune
      
      fix #19862
      a35557d8
    • A
      Add template functions for Acquire primitive/primitive_desc (#19867) · c7e68892
      Adam 提交于
      * Add template functions for Acquire primitive/primitive_desc
      test=develop
      
      * Move acquire primitive descriptor to protected section
      test=develop
      c7e68892
    • F
      hide with inference optim API (#17355) · fe18cfdb
      flame 提交于
      fe18cfdb
    • L
      578a2f5d
    • A
      Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606) · b125e327
      Aurelius84 提交于
      * Remove constraint that last dimension is forced to be 1 in cross_entropy
      test=develop
      
      * modify labels last dims test=develop
      b125e327
    • W
      add precise roi pooling op test=develop (#18960) · a7c440d3
      wopeizl 提交于
      * add precise roi pooling op test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * detail the description test=develop
      
      * test=develop
      
      * elaborate the doc for return type test=develop
      
      * test=develop
      a7c440d3
    • Y
      Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6
      Yiqun Liu 提交于
      * Add fc_elementwise_layernorm_fuse pass and unittest.
      
      * Add fused_fc_elementwise_layernorm op and its GPU kernel.
      test=develop
      
      * Apply fc_elementwise_layernorm_fuse_pass to GPU inference.
      
      * Add the setting of attrs in the definition of binary_op.
      test=develop
      
      * Add comment.
      
      * Implement the unittest.
      test=develop
      
      * Change the unittest name of layer_norm.
      test=develop
      3cd985a6
    • J
      Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714) · d9db94d7
      Jie Fang 提交于
      Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus
      d9db94d7
    • W
      Strided slice (#19642) · 47af618f
      wangchaochaohu 提交于
      * strided_slice op basic function test=develop
      
      * test=develop rewrite and fix
      
      * fix bug test=develop
      
      * fix for the PADDLE_ENFORCE usage
      
      * add some unit testw
      
      * fix for the aip  test and copright and fix test=develop
      
      * fix API.spec test=develop
      
      * fix API.spec test=develop
      
      * add axis parameter test=develop
      
      * fix for the build error test=develop
      
      * fix python api  test=develop
      
      * fix the build test=develop
      
      * fix build test=develop
      
      * fix API spec test=develop
      
      * test=develop add some comment and single op test
      
      * fix API spece test=develop
      
      * fix test=develop
      
      * fix test=develop
      
      * fix api test=develop
      
      * fix api test=develop
      
      * fix API.spec test=develop
      
      * fix typo test=develop
      
      * fix API.spec test=develop
      
      * fix API typo test=develop
      
      * fix doc and API.spec test=develop
      47af618f