1. 21 9月, 2019 4 次提交
    • J
      Feature/auto prune in dygraph (#19757) · 45425411
      Jiabin Yang 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * test=develop, refoctor name to make it easier to understand
      
      * test=develop, refoctor name to make it easier to understand
      
      * test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      
      * support auto prune in dygraph mode
      
      * test=develop, support auto prune
      
      * test=develop, merge develop conflict
      
      * test=develop, fix test_layer and test_tracer ut
      
      * test=develop, fix bug which may cause stop_gradient disabled with a list of backward inputs
      45425411
    • A
    • P
      Add TRT input shape check between model and runtime (#19864) · baccd7e2
      Pei Yang 提交于
      * add TRT shape check, test=develop
      
      * model_input_shape == runtime_input_shape, refine message, test=develop
      baccd7e2
    • P
      Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825) · 74812d1c
      Pei Yang 提交于
      * fix trt bugs when sharing params, test=develop
      
      * add unittest for cascade_rcnn
      74812d1c
  2. 20 9月, 2019 11 次提交
  3. 19 9月, 2019 11 次提交
    • L
      Refactor conv computeINT8 (#19574) · 2c32c2d6
      lidanqing 提交于
      * fix conflicts
      test=develop
      
      * change mask_bias_reorder
      test=develop
      
      * add ComputeMask function to make code clear
      test=develop
      
      * change according to reviews
      test=develop
      
      * change according to reviews
      test=develop
      2c32c2d6
    • J
      Fix conv2d+dequantize squash for residual fusion (#19545) · 3f1d0234
      joanna.wozna.intel 提交于
      * Fix conv2d+dequantize squash for residual fusion
      
      test=develop
      
      * Change condition
      
      test=develop
      3f1d0234
    • H
      Fix deps of prune (#19876) · a35557d8
      Huihuang Zheng 提交于
      Add boost as dependency of prune
      
      fix #19862
      a35557d8
    • A
      Add template functions for Acquire primitive/primitive_desc (#19867) · c7e68892
      Adam 提交于
      * Add template functions for Acquire primitive/primitive_desc
      test=develop
      
      * Move acquire primitive descriptor to protected section
      test=develop
      c7e68892
    • F
      hide with inference optim API (#17355) · fe18cfdb
      flame 提交于
      fe18cfdb
    • L
      578a2f5d
    • A
      Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606) · b125e327
      Aurelius84 提交于
      * Remove constraint that last dimension is forced to be 1 in cross_entropy
      test=develop
      
      * modify labels last dims test=develop
      b125e327
    • W
      add precise roi pooling op test=develop (#18960) · a7c440d3
      wopeizl 提交于
      * add precise roi pooling op test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * detail the description test=develop
      
      * test=develop
      
      * elaborate the doc for return type test=develop
      
      * test=develop
      a7c440d3
    • Y
      Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6
      Yiqun Liu 提交于
      * Add fc_elementwise_layernorm_fuse pass and unittest.
      
      * Add fused_fc_elementwise_layernorm op and its GPU kernel.
      test=develop
      
      * Apply fc_elementwise_layernorm_fuse_pass to GPU inference.
      
      * Add the setting of attrs in the definition of binary_op.
      test=develop
      
      * Add comment.
      
      * Implement the unittest.
      test=develop
      
      * Change the unittest name of layer_norm.
      test=develop
      3cd985a6
    • J
      Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus. (#19714) · d9db94d7
      Jie Fang 提交于
      Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus
      d9db94d7
    • W
      Strided slice (#19642) · 47af618f
      wangchaochaohu 提交于
      * strided_slice op basic function test=develop
      
      * test=develop rewrite and fix
      
      * fix bug test=develop
      
      * fix for the PADDLE_ENFORCE usage
      
      * add some unit testw
      
      * fix for the aip  test and copright and fix test=develop
      
      * fix API.spec test=develop
      
      * fix API.spec test=develop
      
      * add axis parameter test=develop
      
      * fix for the build error test=develop
      
      * fix python api  test=develop
      
      * fix the build test=develop
      
      * fix build test=develop
      
      * fix API spec test=develop
      
      * test=develop add some comment and single op test
      
      * fix API spece test=develop
      
      * fix test=develop
      
      * fix test=develop
      
      * fix api test=develop
      
      * fix api test=develop
      
      * fix API.spec test=develop
      
      * fix typo test=develop
      
      * fix API.spec test=develop
      
      * fix API typo test=develop
      
      * fix doc and API.spec test=develop
      47af618f
  4. 18 9月, 2019 9 次提交
  5. 17 9月, 2019 5 次提交