1. 20 8月, 2020 3 次提交
  2. 19 8月, 2020 5 次提交
    • L
      add pad func (#26106) · bcf03273
      littletomatodonkey 提交于
      * add pad func
      
      * add pad
      
      * test=develop, add pad op and apis
      
      * restore pad2d
      
      * test=develop, fix paddl declare
      
      * fix pad interface
      
      * test=develop, fix pad
      
      * test=develop, add all pad api and cos_sim
      
      * test=develop, remove padding default value
      
      * test=develop, rename var to tensor
      
      * test=develop, add more tests
      
      * test=develop, rename tovar to totensor
      
      * test=develop, fix init
      
      * test=develop, add more test
      
      * test=develop, add more tests
      bcf03273
    • C
      Fix ps gpu (#26218) · eeeef957
      Chengmo 提交于
      * support ps-gpu
      eeeef957
    • Z
      add zero norm, inf norm support for p_norm op (#26364) · 6cbeafb6
      Zhong Hui 提交于
      * add zero norm, inf norm support for p_norm op
      
      * fix the invalid argument check, fix the dtype problem in test case.
      6cbeafb6
    • G
      remove scope in cudnn lstm (#25188) · 1fbee267
      GaoWei8 提交于
      1fbee267
    • C
      [Quantization] Conv2d_transpose and mul support channnelwise quantization (#25639) · 3f816bc8
      cc 提交于
      * Conv2d_transpose and mul support channnelwise quantization, test=develop
      * Skip collecting out threshold for output tensor of which the type is not fp32 or fp64, test=develop
      * Fix error in test_user_defined_quantization, test=develop
      * Add depthwise_conv_bn_fuse, test=develop
      * Add conv_transpose_bn_fuse_pass for post_training_quant, test=develop
      3f816bc8
  3. 18 8月, 2020 4 次提交
  4. 17 8月, 2020 1 次提交
  5. 16 8月, 2020 1 次提交
  6. 15 8月, 2020 1 次提交
  7. 14 8月, 2020 5 次提交
  8. 13 8月, 2020 3 次提交
    • W
      support the tuple for attribute of axis in min, max for api2.0 · 2d6cc0b1
      wawltor 提交于
      Update the code for the min,max, test=develop
      2d6cc0b1
    • L
      [OpDevOptimize] Add common infershape functions (#26096) · ffe52b44
      Leo Chen 提交于
      * add unchaged infershape function
      
      * add broadcast infershape function
      
      * fix bug
      
      * rename infershape functions
      
      * add UnaryOpUnchangedInferShapeCheckAxis
      
      * add error message
      
      * add test for common infer shape functions
      
      * dont update existed ops
      
      * dont update op_desc.h
      
      * add more test
      
      * add error check, refine error message
      ffe52b44
    • L
      Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e
      Leo Chen 提交于
      * add auto_cast, test=develop
      
      * add loss scaler, test=develop
      
      * add comments, test=develop
      
      * refine code, test=develop
      
      * refine code, test=develop
      
      * do not set flags automatically, test=develop
      
      * fix custom op bug, test=develop
      
      * add more test, test=develop
      
      * refine enable logic, test=develop
      
      * enable amp test with GPU, test=develop
      
      * add unittest
      
      * add test for found_inf
      
      * follow comments
      
      * follow comments
      
      * remove global variable, use singleton
      
      * add some notes
      
      * update comments
      
      * update comments
      
      * update comments
      
      * add use_dynamic_loss_scaling argument
      
      * refine found_inf
      
      * refine found_inf
      2d95280e
  9. 12 8月, 2020 2 次提交
  10. 10 8月, 2020 6 次提交
  11. 08 8月, 2020 3 次提交
    • F
      update document template for unary elementwise layers (#25896) · e853ece0
      Feiyu Chan 提交于
      1. update document template for unary elementwise layers(a.k.a. activation layer);
      2. remove generate_op_noattr and use generate_activation instead; remove redundant function copies;
      3. minor update for docstring to fix rst format errors.
      4. fix doc for Rsqrt OP
      5. add sample code for each activation separately;
      6. remove the unused deprecated decorator.
      e853ece0
    • J
      Change use_quantizer attribute name and data type (#25838) · 734cf1c3
      joanna.wozna.intel 提交于
      * Change use_quantizer attribute name and data type
      
      * Fix problem with setting attribute
      
      * Add changes due to review
      
      * Small change in function
      
      * Restore use_quantizer attr for compatibility
      734cf1c3
    • L
      refine unsqueeze, test=develop (#25470) · 5258d53d
      Leo Chen 提交于
      * refine unsqueeze, test=develop
      
      * update unsqueeze, test=develop
      
      * refine unsqueeze, test=develop
      
      * refine unsqueeze, test=develop
      
      * update
      
      * remove None, test=develop
      
      * follow comments
      
      * support bool
      
      * update doc
      
      * follow comments
      
      * merge develop
      5258d53d
  12. 06 8月, 2020 2 次提交
  13. 05 8月, 2020 4 次提交
    • Z
      fix invalid read of pnorm gradient function · dca56f47
      Zhong Hui 提交于
      fix invalid read of pnorm gradient function and delete the unused code
      dca56f47
    • Z
      [CUDNN8 support] : support CUDNN8 (#25664) · 358bc06c
      Zhaolong Xing 提交于
      * cunn8 support
      test=develop
      
      * fix ci error
      test=develop
      358bc06c
    • Z
      add eltwise clip cuda impl. (#25689) · 5970871a
      Zhaolong Xing 提交于
      test=develop
      5970871a
    • P
      Fix registering trt plugin (#25744) · b717895f
      Pei Yang 提交于
      * develop dynamic shape serilization
      
      * add test param for gelu
      
      * fix bugs
      
      * delete redundant comments
      
      * debug
      
      * fix conflict. test=develop
      
      * fix bug. test=develop
      
      * add trt dynamic shape serialized support
      
      * fix ernie serialized bug
      test=develop
      
      * fix codestyle
      test=develop
      
      * fix bug
      test=develop
      
      * fix bug.test=develop
      
      * modify cmakelist test=develop
      
      * fix bug
      test=develop
      
      * fix error message.  test=develop
      
      * fix trt register plugin based on pr#25003
      
      * add trt dynload
      
      * fix deserialization bug of not finding plugin registration
      
      * refine code style
      
      * recover engine key in tensorrt_subgraph_pass
      
      * for ci coverage
      
      * add unittest for deserialization
      Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
      b717895f