1. 24 11月, 2021 4 次提交
  2. 23 11月, 2021 6 次提交
  3. 22 11月, 2021 3 次提交
    • F
      disable copying of datatype when sharing buffer between two tensors. (#37247) · 9ec1432d
      Feiyu Chan 提交于
      * disable copying of datatype when sharing buffer between two tensors.
      * fix for mkldnn operator kernels (elementwise_add, sum, softplus, softmax, scale, activation), mannually set the data type when reusing memory by ShareBufferWith.
      9ec1432d
    • C
      [PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5
      chentianyu03 提交于
      * add cast kernel
      
      * add cast cuda kernel
      
      * add cast kernel
      
      * make cast kernel output dtype undefined
      
      * get cast dtype from vardesc
      
      * move cast to manipulation and add test case
      
      * add castinfershape
      
      * avoid reinitilaze variable
      
      * InitializeVariable support datatype
      
      * merge develop branch
      
      * fix merge bug
      
      * revert modify initializeVariable
      
      * revert modify on InitializeVariable
      
      * revert modify on InitializeVariable
      
      * mutable support reset dtype
      
      * enable make pten tensor from variable when def_arg.type is undefined
      
      * fix build pten ctx start_idx error
      
      * copy pten out tensor to variable
      
      * merge develop branch
      
      * fix non pten kernel cast failed
      
      * add reset allocation place for remake tensor
      
      * fix inplace realloc error
      
      * add mutable on pten kernles and remove unused cast files
      
      * rename function names
      
      * fix output type error
      
      * fix conflict with develop branch
      
      * set data type to variable with pten's dtype
      
      * fix test_cast_api type mismatch
      
      * densorTensro mutable_data support 0 bytes value
      
      * fix the inplace bug of reshape kernel
      
      * fix pten.backend != variable.place when moving storage, palce mismatch bug
      
      * fix conflict with develop branch
      
      * Fix bug of paddle::experimental::MovesStorage
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * Revert "fix ReMakePtenDenseTensor place mismatch bug"
      
      This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * reverts the set_lod interface, test=develop
      
      * modify by the review options
      
      * modify error message
      
      * add & for const input arguments
      
      * add reference in params
      
      * elementwise_sub add mutable_data
      
      * fix ResetHolderWithType check size bug
      
      * add dependence pten_tensor to test_cast_api object
      
      * remove unused code to pass ci coverage
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
      5caa6fc5
    • L
      [new feature] add local scope for interpretercore (#37379) · 1f0512be
      Leo Chen 提交于
      1f0512be
  4. 19 11月, 2021 2 次提交
    • J
      Optimize cinn_cache_key by replace GraphToProgram to Dot string (#37317) · edc3496f
      jiangcheng 提交于
      * optimize cache-key by replace GraphToProgram to Dot string
      
      * fix compile failure bug
      edc3496f
    • W
      Add fuse_resnet_unit pass (#36818) · 3cd3bf29
      wuhuanzhou 提交于
      * GeneratePass support attr condition and mapping, test=develop
      
      * fix coverage, test=develop
      
      * Add fuse_resnet_unit pass, test=develop
      
      * fix CI errors, test=develop
      
      * fix CI errors, test=develop
      
      * fix unittest error when compiling without CUDA, test=develop
      
      * fix static ci error, test=develop
      
      * limit kernel size must equal 1, test=develop
      3cd3bf29
  5. 18 11月, 2021 1 次提交
    • Z
      Add the `GetFetchNames` method in CinnGraphSymbolization. (#37218) · 3ad495e8
      Zhen Wang 提交于
      * Add the `GetFetchNames` method in CinnGraphSymbolization.
      
      * Use unordered_set instead vector as the type of fetch_var_names.
      
      * Reuse the definition of kCompilationKey.
      
      * Use CompileOptions to set fetch_var_ids.
      
      * Update the argument passing of GraphCompiler.Build.
      
      * Fix some bugs in CinnGraphSymbolization::GetFetchIds.
      3ad495e8
  6. 17 11月, 2021 8 次提交
  7. 16 11月, 2021 3 次提交
  8. 15 11月, 2021 7 次提交
    • C
      [Pten] Refactor the implementation of custom operator (#37122) · 1e598f1a
      Chen Weihang 提交于
      * move extension into pten [no-verify]
      
      * append tensor methods by ext_tensor [no-verify]
      
      * append other tensor methods [no-verify]
      
      * ext related files tidy [no-verify]
      
      * include relation tidy [no-verify]
      
      * add pten tensor test [no-verify]
      
      * replace tensor in custom op & compile success
      
      * refine tensor constructor for unittest
      
      * custom relu jit run success
      
      * fix all custom op unittests
      
      * add inference cmake adapt [no-verify]
      
      * fix failed unittests
      
      * fix windows failed unittests
      
      * try to fix kunlun and inference failed
      
      * fix test_elementwise_api error
      
      * try to fix win compile failed
      
      * fix kunlun fp16 type error
      
      * remove useless haddle error macro
      
      * add custom linear op test
      
      * fix compile failed & add win symbols
      
      * fix non pten kernel cast failed
      
      * add dll decl for api
      
      * polish several deetails
      
      * polish details by review comment
      
      * add dll_decl for register
      1e598f1a
    • L
      [new-exec] fix stream analysis (#37161) · 584b4b24
      Leo Chen 提交于
      * fix revord_event
      
      * refine class Instruction
      
      * refine Instruction and InterpreterCore
      
      * make instruction and operator_base consistent
      
      * support NoNeedBufferVar in stream_analyzer
      
      * fix place of event
      
      * add vlog before continue
      584b4b24
    • W
      fix 3 bug of new_executor (#37142) · 8358d614
      wanghuancoder 提交于
      * fix 3 bug, test=develop
      
      * refine, test=develop
      8358d614
    • J
      fix cinn_compile_test not pass problem (#37190) · 83eef6d2
      jiangcheng 提交于
      83eef6d2
    • Z
      Add distributed pass framework: including PassBase/PassTest/PassUtils (#36643) · 12339fa0
      Zeng Jinle 提交于
      * add split_program
      
      * make ut faster
      
      * increase ut timeout
      
      * make result deterministic
      
      * add fuse_all_reduce pass
      
      * add ut framework, update
      
      * fix ut framework
      
      * remove useless code
      
      * add coverage support
      
      * update
      
      * fix CI
      
      * fix some bugs and fix ci coverage
      
      * fix conflict
      12339fa0
    • J
      10cc040d
    • Z
      [heterps]bug fix for local training with --heter_worker_num (#37166) · 31cd9145
      zmx 提交于
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      31cd9145
  9. 14 11月, 2021 1 次提交
    • Y
      [PTen]Reshape Kernel Refactor (#37164) · 895692e3
      YuanRisheng 提交于
      * reshape kernel refactor
      
      * fix compile bugs when run ci
      
      * support xpu for reshape
      
      * fix bugs when run unittest in kunlun ci
      
      * fix compile bugs when run kunlun
      
      * perfect code according to suggestion
      895692e3
  10. 12 11月, 2021 3 次提交
  11. 11 11月, 2021 2 次提交