1. 15 Feb, 2022 2 commits
    • [Pten]Move expand_v2 to pten (#39471) · 2d16d69b
      Linjie Chen committed
      * move expand to pten
      
      * move expand_v2 to pten
      
      * move expand_v2 to pten
      
      * fix grad register
      
      * fix grad register
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix ci
      
      * fix tensorcopy
    • [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 committed
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu successfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: xiongkun <xiongkun03@baidu.com>
  2. 14 Feb, 2022 11 commits
    • 9c2cee1c
    • [Bug fix] prevent squashing pair u8 dequantize -> s8 quantize (#39346) · 66b5348e
      Sylwester Fraczek committed
      * prevent squashing pair u8 dequantize -> s8 quantize
      
      * add relu op to check for uint8
      
      * fix ptq fc attr name fuse_activation->activation_type
      
      * fix
      
      * add unit test
      
      * remove unused variable
      
      * test fix unsuccessful
      
      * fix test and logic
      
      * multiline comment
      
      * remove cout
      
      * Revert "fix ptq fc attr name fuse_activation->activation_type"
      
      This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.
      
      * fix ptq fc attr name fuse_activation->activation_type
    • context add generator (#39475) · 463e31f4
      Wilber committed
      * context add generator
      
      * update
    • Add cuda tracer (#39488) · 0790f949
      liutiexing committed
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * add log for Executor
      
      * Add CudaTracer to trace CUDA events
      Co-authored-by: liutiexing <liutiexing@google.com>
    • [NewExe] Ignore eof Exception (#39487) · 2f642159
      liutiexing committed
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * add log for Executor
      
      * Avoid thread reconstruction when EOF
      Co-authored-by: liutiexing <liutiexing@google.com>
    • [PTen] Add HasAttr for ArgumentMappingContext (#39464) · ddb1e23f
      Chen Weihang committed
      * add has_attr for arg map context
      
      * skip useless attr now
      
      * skip attr if not exists
      
      * fix typo
    • [pten] add split kernel (#39060) · d0df5632
      chentianyu03 committed
      * add split kernel
      
      * add split kernel signature
      
      * fix split bug
      
      * modify MakePtenScalarArrayFromVarList
      
      * modify MakePtenScalarArrayFromVarList
      
      * fix split windows register error
      
      * add test case for split kernel
      
      * replace raw split kernel with pten kernel
      
      * fix makeScalar/ScalarArray bug
      
      * remove debug log
      
      * remove int64_t type in buildPtcontext
      
      * update by code review
      
      * fix split dev test failed
      
      * change DenseTensorMeta to MetaTensor
      
      * change split api code from auto gen to manual
      
      * split cuda kernel support bfloat16 type
      
      * fix conflict
      
      * rm raw split kernel
      
      * merge develop branch
      
      * change to pten::errors
    • fix gather_nd, *test=kunlun (#39283) · d12c3636
      TTerror committed
    • [MLU] add mlu kernel for c_broadcast op (#39470) · 1b9e6790
      mhhhh1 committed
    • Fixed get_tensor method for EagerTensor (#39414) · 97229944
      Zhanlue Yang committed
      * Enabled Eager OpTest #1
      
      * Enabled Eager OpTest #1
      
      * Fixed get_tensor method for EagerTensor
    • Adjusted python-level trace_op to accommodate final state Eager Dygraph (#39319) · ec8a0c1d
      Zhanlue Yang committed
      * Removed debug info
      
      * Added automatic code generation for final state Eager Dygraph
      
      * Modified backward yaml
      
      * Added EagerUtils helper functions for final state CodeGen
      
      * Adjusted CMakeFiles to support compilation for final state auto generated codes
      
      * Added python-c code generation for final state Eager Dygraph
      
      * Fixed minor issue
      
      * Fixed yaml.load() method failure
      
      * Fixed minor issues
      
      * Refactored Python-C Attributes Parsing Functions
      
      * Fixed minor issue with Python-C AddFunctions
      
      * Adjusted python-level trace_op to accommodate final state Eager Dygraph
      
      * Added Logs for final state Eager Dygraph
      
      * Fixed merge issues
      
      * Fixed minor issue
  3. 11 Feb, 2022 18 commits
  4. 10 Feb, 2022 9 commits