1. 23 Mar, 2022 14 commits
    • J
      Added support for BF16 datatype for all oneDNN activation kernels (#40721) · 8e67629c
      jakpiase committed
      * added missing BF16 activations
      
      * added softplus bf16
      
      * minor change
      
      * disabled tests for GPU
      8e67629c
    • F
      [NPU] add npu support for conv3d and conv3d_grad (#38480) · ff568afa
      furnace committed
      * [NPU] add npu support for conv3d and conv3d_grad
      
      * [NPU] delete failing unit tests that Ascend does not support
      
      * [NPU] delete debug code
      
      * [NPU] optimize code, notest
      
      * [NPU] remove const_cast
      
      * [NPU] optimize the removal of const_cast
      
      * [NPU] fix typos
      ff568afa
    • Z
      two-phase training for ps (#40762) · b1a4668c
      zhaocaibei123 committed
      * fix benchmark and communicator config
      
      * fix bugs of the_one_ps
      
      * multi program and fix bug in optimizer
      
      * multi program in the_one_ps
      
      * public commcontext
      
      * ps optimizer multi programs
      
      * cvm & datanorm backend
      
      * fix dim
      
      * fix unittest
      
      * fix
      
      * the one ps merge
      
      * remove comm
      
      * add DownpourLiteWorker
      
      * all
      
      * fix
      
      * fix
      
      * device worker downpour lite
      
      * fix
      
      * fix bug in global shuffle
      
      * save inference model
      
      * fix & add log
      
      * fix
      
      * remove log
      
      * fix
      
      * fix save summary
      
      * fix
      
      * fix pscore
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * remove logs
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * add some comments
      
      * fix
      Co-authored-by: esythan <esythan@126.com>
      b1a4668c
    • Z
      [AutoParallel] engine & dist_saver (#40528) · 3980e222
      zhaoyingli committed
      * add dist_saver and update engine
      
      * add dist_saver and update engine
      3980e222
    • W
      [Eager Hook + Inplace] Refactor register_hook and test with inplace operation (#40778) · ff7cbaae
      Weilong Wu committed
      * disable scatter case in test_inplace_eager_fluid
      
      * Update register_hook logic
      
      * Add register_hook test cases
      Co-authored-by: pangyoki <pangyoki@126.com>
      ff7cbaae
    • J
      Support sharding (#40637) · fe291daf
      Jiabin Yang committed
      * support sharding api
      
      * support multi api for sharding in eager
      
      * support multi api for sharding in eager
      
      * fix test
      
      * fix test coverage
      fe291daf
    • H
      Add yaml config part2 (#40742) · f4075db8
      hong committed
      * fix error; test=develop
      
      * update
      
      * close some yaml
      
      * fix backward attribute error; test=develop
      
      * add div test
      
      * polish code; test=develop
      
      * remove non-GBK characters;
      
      * remove some yaml;
      
      * fix optional bug
      
      * recover yaml config
      
      * resolve conflict; test=develop
      
      * close div; test=develop
      f4075db8
    • W
      [Eager] Slice (#40587) · b07d239c
      wanghuancoder committed
      * fix some slice bug, test=develop
      
      * eager slice, test=develop
      
      * eager slice, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix bug, test=develop
      
      * refine, test=develop
      
      * rename function name, test=develop
      b07d239c
    • Z
      Support initializing specific grad tensors to zero for selected operators (#39963) · 2f50ae99
      Zhanlue Yang committed
      * Supported Complex2Real Conversion for Eager Dygraph
      
      * Supported Complex2Real Conversion for Eager Dygraph
      
      * Enabled complex type promotion test for matmul_v2
      
      * Fix CI issues
      
      * Support initializing specific grad tensors to zero for selected operators
      
      * Merged adj_edges_ with GradSlotMeta
      
      * Fixed minor issue
      
      * Adjusted num runs
      
      * Recovered Eager performance tests configurations
      
      * Recovered Eager performance tests configurations
      
      * Adjusted performance tests configurations
      
      * Fixed Minor Issues with performance tests
      
      * Moved out Edge from GradSlotMeta
      
      * Fixed issues from merge
      
      * Fixed typo
      
      * Addressed review comments
      
      * Fixed merge issues
      
      * Fixed minor issues
      
      * Fixed minor issue
      
      * Fixed major issues and enabled auto_prune test cases
      
      * Fixed issues from merge
      2f50ae99
    • K
      Add complex type compatibility for stft api and stft op. (#40113) · 319f95d0
      KP committed
      * Add stft_op.
      
      * Add stft_grad_op.
      
      * Add stft_op unittest.
      
      * [DLTP-45176] Add complex compatibility in static mode for stft api.
      
      * [DLTP-45176] Add complex compatibility in static mode for stft api.
      
      * Add doc.
      
      * Update unit tests of stft op.
      
      * Update spectral helper.
      
      * fix coding style.
      319f95d0
    • C
      Add profiler features (#40357) · c15e3823
      chenjian committed
      * add event record for model profiling
      
      * fix format
      
      * fix format
      
      * fix code example bug
      
      * no
      
      * add profiler statistic
      
      * add profiler feature
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * required: gpu
      
      * required: gpu
      
      * fix bug
      
      * required: gpu
      
      * fix ci bug
      
      * fix ci error
      
      * fix ci error
      
      * upgrade document
      
      * fix doc
      
      * fix ci bug
      
      * add doc and fix bug
      
      * nothing
      
      * fix bug
      
      * fix format bug
      
      * modify format
      
      * add deprecated description for old profiler
      
      * fix bug
      
      * fix bug
      
      * fix
      
      * add load_profiler_result doc
      
      * add load_profiler_result doc
      
      * add load_profiler_result doc
      
      * help fix old profiler sample code
      
      * add api doc
      
      * fix format
      
      * fix api doc
      
      * fix api doc format
      
      * fix api doc format
      
      * fix api doc c format
      
      * fix api doc format
      c15e3823
    • zhouweiwei2014's avatar
      58970995
    • W
      db41e39e
    • K
      enable continuous log; update doc (#40782) · fdafbc7b
      kuizhiqing committed
      fdafbc7b
  2. 22 Mar, 2022 6 commits
  3. 21 Mar, 2022 9 commits
  4. 20 Mar, 2022 3 commits
  5. 19 Mar, 2022 5 commits
    • P
      fix bugs · c01bcbf6
      phlrain committed
      c01bcbf6
    • Z
      Call sparse op from python (#40608) · 95fbbc5b
      zhangkaihuo committed
      * call sparse api from python
      95fbbc5b
    • P
      fix some bugs; test=develop · 111ee988
      phlrain committed
      111ee988
    • H
      Add infer meta (#40544) · 8e4e19ab
      hong committed
      * add infer meta; test=develop
      
      * add histogram infer meta; test=develop
      
      * fix unit test bug; test=develop
      
      * format; test=develop
      
      * format; test=develop
      
      * bn not use new infer meta; test=develop
      
      * add infer meta; test=develop
      
      * fix bug; test=develop
      
      * fix bug;
      
      * recover unit test; test=develop
      8e4e19ab
    • P
      support inplace in dygraph eager_fluid state (#40400) · 8e612903
      pangyoki committed
      * [Eager] Support eager grad interface, draft version
      
      * Support eager grad interface with allow_unused and multi startup_op
      
      * Fix code format
      
      * Fix allow_unused case, return PyNone if tensor is not initialized
      
      * Support output's stop_gradient related to create_graph
      
      * Support grad exception case in eager mode, fix coverage CI
      
      * Update ToPyObject, return PyNone if not initialized
      
      * AccumulationNode add FLAGS_retain_grad_for_all_tensor
      
      * Fix ci issue
      
      * Fix CI issue
      
      * fix, use core.eager.Tensor
      
      * Add func SetBufferSlotRankZeros for GradTensorHolder
      
      * Support retain_graph by using ClearTensorWrappers
      
      * Support retain_graph by using ClearTensorWrappers
      
      * Update retain_graph and no_grad_vars related test case
      
      * Update code gen logic for ClearTensorWrappers
      
      * Fix by override statement
      
      * fix override func args
      
      * Support retain_graph, update unit tests
      
      * Updated ClearTensorWrappers logic
      
      * fix grad python interface
      
      * Use deep copy and update unit tests
      
      * Polish code
      
      * Polish code
      
      * Fix CI issue, use deep copy only when user sets grad_tensors
      
      * Fix CI, use Backward instead of RunBackward
      
      * Fix CI, Declare kernel explicitly in test file
      
      * Polish, remove vector of TensorWrapper
      
      * Refactor the logic of grad/backward, polish codes
      
      * Update code after merge upstream develop
      
      * Polish after merge upstream develop
      
      * Update to adapt new GradNodeBase superclass
      
      * Fix error introduced during conflict resolution
      
      * support inplace strategy in eager_fluid state
      
      * solve conflict
      
      * nothing
      
      * Update purify potential_startup_nodes logic
      
      * Fix errors
      
      * Polish code
      
      * Remove useless args for ToPyObject
      
      * Remove useless TensorWrappersSet
      
      * fix record conflict
      
      * Fix code-format, re-install pre-commit
      
      * fix tensor_wrapper bug
      
      * Fix pre-process logic for potential_startup_ops
      
      * Update unit tests, use eager mode
      
      * Fix conflicts
      
      * fix unittest timeout
      
      * little change
      Co-authored-by: Weilong Wu <veyron_wu@163.com>
      8e612903
  6. 18 Mar, 2022 3 commits