1. 24 Feb, 2022 8 commits
    • Fix for split op in BF16 inference (#39548) · 75f91ce4
      jakpiase committed
      * Fix for split bf16 inference
      
      * added test for pass
      
      * changes after review
    • Optimize where_op and abs_grad_op by the elementwise interface (#39609) · c9699556
      huangxu96 committed
      * Optimize the where_op by the elementwise_op function
      
      * Modified where_op & abs_grad_op by elementwise interface
    • Add Note for Place of Executor in Parallel Environment (#39063) · 867224b2
      Huihuang Zheng committed
      Add note for Place of Executor in parallel environment
    • fix bug for block state (#39854) · 5fd7b5c3
      JZ-LIANG committed
    • [Eager] save load testcase (#39571) · 6b5749eb
      wanghuancoder committed
      * eager, test=develop
      
      * fix bug, test=develop
      
      * eager, test=develop
      
      * merge legacy to fluid
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Refactor TensorAdd func by template and remove gradient_accumulation in eager
      
      * Remove needless target name
      
      * eager, test=develop
      
      * eager, test=develop
      
      * Use overload instead of template
      
      * Remove legacy code
      
      * Remove legacy code
      
      * selectedrows, test=develop
      
      * Remove DataType test
      
      * eager, test=develop
      
      * eager, test=develop
      
      * support gan, test=develop
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * ptb, test=develop
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * add more test
      
      * eager, test=develop
      
      * Support copiable selected rows and merge develop
      
      * save load, eager, test=develop
      
      * save load, eager, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * revert static_runner, test=develop
      
      * EagerTensor to Tensor, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * clear grad, test=develop
      
      * merge, develop
      
      * merge, develop
      
      * merge, test=develop
      
      * merge, test=develop
      Co-authored-by: JiabinYang <360788950@qq.com>
      Co-authored-by: Weilong Wu <veyron_wu@163.com>
    • Fix a bug in IndexKernel out-of-memory (#39867) · 2136bd42
      niuliling123 committed
    • optimize performance of lookup_table_v2_op (#39856) · d6038c22
      Li Min committed
      * optimize block config and fp16 atomicAdd perf for lookup_table_v2_grad.
    • [PHi] Skip kernel declare for cuda only kernel on rocm (#39869) · 76a6b88d
      Chen Weihang committed
      * skip kernel declare for cuda only kernel on rocm
      
      * fix error
  2. 23 Feb, 2022 26 commits
  3. 22 Feb, 2022 6 commits