1. 15 4月, 2022 3 次提交
    • P
      support no_need_buffer in eager_fluid state (#41720) · 840d2eb6
      pangyoki 提交于
      * support no_need_buffer in eager_fluid state
      
      * change no_need_buffer info from fwd_info to bwd_info
      
      * fix CI fail, gru_unit donnot use no_need_buffer
      
      * fix conflict between no_need_buffer and dispensable
      
      * use tensor.define in dispensable
      
      * solve conflict
      
      * solve conflict
      840d2eb6
    • L
      Change cuDNN Conv kernel for auto tune feature (#41313) · 35acfeda
      limingshu 提交于
      * change cudnn helper for auto-tune
      
      * Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm.
      
      * Fix the bug in calculating and printing current step cache hit rate.
      
      * Improve the autotune cache and fix unittest.
      
      * Change the key from AlgorithmType to int64_t.
      
      * Fix unittest for cpu-only env.
      
      * change ChooseAlgoByWorkspace for heuristic mode
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      35acfeda
    • F
      [MLU] add mlu new profiler (#41138) · fc208b7e
      fwenguang 提交于
      * [MLU] add mlu new profiler
      
      * fix format
      fc208b7e
  2. 14 4月, 2022 5 次提交
  3. 13 4月, 2022 3 次提交
  4. 12 4月, 2022 1 次提交
  5. 11 4月, 2022 1 次提交
  6. 09 4月, 2022 2 次提交
    • Z
      Unittest recover (#41431) · 7a07c4a5
      zhaocaibei123 提交于
      * update name
      
      * update name
      
      * fix test
      
      * fix fleet bind
      
      * update name
      
      * update name
      
      * fix test
      
      * fix gpups wrapper
      
      * remove Push/Pull/Load/Save with context in client and wrapper base class
      
      * fix
      
      * fix
      
      * remove some interface
      
      * fix
      
      * remove
      
      * code style
      
      * recover
      
      * fix
      
      * remove code unused
      
      * remove some unused table & accessor & CommonDenseTable => MemoryDenseTable
      
      * fix
      
      * fix
      
      * fix
      
      * recover
      
      * remove unused code
      
      * recover unittest
      
      * fix
      
      * remove
      
      * fix
      
      * remove code unuseful
      
      * remove
      
      * fix
      
      * recover
      
      * remove
      Co-authored-by: Nesythan <esythan@126.com>
      7a07c4a5
    • W
      fix pylayer mem leak, test=develop (#41559) · be11648a
      wanghuancoder 提交于
      be11648a
  7. 08 4月, 2022 1 次提交
  8. 07 4月, 2022 4 次提交
  9. 06 4月, 2022 4 次提交
  10. 05 4月, 2022 3 次提交
    • Z
      Implement AutoTuneStatus class for Kernel Auto Tune (#41218) · b0f8000e
      Zhang Ting 提交于
      * switch autotune
      
      * implement AutoTuneCache
      
      * implement AutoTuneCache class
      
      * add pybind api
      
      * add dygraph test
      
      * support static mode and eager mode and improve unittests
      
      * rename the SwitchAutoTune Class and improve tests
      
      * improve AutoTuneStatus and reduce the cost of tests
      b0f8000e
    • W
      [Eager] dataloader2 (#41338) · 3b0e911c
      wanghuancoder 提交于
      * eager math op, test=develop
      
      * eager support lookahead, test=develop
      
      * refine,test=develop
      
      * refine doc, test=develop
      
      * refine,test =develop
      
      * refie, test=develop
      
      * refie, test=develop
      
      * refie, test=develop
      
      * test_paddle_multiprocessing
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix bug, test=develop
      
      * refine, test=develop
      
      * dataloader, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * test_datasets timeout, test=develop
      
      * refine, test=develop
      3b0e911c
    • A
      [Eager] Fix empty tensor Initializer bug with shape=[] (#41374) · ceb3382b
      Aurelius84 提交于
      * [Eager] Fix empty tensor Initializer bug with shape=[]
      
      * [Eager] Fix empty tensor Initializer bug with shape=[]
      
      * ignore two unittest
      
      * fix unittest
      ceb3382b
  11. 04 4月, 2022 7 次提交
  12. 03 4月, 2022 3 次提交
  13. 02 4月, 2022 3 次提交
    • A
      [Eager]Fix eager no take effect problem (#41291) · 50714d5c
      Aurelius84 提交于
      * [Eager]Fix eager no take effect problem
      
      * add element_wise and fix greater_than
      50714d5c
    • S
      Add graph apis (#40809) · b0398c8e
      Siming Dai 提交于
      * Add graph_reindex API
      
      * add graph_sample_neighbors api
      
      * Add buffer
      
      * delete VLOG
      
      * delete thrust::copy for output
      
      * add ShareDataWith
      
      * delete graph_reindex hashtable output
      
      * add graph_reindex dispensable
      
      * add reindex unittest, move memset to cuda kernel, change api
      
      * fix conflict
      
      * add reindex buffer for gpu version note
      
      * fix conflicts for op_func_generator
      
      * Add fisher_yates sampling, add dispensable, change infermeta
      
      * add dtype for edge_id
      
      * fix rocm ci and static check ci
      
      * add unittest
      
      * fix unittest
      
      * fix unittest
      
      * fix bug
      b0398c8e
    • W
      [Eager] Fix Pylayer compile error (#41240) · 1d8246b0
      wanghuancoder 提交于
      * fix bug, test=develop
      
      * refine, test=develop
      1d8246b0