1. 04 6月, 2022 1 次提交
  2. 16 5月, 2022 1 次提交
  3. 12 5月, 2022 1 次提交
  4. 01 5月, 2022 1 次提交
  5. 27 4月, 2022 1 次提交
  6. 26 4月, 2022 1 次提交
  7. 25 4月, 2022 2 次提交
  8. 24 4月, 2022 1 次提交
  9. 21 4月, 2022 2 次提交
  10. 20 4月, 2022 1 次提交
  11. 19 4月, 2022 1 次提交
  12. 18 4月, 2022 2 次提交
  13. 17 4月, 2022 1 次提交
    • C
      [Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96
      Chen Weihang 提交于
      * split phi and fluid infermeta context
      
      * resolve conflict
      
      * fix type error
      
      * optimize scheduling perf
      
      * spec small vector size
      
      * replace all grad var name
      
      * fix test failed
      
      * move init defalut signature
      
      * polish details
      
      * polish details
      
      * fix no init bug
      
      * init sig for tests
      
      * add init sig for infer
      
      * fix infrt error
      
      * fix infrt failed
      
      * fix kunlun error
      
      * fix infrt failed
      7ee31a96
  14. 15 4月, 2022 1 次提交
  15. 14 4月, 2022 2 次提交
  16. 13 4月, 2022 1 次提交
  17. 12 4月, 2022 1 次提交
  18. 09 4月, 2022 1 次提交
  19. 07 4月, 2022 1 次提交
  20. 05 4月, 2022 1 次提交
    • L
      [new-exec] enable the new standalone executor by default (#41179) · 93ea1297
      Leo Chen 提交于
      * enable new executor by default
      
      * enable stream safe allocator
      
      * test=document_fix;test=coverage
      
      * do not use scope in op kernel
      
      * fit empty program for new executor
      
      * fix communication depend
      
      * fix test_sync_batch_norm
      
      * skip unsupported place
      
      * refine datatransfer
      
      * fit for dirtributed program
      
      * fix dependencpy
      
      * fix some ut
      93ea1297
  21. 03 4月, 2022 1 次提交
    • C
      [Phi]Concat grad (#41112) · 3f57ef7a
      chentianyu03 提交于
      * add concat_grad kernel
      
      * fix error
      
      * remove comment code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
      3f57ef7a
  22. 02 4月, 2022 2 次提交
  23. 01 4月, 2022 3 次提交
  24. 31 3月, 2022 1 次提交
    • L
      [new-exec] fit mkldnn op (#41058) · 02cf6764
      Leo Chen 提交于
      * fix bug that some op has no op_role attr
      
      * add mkldnn support for new executor
      
      * fit for mkldnn data_transfer
      
      * fit for mkldnn data_transfer
      02cf6764
  25. 30 3月, 2022 1 次提交
  26. 29 3月, 2022 3 次提交
  27. 28 3月, 2022 2 次提交
    • H
      Move some activation to phi (#40727) · e77a947e
      hong 提交于
      * update
      
      * add forward case
      
      * update
      
      * update; test=develop
      
      * add some grad kernel; test=develop
      
      * move gpu kernel; test=develop
      
      * update
      
      * update;
      
      * update test;
      
      * fix selected rows bug;
      
      * add mix vector include ;
      
      * add mixed vector depen; test=develop
      
      * add logit grad signature;
      
      * polish code
      
      * fix bug;
      
      * add namespace for abs
      
      * revert code
      
      * not move softsign
      
      * revmove duplate register;
      
      * fix softsign bug
      
      * polish code
      
      * format
      
      * format
      
      * fix bug
      
      * remove cmake dep
      
      * add square sqrt selected rows support
      
      * update
      
      * remove clip norm
      
      * add standalone executor sqrt dep
      
      * standalone exec denp sqrt
      
      * remove sqrt op in cmkaelist
      
      * open some case
      e77a947e
    • L
  28. 27 3月, 2022 3 次提交
    • L
      [new-exec] fit for mkldnn and inplace op (#40955) · afa0e82c
      Leo Chen 提交于
      * fit for mkldnn and inplace op
      
      * fix compile
      
      * refine ut
      
      * register op version
      
      * fix inplace op
      
      * fix transfer_layout
      afa0e82c
    • H
      Move slice to phi (#40736) · b8236b7b
      hong 提交于
      * move slice to pten
      
      * merge develop; test=develop
      
      * fix slice bug;
      
      * update
      
      * update
      
      * fix error
      
      * update
      
      * fix bug
      
      * polish code
      
      * polish code
      
      * polish code
      
      * try to fix windows bug
      
      * add gpu compile flag;
      
      * try to fix
      
      * remov template;
      
      * polish code;
      
      * fix npu bug;
      
      * fix npu bug
      
      * fix npu bug; test=develop
      
      * fix slice bug;
      
      * remove no need dep
      b8236b7b
    • F
      Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy (#40886) · 0ad2e192
      From00 提交于
      * Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy
      
      * Set FLAGS_use_stream_safe_cuda_allocator to false
      
      * Update
      
      * Remove unnecessary code
      
      * Fix CI errors
      
      * Add UT
      0ad2e192