1. 10 1月, 2023 7 次提交
  2. 09 1月, 2023 9 次提交
    • W
      Preln groupnorm (#49463) · 591be3bd
      wenbin 提交于
      * skip_groupnorm
      
      * init
      
      * preln
      
      * add ut
      
      * more assert
      
      * set timeout
      
      * fix windows ci issue
      591be3bd
    • H
      Rewrite batch norm act fuse pass tester (#49277) · aaa25222
      Hulek 提交于
      * Rewritten
      
      * change mkldnn to onednn
      
      * fix cmake name
      aaa25222
    • Q
      add fill/fill_any for kunlun (#49645) · 31ea3231
      QingshuChen 提交于
      31ea3231
    • Y
      [XPU] add einsum fill diagonal and diagonal kernels (#49465) · a5bf156b
      ykkk2333 提交于
      * migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun
      
      * fix dlrm throughput problem, test=kunlun
      
      * add xpu einsum, fill_diagonal, and diagonal kernels, test=kunlun
      a5bf156b
    • J
      Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
      2f601282
    • W
      [0 Tensor support] cumprod (#49550) · 50a8b655
      wangzhen38 提交于
      50a8b655
    • Z
      [AutoParalle] balancing the calculation of global_norm in data parallel (#49510) · 926c4bd2
      zhaoyingli 提交于
      * [AutoParalle] balancing the calculation of global_norm in data parallel
      
      * fix unittest
      
      * update cond pure_data_parallel
      926c4bd2
    • X
      [ Dy2static ] Add ParameterRecorder to support science training cases. (#49459) · c549c6b9
      xiongkun 提交于
      * [Dy2static] Add ParameterRecorder
      
      * filter by shape(tensor)==0
      
      * fix code by review
      
      * fix random failed in CI. (especially coverage)
      
      * fix bugs
      
      * remove API changes to avoid static CI approval
      c549c6b9
    • L
      Create comm_context and modified static init (#49536) · 04e24e58
      LiYuRio 提交于
      * comm_context and static init
      
      * refactor: move to phi/core/distributed
      
      * refactor: avoid mutable_data usage
      
      * fix: windows sock
      
      * fix: device without nccl
      Co-authored-by: 元无心's avatarWen Sun <syl1887415157@126.com>
      04e24e58
  3. 07 1月, 2023 1 次提交
    • R
      Enable standalone executor for fleet training (#49293) · 67fc8e93
      Ruibiao Chen 提交于
      * Enable standalone executor for fleet training
      
      * Update code
      
      * Replace use_standalone_executor utils in auto parallel
      
      * Update code
      
      * Diable standalone executor for test_pass_sharding
      
      * Update code
      
      * Set sequential run for auto parallel
      
      * Fix dist_attr bug
      
      * Set sequential run for auto parallel
      67fc8e93
  4. 06 1月, 2023 7 次提交
  5. 05 1月, 2023 9 次提交
  6. 04 1月, 2023 5 次提交
  7. 03 1月, 2023 2 次提交