1. 12 1月, 2023 6 次提交
  2. 11 1月, 2023 6 次提交
    • N
      395520f1
    • Y
      add FusedLinear pass (#49606) · 0f08a432
      yuehuayingxueluo 提交于
      * add FusedLinear pass
      
      * add fused_op_list and renname PASSES to OP_FUSION
      
      * add fused_passes_list to constants.py
      
      * add test_passes.py
      
      * fix test_fused_passes.py
      
      * fix add if float(paddle.version.cuda()) >= 11.6:
      
      * renamed test_fused_passes.py
      
      * fix CMakeList.txt
      0f08a432
    • R
      [Dy2St] 移除 ProgramTranslator (#49628) · 2bb28f31
      Ryan 提交于
      * add enable_to_static and drop some methods of ProgramTranslator
      
      * fix code style
      
      * fix cant import enable_to_static and update unitest
      
      * change unitest and rollback code of PT
      
      * fix can't import as of utils
      
      * roll back PT
      
      * fix roll back
      
      * add some unitest
      
      * add unitest and fix codestyle bug in api.py
      
      * finish all unitest
      
      * remove ProgramTranslator
      
      * fix code style
      
      * restore test_program_translator
      
      * api.py remove get_func
      
      * TestDygraphToStaticCode
      
      * fix check_type and import err
      
      * roll back PT without getcode
      
      * roll back pt with get_code
      
      * convert_to_static
      
      * fix import __all__
      2bb28f31
    • L
      fix hsigmoid_loss (#49549) · 8f0adcb5
      Linjie Chen 提交于
      8f0adcb5
    • L
      Add input check for NLLLoss (#49547) · 08bf1b49
      Linjie Chen 提交于
      * fix nll_loss
      
      * fix nll_loss
      
      * update
      
      * update
      
      * update
      
      * fix
      08bf1b49
    • rm retain_grad_flag for tests part0 (#49655) · a504508c
      姜永久 提交于
      * rm retain_grad_flag for tests
      
      * modify transpose op
      
      * retain grads for xpu tests
      
      * lint
      
      * modify xpu test
      a504508c
  3. 10 1月, 2023 13 次提交
  4. 09 1月, 2023 9 次提交
    • W
      Preln groupnorm (#49463) · 591be3bd
      wenbin 提交于
      * skip_groupnorm
      
      * init
      
      * preln
      
      * add ut
      
      * more assert
      
      * set timeout
      
      * fix windows ci issue
      591be3bd
    • H
      Rewrite batch norm act fuse pass tester (#49277) · aaa25222
      Hulek 提交于
      * Rewritten
      
      * change mkldnn to onednn
      
      * fix cmake name
      aaa25222
    • Q
      add fill/fill_any for kunlun (#49645) · 31ea3231
      QingshuChen 提交于
      31ea3231
    • Y
      [XPU] add einsum fill diagonal and diagonal kernels (#49465) · a5bf156b
      ykkk2333 提交于
      * migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun
      
      * fix dlrm throughput problem, test=kunlun
      
      * add xpu einsum, fill_diagonal, and diagonal kernels, test=kunlun
      a5bf156b
    • J
      Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
      2f601282
    • W
      [0 Tensor support] cumprod (#49550) · 50a8b655
      wangzhen38 提交于
      50a8b655
    • Z
      [AutoParalle] balancing the calculation of global_norm in data parallel (#49510) · 926c4bd2
      zhaoyingli 提交于
      * [AutoParalle] balancing the calculation of global_norm in data parallel
      
      * fix unittest
      
      * update cond pure_data_parallel
      926c4bd2
    • X
      [ Dy2static ] Add ParameterRecorder to support science training cases. (#49459) · c549c6b9
      xiongkun 提交于
      * [Dy2static] Add ParameterRecorder
      
      * filter by shape(tensor)==0
      
      * fix code by review
      
      * fix random failed in CI. (especially coverage)
      
      * fix bugs
      
      * remove API changes to avoid static CI approval
      c549c6b9
    • L
      Create comm_context and modified static init (#49536) · 04e24e58
      LiYuRio 提交于
      * comm_context and static init
      
      * refactor: move to phi/core/distributed
      
      * refactor: avoid mutable_data usage
      
      * fix: windows sock
      
      * fix: device without nccl
      Co-authored-by: 元无心's avatarWen Sun <syl1887415157@126.com>
      04e24e58
  5. 07 1月, 2023 1 次提交
    • R
      Enable standalone executor for fleet training (#49293) · 67fc8e93
      Ruibiao Chen 提交于
      * Enable standalone executor for fleet training
      
      * Update code
      
      * Replace use_standalone_executor utils in auto parallel
      
      * Update code
      
      * Diable standalone executor for test_pass_sharding
      
      * Update code
      
      * Set sequential run for auto parallel
      
      * Fix dist_attr bug
      
      * Set sequential run for auto parallel
      67fc8e93
  6. 06 1月, 2023 5 次提交