1. 17 Jan, 2023 1 commit
    • [PHI]Change feed_op to phi kernel (#49116) · f7f1dc03
      YuanRisheng committed
      * change feed_op to phi kernel
      
      * fix ci bugs
      
      * fix build bugs
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * perfect code
      
      * perfect comment code
      
      * fix install bugs
      
      * modify code according to comments
      
      * remove visitor in feed_op
      
      * modify according to comments
      
      * perfect code according to comments
      
      * add infershape
      
      * fix py3 bugs
      
      * fix getexpected kernel type
      
      * fix getexpected kernel type
      
      * fix ci bugs
      
      * add registry for custom device
      
      * fix py3 bugs
      
      * fix floating point error
      
      * fix py3 test bugs
  2. 13 Jan, 2023 1 commit
  3. 11 Jan, 2023 1 commit
  4. 09 Jan, 2023 2 commits
    • Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang committed
      * prototype of composite grad in paddle
      
      * prototype of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify ignore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw exception when neither GradOpMaker nor GradCompOpMaker has been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singleton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove additional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: zyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: wangruting <wangruting@baidu.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: charles-hit <wanghao107@baidu.com>
      Co-authored-by: xiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
    • Create comm_context and modified static init (#49536) · 04e24e58
      LiYuRio committed
      * comm_context and static init
      
      * refactor: move to phi/core/distributed
      
      * refactor: avoid mutable_data usage
      
      * fix: windows sock
      
      * fix: device without nccl
      Co-authored-by: Wen Sun <syl1887415157@126.com>
  5. 06 Jan, 2023 1 commit
  6. 21 Dec, 2022 1 commit
  7. 20 Dec, 2022 1 commit
  8. 15 Dec, 2022 1 commit
  9. 12 Dec, 2022 1 commit
  10. 06 Dec, 2022 1 commit
    • Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38
      zyfncg committed
      * delete Bias and ResidualData in OpMaker of conv2d
      
      * delete extra input of conv3d
      
      * refactor pass of conv_bias_fusion
      
      * fix mkldnn dependency
      
      * fix mkldnn compile
      
      * fix test_conv_bias_mkldnn_fuse_pass
      
      * polish some code
      
      * remove useless log
      
      * fix analyzer_vit_ocr_tester
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix test_analyzer_ocr
      
      * add fused_conv_sig
      
      * fix performance regression
      
      * fix performance regression
  11. 25 Nov, 2022 1 commit
    • [PROFILER] add flops for Profiler (#47766) · 3d1981ad
      Chitsing KUI committed
      * attr ready
      
      * op ip ready
      
      * start dynamic
      
      * end2end ok
      
      * input shape to map, stat by op
      
      * layer wip
      
      * first version ready
      
      * fix proto depds
      
      * fix profiler deps
      
      * fix flops typo, rm tuple shape
  12. 18 Nov, 2022 1 commit
  13. 10 Nov, 2022 1 commit
    • XPU multi-card support eager mode (#47445) · 3b91f8f3
      james committed
      * XPU support eager mode
      
      * add unittest for XPU eager mode
      
      * minor bugfix
      
      * minor bugfix, test=kunlun
      
      * correct copyright info
      
      * 1. remove unused vars/funcs
      2. ProcessGroupBKCL inherits from ProcessGroupStream
      
      * bugfix for fp16 in eager mode multi-card, test=kunlun
      
      * rebase & fix a few issues
      
      * use new processgroup interface, test=kunlun
      
      * fix compile issue, test=kunlun
  14. 03 Nov, 2022 1 commit
  15. 26 Oct, 2022 2 commits
  16. 21 Oct, 2022 1 commit
  17. 19 Oct, 2022 1 commit
  18. 13 Oct, 2022 2 commits
    • add thread name for dataloader (#46990) · 770501b8
      Leo Chen committed
    • [WIP] PaddlePaddle distributed reinforcement learning feature development (#45998) · f0afcabc
      Xinger committed
      * add rpc module in cpp side
      
      * add rpc module in python side
      
      * support win32 and mac for rpc
      
      * code optimization
      
      * optimize code
      
      * update rpc
      
      * update rpc launch
      
      * rpc remove rank and world_size api
      
      * fix logger import bug
      
      * remove support for win and mac
      
      * remove support for xpu, npu, cinn and rocm
      
      * remove support for xpu, npu, cinn and rocm
      
      * fix shutdown barrier timeout bug
      
      * update:python_rpc_handler to shared ptr
      
      * fix master shutdown first bug
      
      * tests support for cpu
      
      * update log to vlog
      
      * update get service info api
      
      * add single process test case
      
      * remove process group
      
      * remove some useless dependencies
      
      * update rpc api comments
      
      * update rpc comments: Example to Examples
      
      * update rpc api comments
      
      * update rpc api comments
      
      * update launch api comments
      
      * update init_rpc comments
      
      * update rpc sync and async comments
      
      * fix bug: init_rpc can't be called repeatedly in a process
      
      * update rpc api comment: make master endpoint unique
      
      * update rpc api:service to worker, timeout_ms to timeout
      
      * rename ServiceInfo to WorkerInfo
      
      * refactor: rename server to worker, log to vlog
      
      * add launch test
      
      * remove unused codes
      
      * refine
  19. 11 Oct, 2022 1 commit
  20. 28 Sep, 2022 1 commit
    • Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang committed
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
  21. 27 Sep, 2022 1 commit
  22. 21 Sep, 2022 1 commit
  23. 19 Sep, 2022 1 commit
  24. 16 Sep, 2022 1 commit
    • Unify core avx and core_noavx to libpaddle (#46095) · 267d71a4
      Chen Weihang committed
      * unify  core_avx and core_noavx
      
      * fix except error
      
      * revert mac compile logic
      
      * revert dylib to so
      
      * add core_noavx branch
      
      * remove core_noavx
      
      * replace paddle_core by lib paddle
      
      * polish var name
      
      * replace paddle_core by libpaddle
      
      * update custom device commit
      
      * polish code by comments
  25. 15 Sep, 2022 1 commit
  26. 14 Sep, 2022 1 commit
  27. 09 Sep, 2022 2 commits
  28. 08 Sep, 2022 1 commit
  29. 06 Sep, 2022 2 commits
  30. 30 Aug, 2022 1 commit
    • Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg committed
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge conflict
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
  31. 29 Aug, 2022 1 commit
    • [new_exe] Dy2Static support new_executor (#44450) · aba1295b
      zhangbo9674 committed
      * add interpretercore
      
      * refine backward program id
      
      * add code
      
      * refine program
      
      * refine code
      
      * create forward/backward_program by prog2graph2prog method
      
      * test, do not care
      
      * refine code
      
      * refine code
      
      * refine code
      
      * test, do not care
      
      * add interpretercore
      
      * add scope
      
      * refine scope create method
      
      * add jit for new_exe
      
      * solve conflict
      
      * delete unused code
      
      * polish code
      
      * polish code
      
      * refine scope in inplace
      
      * refine for datatransfer
      
      * refine _rebuild_from_desc
      
      * refine control eager deletion attr
      
      * refine used_for_jit
      
      * refine jit for infer
      
      * op size0 use ori program
      
      * polish code
      
      * refine jit
      
      * refine run_program_op ut
      
      * refine inplace
      
      * refine control
      
      * refine graph helper
      
      * refine control
      
      * refine inplace
      
      * refine buffer_share_inplace_pass
      
      * polish code
      
      * polish code
      
      * refine usage for compilerProgram
      
      * refine control
      
      * test
      
      * test core cache
      
      * refine code
      
      * refine io.py
      
      * increase test_seq2seq timeout
      
      * refine convert program
      
      * refine interpretercore_cache release
      
      * delete buildinplace
      
      * refine partial_program && io
      
      * refine code for io
      
      * test
      
      * test
      
      * test
  32. 18 Aug, 2022 1 commit
  33. 15 Aug, 2022 1 commit
    • [Auto Parallel] Move the distributed info from python to c++ (#44510) · a52357fe
      Yulong Ao committed
      * [Auto Parallel] Move the distributed info from python to c++
      
      * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc
      
      * [Auto Parallel] Add the lost file
      
      * [Auto Parallel] Make the dist attr be unique_ptr
      
      * [Auto Parallel] Add the proto conversion
      
      * [Auto Parallel] Improve the proto support
      
      * [Auto Parallel] Fix the bugs for adding a device or a link
      
      * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper
      
      * [Auto Parallel] Improve the impl of these dist attrs
      
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto dependency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
      
      * [Auto Parallel] Remove unused codes
      
      * [Auto Parallel] Check whether the dist attr is null
      
      * [Auto Parallel] Implement the assign operator for OpDesc explicitly
  34. 12 Aug, 2022 1 commit
    • [Auto Parallel] Pybind ProcessMesh and DeviceMesh (#45013) · 5bf3dec9
      Yulong Ao committed
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto dependency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
  35. 11 Aug, 2022 1 commit