1. 02 Feb, 2023 1 commit
    • 【PRIM】Support use operator's output metadata info in constructing static... · d8643cb6
      Committed by Xiaoxu Chen
      【PRIM】Support use operator's output metadata info  in constructing static backward composite (#50043)
      
      * [prim] support custom target_gradients
      
      * support infershape after append one gradop
      
      * [prim] add simple net test
      
      * fix test_loop segment fault bug
      
      * [prim] fix infer shape segment fault bug when output of grad_op_desc is empty
  2. 31 Jan, 2023 5 commits
  3. 30 Jan, 2023 2 commits
    • Support stream priority for standalone executor (#49939) · 172d1de6
      Committed by Ruibiao Chen
      * Support stream priority for standalone executor
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
    • [Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e
      Committed by zmxdream
      * add set slot_num for psgpuwraper (#177)
      
      * add set slot_num_for_pull_feature for psgpuwarper
      
      * Add get_epoch_finish python interface (#182)
      
      * add get_epoch_finish interface
      
      * add return
      
      * delete return
      
      * add unzip op (#183)
      
      * fix miss key for error dataset (#186)
      
      * fix miss key for error dataset
      
      * fix miss key for error dataset
      Co-authored-by: yangjunchao <yangjunchao@baidu.com>
      
      * add excluded_train_pair and infer_node_type (#187)
      
      * support return of degree (#188)
      
      * fix task stuck in barrier (#189)
      Co-authored-by: yangjunchao <yangjunchao@baidu.com>
      
      * check node/feature format when loading (#190)
      
      * check node&feature format when loading
      
      * check node&feature format when loading (2)
      
      * degrade log (#191)
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]replace LodTensor with phi::DenseTensor
      
      * [PGLBOX]fix gpu_primitives.h include path
      
      * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * fix code style
      
      * fix code style
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip
      
      * fix code stype
      
      * add ut
      
      * add c++ ut & fix train_mode_ set
      
      * fix load into memory
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix code style
      
      * fix collective
      
      * fix unzip_op.cc
      
      * fix barrier
      
      * fix code style
      
      * fix barrier
      
      * fix barrier
      
      * fix code styple
      
      * fix unzip
      
      * add unzip.py
      
      * add unzip.py
      
      * fix unzip.py
      
      ---------
      Co-authored-by: chao9527 <33347532+chao9527@users.noreply.github.com>
      Co-authored-by: Siming Dai <908660116@qq.com>
      Co-authored-by: huwei02 <53012141+huwei02@users.noreply.github.com>
      Co-authored-by: yangjunchao <yangjunchao@baidu.com>
  4. 29 Jan, 2023 1 commit
  5. 28 Jan, 2023 1 commit
  6. 20 Jan, 2023 1 commit
  7. 17 Jan, 2023 3 commits
    • Refine munmap freq for RefcountedMemoryMapAllocation (#49691) · 3fdc105f
      Committed by zhangbo9674
      * refine munmap freq for ref_cnt_mmap_allocator
      
      * add shm reuse logic
      
      * fix compile bug
      
      * fix compile bug
      
      * fix bug of file refcount
      
      * fix compile bug
      
      * fix compile bug
      
      * refine code for delete shm case
      
      * polish code
      
      * refine shm cache pool size setting logic
      
      * set buffer is 2
      
      * refine shm cache size logic
      
      * refine max shm cache
      
      * refine shm cache size
    • [PHI]Change feed_op to phi kernel (#49116) · f7f1dc03
      Committed by YuanRisheng
      * change feed_op to phi kernel
      
      * fix ci bugs
      
      * fix build bugs
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * perfect code
      
      * perfect comment code
      
      * fix install bugs
      
      * modify code according comment
      
      * remove visitor in feed_op
      
      * modify according comment
      
      * perfect code according comment
      
      * add infershape
      
      * fix py3 bugs
      
      * fix getexpected kernel type
      
      * fix getexpected kernel type
      
      * fix ci bugs
      
      * add registry for custom device
      
      * fix py3 bugs
      
      * fix floating point error
      
      * fix py3 test bugs
    • [Dy2St]Support call backward() without params in dy2st (#49812) · 2f24b2d8
      Committed by WangZhen
      * Support call backward() without params in dy2st
  8. 13 Jan, 2023 4 commits
  9. 12 Jan, 2023 1 commit
    • Fix reduce func bug in process_group_bkcl (#49749) · 8e291bf7
      Committed by jameszhang
      * Fix reduce func bug in process_group_bkcl
      
      Also catch up with a recent process_group PR that failed to add XPU branch.
      Note that reduce is still accomplished by allreduce for xpu. Fix this should
      xccl lib be updated.
      
      * fix compile issue for non-XPU
  10. 11 Jan, 2023 1 commit
  11. 10 Jan, 2023 2 commits
  12. 09 Jan, 2023 3 commits
    • Prim paddle Basic (#49272) · 2f601282
      Committed by Jiabin Yang
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: zyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: wangruting <wangruting@baidu.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: charles-hit <wanghao107@baidu.com>
      Co-authored-by: xiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
    • [Bug fixes] remove duplicated element_size (#49631) · 1574a862
      Committed by 骑马小猫
      * remove dumplicate element_size
      
      * remove tensor_methon_* method
    • Create comm_context and modified static init (#49536) · 04e24e58
      Committed by LiYuRio
      * comm_context and static init
      
      * refactor: move to phi/core/distributed
      
      * refactor: avoid mutable_data usage
      
      * fix: windows sock
      
      * fix: device without nccl
      Co-authored-by: Wen Sun <syl1887415157@126.com>
  13. 06 Jan, 2023 3 commits
    • [Inference] fix pass_builder (#49595) · 44cb3da3
      Committed by Yuanle Liu
    • a214e5dc
    • [Auto Parallel] Merge dist attrs from python into c++ (#49214) · c7899074
      Committed by Yulong Ao
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Merge dist attrs of Python into C++
      
      * [Auto Parallel] Add back deleted importing
      
      * [Auto Parallel] Add back removed unittest
      
      * [Auto Parallel] Remove type qualifiers of return types
      
      * [Auto Parallel] Fix some bugs
      
      * [Auto Parallel] Fix a bug of the quant pass
      
      * [Auto Parallel] Fix the code style
  14. 05 Jan, 2023 2 commits
    • Yj/rm core ops exp (#49490) · 70ea88bf
      Committed by 姜永久
      * rm op_function_generator
      
      * rm op_func_generator.h
      
      * rm op_function
      
      * modify cmake
      
      * rm op_function.h
      
      * rm check for op_function_generator.cc
      
      * reset imperative
      
      * rm python part
      
      * fix imperative
      
      * lint
      
      * lint
      
      * modify legacy_c
      
      * review
      
      * modify
      
      * modify legacy
      
      * rm gen op_functions code
      
      * reset framework
      
      * rm core.ops for test
      
      * core.ops->core.eager.ops.legacy
      
      * not raiseError for xpu
    • Refactor `ProcessGroup` to support comm context migration & clang compilation (#49451) · 1be70bc5
      Committed by Wen Sun
      * refactor: use base class
      
      * fix: incorrect deps
      
      * fix: add missing header
      
      * refactor: update class structures
      
      * fix: bkcl typo
      
      * fix: remove redundant def
  15. 04 Jan, 2023 1 commit
  16. 03 Jan, 2023 2 commits
  17. 30 Dec, 2022 1 commit
  18. 28 Dec, 2022 2 commits
  19. 27 Dec, 2022 1 commit
  20. 26 Dec, 2022 2 commits
    • [0d Tensor] update scatter for zero-dimension tensor (#49279) · 73aa98cf
      Committed by Roc
      * revert concat and change concat to stack
      
      * let stack kernel support int8, uint8 and bool type
    • [Auto Parallel] Merge the python and c++ impls of ProcessMesh (#47503) · 1c0afa79
      Committed by Yulong Ao
      * [Auto Parallel] Rename methods of ProcessMesh
      
      * [Auto Parallel] Impl the python process_mesh by the c++ one
      
      * [Auto Parallel] Add some minor modifications
      
      * [Auto Parallel] Rename some methods
      
      * [Auto Parallel] Remove unnecessary codes
      
      * [Auto Parallel] Add back some removed files
      
      * [Auto Parallel] Fix bugs
      
      * [Auto Parallel] Fix a bug
      
      * Update process_mesh.cc
      
      * [Auto Parallel] Fix a bug
  21. 23 Dec, 2022 1 commit