1. 23 2月, 2023 1 次提交
  2. 21 2月, 2023 1 次提交
  3. 20 2月, 2023 3 次提交
    • J
      8ad635d5
    • H
      [Cpp Extension] Support Cpp Extension (#49893) · 2a0563fa
      HongyuJia 提交于
      * update include directory
      
      * fully support C++ extension, pass unittest
      
      * fix include directory
      
      * support both extension and operator in one file
      
      * polish testcase
      
      * add jit unittest
      
      * update third_party.cmake, pass CI test
      
      * fix cmake
      
      * fix setup
      
      * fix inference, fix unittest precision
      
      * fix unittest precision
      
      * fix inference_lib cmake
      
      * try fix setup, try fix inference_lib
      
      * try fix inference_lib pybind
      
      * fix mix_op_extension, fix inference_lib
      
      * fix mix_op_extension, fix inference_lib
      
      * change cmake
      
      * change cmake
      
      * add compile flags
      
      * add Python.h headerfile
      
      * add test_custom_plugin_creater cmake
      
      * comment compile flag
      
      * pass all CI
      
      * pass all CI
      
      * comment compile flag
      
      * try solve test_custom_plugin_creater link error
      
      * try solve test_custom_plugin_creater link error
      
      * polish codes
      
      * remove windows compile flag
      
      * remove python_include_path
      
      * update pybind11, 2.4.3->2.6.0
      
      * update pybind11, 2.6.0->2.10.0
      
      * update pybind11, 2.10.0->2.6.0b1
      
      * update pybind11, 2.6.0b1->2.6.0, start fix unittest
      
      * fix pybind11 2.6.0 VarBase print error
      
      * fix pybind11 2.6.0 VarBase print error
      
      * handle PADDLE_ON_INFERENCE
      
      * modify according to reviewer
      
      * fix cmake
      
      * cmake decouple pybind_util when not ON_INFER
      
      * cmake decouple pybind_util when not ON_INFER
      
      * remove copy of inference_lib.cmake
      
      * change pybind.cc headerfile fluid->phi
      2a0563fa
    • H
      [Tensor operants] Polish tensor operants implementation (#50634) · 8c844356
      HongyuJia 提交于
      * polish tensor operants implementation
      
      * change year, 2021->2023
      8c844356
  4. 18 2月, 2023 1 次提交
  5. 17 2月, 2023 1 次提交
    • Y
      Rename MultiTensorAdam To FusedAdam (#50449) · e6af9bd2
      yuehuayingxueluo 提交于
      * rename multi_tensor_adam to fused_adam
      
      * fix some bugs
      
      * fix CI coverage
      
      * rename test_fused_adam.py
      
      * fix some bug
      
      * add test_fused_adam_op.py
      
      * fix some bugs
      
      * fix fused_adam_op.cc
      
      * fix CI bugs
      
      * fix CI bug
      
      * fix CI bug
      e6af9bd2
  6. 16 2月, 2023 2 次提交
  7. 15 2月, 2023 1 次提交
  8. 14 2月, 2023 2 次提交
  9. 13 2月, 2023 3 次提交
  10. 11 2月, 2023 1 次提交
    • H
      [Tensor Operator] Overload Tensor Operator (#50098) · 14e45f6b
      HongyuJia 提交于
      * init commit
      
      * fix tensor operator*
      
      * fix compile bug
      
      * bug reproduce
      
      * update commit
      
      * polish codes
      
      * fix compile bug
      
      * test begin
      
      * test begin
      
      * compile finish
      
      * restore origin composite_backward_api
      
      * pass local CI
      
      * fix merge error
      
      * fix merge error
      
      * change py_test from GPU->CPU, test custom op
      
      * polish codes, modify prim unittest
      
      * modify prim unittest
      
      * determine phi_tensor_operants location
      
      * polish codes
      
      * add header file
      
      * solve windows unresolved symbol
      
      * fix some CI error
      
      * add overload defination
      
      * fix CI inference and Windows
      
      * polish codes according to reviewers' opinion
      
      * polish codes according to reviewers' opinion
      14e45f6b
  11. 09 2月, 2023 3 次提交
    • R
      [kunlun] support async send/recv via group (#50329) · 350cd82a
      Roc 提交于
      Co-authored-by: Nzhangxiaoci <zhangxiaoci@baidu.com>
      350cd82a
    • H
      [PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a
      Huang Jiyi 提交于
      * decouple strided_memcpy
      
      * move strided_memcpy
      
      * move strided_memcpy to phi
      
      * fix namespace
      
      * update
      
      * fix gpu compile bugs
      17318c1a
    • Y
      Add MultiTenosrAdam OP (#49220) · 10654c77
      yuehuayingxueluo 提交于
      * add multi_tenosr_adam
      
      * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py
      
      * fix adam.py optimizer.py
      
      * fix adamw.py
      
      * fix test_multi_tensor_adam.py
      
      * fix CI bug
      
      * fix CI coverage
      
      * fix ci bug
      
      * fix betapow
      
      * fix some bugs
      
      * fix test_adamw_op.py
      
      * fix CI coverage
      
      * fix multi_tensor_adam_kernel.cc
      
      * fix CI bug
      
      * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py
      
      * fix code style
      
      * update C++ parts
      
      * remove python parts modification temporarily
      
      * add C++ ut
      
      * update betapow copy code logic
      
      * fix ci ut
      
      * fix windows ci
      
      * fix coverage ci
      
      * improve coverage rate
      
      ---------
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      10654c77
  12. 08 2月, 2023 1 次提交
  13. 07 2月, 2023 1 次提交
  14. 06 2月, 2023 4 次提交
  15. 03 2月, 2023 1 次提交
    • J
      【Prim】Blacklist bwd comp (#50148) · cc8a7858
      Jiabin Yang 提交于
      * refactor dir for prim
      
      * support blacklist for bwd comp
      
      * fix type error
      
      * remove additional file
      
      * fix git ignore
      
      * add more test
      
      * merge develop
      cc8a7858
  16. 02 2月, 2023 1 次提交
    • X
      【PRIM】Support use operator's output metadata info in constructing static... · d8643cb6
      Xiaoxu Chen 提交于
      【PRIM】Support use operator's output metadata info  in constructing static backward composite (#50043)
      
      * [prim] support custom target_gradients
      
      * support infershape after append one gradop
      
      * [prim] add simple net test
      
      * fix test_loop segment fault bug
      
      * [prim] fix infer shape segment fault bug when output of grad_op_desc is empty
      d8643cb6
  17. 31 1月, 2023 5 次提交
  18. 30 1月, 2023 2 次提交
    • R
      Support stream priority for standalone executor (#49939) · 172d1de6
      Ruibiao Chen 提交于
      * Support stream priority for standalone executor
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      
      * Fix compile error
      172d1de6
    • Z
      [Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e
      zmxdream 提交于
      * add set slot_num for psgpuwraper (#177)
      
      * add set slot_num_for_pull_feature for psgpuwarper
      
      * Add get_epoch_finish python interface (#182)
      
      * add get_epoch_finish interface
      
      * add return
      
      * delete return
      
      * add unzip op (#183)
      
      * fix miss key for error dataset (#186)
      
      * fix miss key for error dataset
      
      * fix miss key for error dataset
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      
      * add excluded_train_pair and infer_node_type (#187)
      
      * support return of degree (#188)
      
      * fix task stuck in barrier (#189)
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      
      * check node/feature format when loading (#190)
      
      * check node&feature format when loading
      
      * check node&feature format when loading (2£ (2)
      
      * degrade log (#191)
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]fix conflict
      
      * [PGLBOX]replace LodTensor with phi::DenseTensor
      
      * [PGLBOX]fix gpu_primitives.h include path
      
      * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip example code
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix unzip ut
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * [PGLBOX]fix code style
      
      * fix code style
      
      * fix code style
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip ut
      
      * fix unzip
      
      * fix code stype
      
      * add ut
      
      * add c++ ut & fix train_mode_ set
      
      * fix load into memory
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix c++ ut
      
      * fix code style
      
      * fix collective
      
      * fix unzip_op.cc
      
      * fix barrier
      
      * fix code style
      
      * fix barrier
      
      * fix barrier
      
      * fix code styple
      
      * fix unzip
      
      * add unzip.py
      
      * add unzip.py
      
      * fix unzip.py
      
      ---------
      Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com>
      Co-authored-by: NSiming Dai <908660116@qq.com>
      Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
      Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
      cb525d4e
  19. 29 1月, 2023 1 次提交
  20. 28 1月, 2023 1 次提交
  21. 20 1月, 2023 1 次提交
  22. 17 1月, 2023 3 次提交
    • Z
      Refine munmap freq for RefcountedMemoryMapAllocation (#49691) · 3fdc105f
      zhangbo9674 提交于
      * refine munmap freq for ref_cnt_mmap_allocator
      
      * add shm reuse logic
      
      * fix compile bug
      
      * fix compile bug
      
      * fix bug of file refcount
      
      * fix compile bug
      
      * fix compile bug
      
      * refine code for delete shm case
      
      * polish code
      
      * refine shm cache pool size setting logic
      
      * set buffer is 2
      
      * refine shm cache size logic
      
      * refine max shm cache
      
      * refine shm cache size
      3fdc105f
    • Y
      [PHI]Change feed_op to phi kernel (#49116) · f7f1dc03
      YuanRisheng 提交于
      * change feed_op to phi kernel
      
      * fix ci bugs
      
      * fix build bugs
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * perfect code
      
      * perfect comment code
      
      * fix install bugs
      
      * modify code according comment
      
      * remove visitor in feed_op
      
      * modify according comment
      
      * perfect code according comment
      
      * add infershape
      
      * fix py3 bugs
      
      * fix getexpected kernel type
      
      * fix getexpected kernel type
      
      * fix ci bugs
      
      * add registry for custom device
      
      * fix py3 bugs
      
      * fix floating point error
      
      * fix py3 test bugs
      f7f1dc03
    • W
      [Dy2St]Support call backward() without params in dy2st (#49812) · 2f24b2d8
      WangZhen 提交于
      * Support call backward() without params in dy2st
      2f24b2d8