1. 06 9月, 2022 1 次提交
  2. 30 8月, 2022 1 次提交
    • Z
      Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg 提交于
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge confilct
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      fe321f9a
  3. 29 8月, 2022 1 次提交
    • Z
      [new_exe] Dy2Static support new_executor (#44450) · aba1295b
      zhangbo9674 提交于
      * add interpretercore
      
      * refine backward program id
      
      * add code
      
      * refine program
      
      * refine code
      
      * create forward/backward_program by prog2graph2prog method
      
      * test, do not care
      
      * refine code
      
      * refine code
      
      * refine code
      
      * test, do not care
      
      * add interpretorcore
      
      * add scope
      
      * refine scope create method
      
      * add jit for new_exe
      
      * solve conflict
      
      * delete unused code
      
      * polish code
      
      * polish code
      
      * refine scope in inplace
      
      * refine for datatransfer
      
      * refine _rebuild_from_desc
      
      * refine control eager deletion attr
      
      * refine used_for_jit
      
      * refine jit for infer
      
      * op size0 use ori program
      
      * polish code
      
      * refine jit
      
      * refine run_program_op ut
      
      * refine inplace
      
      * refine control
      
      * refine graph helper
      
      * refine control
      
      * refine inplace
      
      * refine buffer_share_inplace_pass
      
      * polish code
      
      * polish code
      
      * refine usage for compilerProgram
      
      * refine control
      
      * test
      
      * test core cache
      
      * refine code
      
      * refine io.py
      
      * increase test_seq2seq timeout
      
      * refine convert program
      
      * refine interpretercore_cache release
      
      * delete buildinplace
      
      * refine partial_program && io
      
      * refine code for io
      
      * test
      
      * test
      
      * test
      aba1295b
  4. 18 8月, 2022 1 次提交
  5. 15 8月, 2022 1 次提交
    • Y
      [Auto Parallel] Move the distributed info from python to c++ (#44510) · a52357fe
      Yulong Ao 提交于
      * [Auto Parallel] Move the distributed info from python to c++
      
      * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc
      
      * [Auto Parallel] Add the lost file
      
      * [Auto Parallel] Make the dist attr be unique_ptr
      
      * [Auto Parallel] Add the proto conversion
      
      * [Auto Parallel] Improve the proto support
      
      * [Auto Parallel] Fix the bugs for adding a device or a link
      
      * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper
      
      * [Auto Parallel] Improve the impl of these dist attrs
      
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto depedency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
      
      * [Auto Parallel] Remove unused codes
      
      * [Auto Parallel] Check whether the dist attr is null
      
      * [Auto Parallel] Implement the assign operator for OpDesc explicitly
      a52357fe
  6. 12 8月, 2022 1 次提交
    • Y
      [Auto Parallel] Pybind ProcessMesh and DeviceMesh (#45013) · 5bf3dec9
      Yulong Ao 提交于
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto depedency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
      5bf3dec9
  7. 11 8月, 2022 1 次提交
  8. 09 8月, 2022 1 次提交
  9. 05 8月, 2022 1 次提交
  10. 01 8月, 2022 1 次提交
    • L
      unify gpu context (#44740) · 86763023
      Leo Chen 提交于
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
      86763023
  11. 26 7月, 2022 1 次提交
  12. 21 7月, 2022 1 次提交
  13. 19 7月, 2022 1 次提交
  14. 14 7月, 2022 1 次提交
  15. 08 7月, 2022 1 次提交
  16. 06 7月, 2022 1 次提交
    • L
      Refine StandaloneExecutor (#44076) · 6eed9f49
      Leo Chen 提交于
      * not run startup program in constructor of StandaloneExecutor
      
      * clear interface of standalone executor
      
      * clean debug code
      6eed9f49
  17. 02 7月, 2022 1 次提交
    • L
      unify cpu context, part2 (#44012) · 755438a7
      Leo Chen 提交于
      * fix init()
      
      * delete test_device_context
      
      * replace CPUDeviceContext with CPUContext
      
      * fix test_scalar
      
      * remove dot_op.cc
      
      * fix compile
      755438a7
  18. 30 6月, 2022 2 次提交
  19. 29 6月, 2022 1 次提交
  20. 28 6月, 2022 1 次提交
    • R
      Remove boost::variant (#43100) · b3cf28f8
      Ruibiao Chen 提交于
      * boost::variant -> paddle::variant
      
      * boost::variant.apply_visit -> paddle::visit
      
      * Update pybind_boost_hraders.h
      
      * Fix CINN compilation errors
      
      * Revert FetchResultType
      b3cf28f8
  21. 27 6月, 2022 2 次提交
  22. 24 6月, 2022 1 次提交
    • C
      record memory and op supplement info (#43550) · 8dd0a3b9
      chenjian 提交于
      * record memory and op supplement info
      
      * update
      
      * update
      
      * fix a bug
      
      * fix memory recording
      
      * fix a bug
      
      * update
      
      * update
      
      * fix a bug
      
      * update
      
      * fix a bug
      
      * fix a bug
      
      * fix a bug
      
      * Revert "fix a bug"
      
      This reverts commit c1d4df52762ba9ae7c7e27cd2ba4fc3a7ed9c7a5.
      
      * fix a bug
      
      * fix format
      
      * fix
      8dd0a3b9
  23. 16 6月, 2022 1 次提交
  24. 05 6月, 2022 1 次提交
  25. 02 6月, 2022 1 次提交
  26. 27 5月, 2022 1 次提交
  27. 16 5月, 2022 1 次提交
    • J
      optimize cinn find graph by graph address (#42697) · 661d0800
      jiangcheng 提交于
      * optimize cinn find graph by graph address
      
      * graph_key use int64_t instead of program string
      
      * fix framework _to_readable_code python code
      
      * rename get_readable_comile_key to get_serialize_comile_key
      661d0800
  28. 11 5月, 2022 1 次提交
  29. 05 5月, 2022 2 次提交
  30. 27 4月, 2022 1 次提交
  31. 26 4月, 2022 2 次提交
    • S
      optimize graph_engine pybind (#42192) · 1bf08eca
      seemingwang 提交于
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
      
      * test
      
      * test gpu speed
      
      * gpu_graph_engine optimization
      
      * add dsm sample method
      
      * add graph_neighbor_sample_v2
      
      * Add graph_neighbor_sample_v2
      
      * fix for loop
      
      * add cpu sample interface
      
      * fix kernel judgement
      
      * add ssd layer to graph_engine
      
      * fix allocation
      
      * fix syntax error
      
      * fix syntax error
      
      * fix pscore class
      
      * fix
      
      * change index settings
      
      * recover test
      
      * recover test
      
      * fix spelling
      
      * recover
      
      * fix
      
      * move cudamemcpy after cuda stream sync
      
      * fix linking problem
      
      * remove comment
      
      * add cpu test
      
      * test
      
      * add cpu test
      
      * change comment
      
      * combine feature table and graph table
      
      * test
      
      * test
      
      * pybind
      
      * test
      
      * test
      
      * test
      
      * test
      
      * pybind
      
      * pybind
      
      * fix cmake
      
      * pybind
      
      * fix
      
      * fix
      
      * add pybind
      
      * add pybind
      
      * optimize pybind
      
      * test
      
      * fix pybind
      
      * fix
      Co-authored-by: NDesmonDay <908660116@qq.com>
      1bf08eca
    • L
      fit for printing cinn_launch op (#42141) · ee56906e
      Leo Chen 提交于
      * fit for printing cinn_launch op
      
      * update boost::variant caster for bytes
      ee56906e
  32. 24 4月, 2022 2 次提交
    • R
      [CustomDevice] add eager mode support (#42034) · ccafd2e5
      ronnywang 提交于
      ccafd2e5
    • S
      combine graph_table and feature_table in graph_engine (#42134) · 0e0f7da6
      seemingwang 提交于
      * extract sub-graph
      
      * graph-engine merging
      
      * fix
      
      * fix
      
      * fix heter-ps config
      
      * test performance
      
      * test performance
      
      * test performance
      
      * test
      
      * test
      
      * update bfs
      
      * change cmake
      
      * test
      
      * test gpu speed
      
      * gpu_graph_engine optimization
      
      * add dsm sample method
      
      * add graph_neighbor_sample_v2
      
      * Add graph_neighbor_sample_v2
      
      * fix for loop
      
      * add cpu sample interface
      
      * fix kernel judgement
      
      * add ssd layer to graph_engine
      
      * fix allocation
      
      * fix syntax error
      
      * fix syntax error
      
      * fix pscore class
      
      * fix
      
      * change index settings
      
      * recover test
      
      * recover test
      
      * fix spelling
      
      * recover
      
      * fix
      
      * move cudamemcpy after cuda stream sync
      
      * fix linking problem
      
      * remove comment
      
      * add cpu test
      
      * test
      
      * add cpu test
      
      * change comment
      
      * combine feature table and graph table
      
      * test
      
      * test
      
      * pybind
      
      * test
      
      * test
      
      * test
      
      * test
      
      * pybind
      
      * pybind
      
      * fix cmake
      
      * pybind
      
      * fix
      
      * fix
      
      * add pybind
      
      * add pybind
      Co-authored-by: NDesmonDay <908660116@qq.com>
      0e0f7da6
  33. 19 4月, 2022 1 次提交
  34. 17 4月, 2022 1 次提交
    • C
      [Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96
      Chen Weihang 提交于
      * split phi and fluid infermeta context
      
      * resolve conflict
      
      * fix type error
      
      * optimize scheduling perf
      
      * spec small vector size
      
      * replace all grad var name
      
      * fix test failed
      
      * move init defalut signature
      
      * polish details
      
      * polish details
      
      * fix no init bug
      
      * init sig for tests
      
      * add init sig for infer
      
      * fix infrt error
      
      * fix infrt failed
      
      * fix kunlun error
      
      * fix infrt failed
      7ee31a96
  35. 15 4月, 2022 1 次提交
    • J
      Add eager string tensor (#41039) · a22b68b8
      Jack Zhou 提交于
      * Add core.eager.StringTensor __init__ which pyarray args can be passed
      
      * Add the numpy method of core.eager.StringTensor
      
      * revert tensor.to_string modification
      
      * Add ToPyObject for core.eager.StringTensor
      
      * Add debug string for core.eager.StringTensor
      
      * Remove place args of core.eager.StringTensor temporarily
      
      * Fix check string_tensor error
      
      * remove dtype of core.eager.StringTensor
      
      * add core.eager.StringTensor unittest
      
      * remove pstring from VarDesc
      
      * Add InitStringTensorWithStringTensor
      
      * Remove to_string modification
      
      * Remove zero_copy arg from StringTensor creator
      a22b68b8