1. 23 4月, 2023 1 次提交
  2. 20 3月, 2023 1 次提交
    • S
      [Hackathon NO.71] 为 Paddle-TRT 添加 pad3d 算子 (#50986) · c36e3fd2
      Sonder 提交于
      * update codes about pad3d
      
      * add codes about Tensor type Padding
      
      * update
      
      * 更新单测文件
      
      * format code style
      
      * update and to &&'
      
      * rewrite codes about pad3d
      
      * add codes about converting paddle pad format to tensorrt pad format
      
      * fix some errors
      
      * 指定trt版本范围
      
      * 修正dims初始化方式
      
      * fix code style
      
      * update test pad values
      
      * 指定pad3d trt版本
      
      * 更新 单测 文件范围
      
      * 更新单测文件
      
      * update pad3d paddings convert codes
      
      * update pad3d
      
      * add static mode support
      
      * update test file
      
      * fix bugs about dynamic mode test codes
      
      * fix bug and add limite in op_teller
      
      * use a new padding convert method[ITensor* padding with using Slice to split the pre_pad and the  post pad]
      
      * fix PADDLE_THROW grammaly error
      
      * update test codes
      
      * 添加对于Tensor padding 的 size 判断
      c36e3fd2
  3. 22 2月, 2023 1 次提交
  4. 09 1月, 2023 1 次提交
    • J
      Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
      2f601282
  5. 22 11月, 2022 1 次提交
  6. 01 11月, 2022 1 次提交
  7. 30 8月, 2022 1 次提交
    • Z
      Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg 提交于
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge confilct
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      fe321f9a
  8. 16 8月, 2022 1 次提交
  9. 15 8月, 2022 1 次提交
    • Y
      [Auto Parallel] Move the distributed info from python to c++ (#44510) · a52357fe
      Yulong Ao 提交于
      * [Auto Parallel] Move the distributed info from python to c++
      
      * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc
      
      * [Auto Parallel] Add the lost file
      
      * [Auto Parallel] Make the dist attr be unique_ptr
      
      * [Auto Parallel] Add the proto conversion
      
      * [Auto Parallel] Improve the proto support
      
      * [Auto Parallel] Fix the bugs for adding a device or a link
      
      * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper
      
      * [Auto Parallel] Improve the impl of these dist attrs
      
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto depedency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
      
      * [Auto Parallel] Remove unused codes
      
      * [Auto Parallel] Check whether the dist attr is null
      
      * [Auto Parallel] Implement the assign operator for OpDesc explicitly
      a52357fe
  10. 10 8月, 2022 2 次提交
  11. 19 7月, 2022 1 次提交
  12. 26 6月, 2022 1 次提交
  13. 04 4月, 2022 1 次提交
    • H
      Add dropout yaml (#41355) · 1c7001e7
      hong 提交于
      * add dropout slice yaml
      
      * remove useless code
      
      * fix infer shape error
      
      * skip infrt compile for dropout
      1c7001e7
  14. 30 12月, 2021 1 次提交
  15. 09 10月, 2021 1 次提交
  16. 18 9月, 2021 1 次提交
  17. 15 9月, 2021 1 次提交
  18. 24 8月, 2021 1 次提交
    • Y
      Add auto completion module for auto parallel (#34813) · 93d862b0
      Yulong Ao 提交于
      * add auto_parallel dir
      
      * mv to paddle.distributed
      
      * add shard_xx api
      
      * add distributed attrs for var
      
      * add ut, test=develop
      
      * add dist
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update, test=develop
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update, test=develop
      
      * update, test=develop
      
      * update
      
      * update
      
      * delete unused proto
      
      * resotre op_desc
      
      * restore type_defs
      
      * update var_desc
      
      * remove dimss_mapping for proto_pybind
      
      * update interface.py
      
      * update framework.py
      
      * update
      
      * update
      
      * add auto_parallel dir
      
      * mv to paddle.distributed
      
      * add shard_xx api
      
      * add distributed attrs for var
      
      * add ut, test=develop
      
      * [WIP] Add the auto completion feature and related codes
      
      * [WIP] Improve the auto completion and related codes
      
      * [WIP] Make the auto completion to support data-parallel
      
      * [WIP] Make the completion support mp and dp+mp
      
      * [WIP] Refactor auto completion unit test for MLP
      
      * [WIP] Refactor the implementation of DistributedOperatorImpl
      
      * [WIP] Improve dims_mapping update rule and fix a bug
      
      * [WIP] Support auto completion for one transformer decoder layer
      
      * [WIP] Add a minor change
      
      * [WIP] Fix a bug within the uint test
      
      * Shard XShape tensor, add embedding completion and refactor code
      
      * Add the distributed_operators dir to setup.py.in
      
      * Improve the completion process and add the unittest for gpt
      
      * fix process_mesh ut
      
      * fix process_mesh ut
      
      * update
      
      * update, test=develop
      
      * Add support for automatically completing distributed attrs of special ops
      
      * update
      
      * update
      
      * update
      
      * fix doc sample codes, test=develop
      
      * improve coverage, test=develop
      
      * add static_mode check, test=develop
      
      * Model the cluster for cost model and physical mapping
      
      * update, test=develop
      
      * add set_placement, test=develop
      
      * Add the check to make sure the candidate tensors' size is great than zero
      
      * update doc, test=develop
      
      * update doc, test=develop
      
      * update doc, test=develop
      
      * update doc, test=develop
      
      * update, test=develop
      
      * Auto mark dist attrs annotated by user
      
      * update ndarray to nested list, test=develop
      
      * update, test=develop
      
      * Add auto-completion module for auto-parallel (based on PR#33804)
      
      * Remove unnecessary files
      
      * Remove unrelated files for the auto completion pr
      
      * Update the unit test to improve the coverage
      
      * Modify codes based on reviews
      
      * Minor changes for CI
      
      * Improve some codes based on new comments
      
      * Fix bugs caused by shallow copy in attributes.py
      * Imporve amend_distributed_attr_for_program in context.py
      * Other changes for weihang's comments
      Co-authored-by: Nsandyhouse <lilong12@baidu.com>
      93d862b0
  19. 15 7月, 2021 1 次提交
    • H
      Class for processing program (#33439) · 85642a0d
      huangxu96 提交于
      This PR creates a class to process the program at the C++ level. Currently, this class has one class method:
      GetInputsOutputsInBlock()
      85642a0d
  20. 26 4月, 2021 1 次提交
  21. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  22. 23 6月, 2020 1 次提交
  23. 11 5月, 2020 1 次提交
    • C
      Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f
      Chen Weihang 提交于
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
      aa0f254f
  24. 31 10月, 2019 1 次提交
    • H
      GradMaker for dygraph (#19706) · 8c4573a3
      hong 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add impertive type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix infernce_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
      8c4573a3
  25. 21 8月, 2019 1 次提交
  26. 28 3月, 2019 1 次提交
  27. 12 12月, 2018 1 次提交
  28. 10 12月, 2018 1 次提交
  29. 28 11月, 2018 1 次提交
  30. 26 10月, 2018 2 次提交
  31. 17 10月, 2018 1 次提交
  32. 24 8月, 2018 1 次提交
  33. 15 8月, 2018 1 次提交
  34. 14 8月, 2018 2 次提交
  35. 22 6月, 2018 1 次提交
  36. 31 5月, 2018 1 次提交
  37. 22 5月, 2018 1 次提交