- 06 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* set use_cudnn=true for conv2d * clear opmaker of matmul_v2 * fix bug of set_attr * add extra attr checker in infer_shape
-
- 05 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove extra attr of opmaker * remove extra attr of gru * revert scale * fix bug of create_op * add extra attr checker in infer_shape * fix gelu
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 25 8月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
-
- 15 8月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Move the distributed info from python to c++ * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc * [Auto Parallel] Add the lost file * [Auto Parallel] Make the dist attr be unique_ptr * [Auto Parallel] Add the proto conversion * [Auto Parallel] Improve the proto support * [Auto Parallel] Fix the bugs for adding a device or a link * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper * [Auto Parallel] Improve the impl of these dist attrs * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h * [Auto Parallel] Remove unused codes * [Auto Parallel] Check whether the dist attr is null * [Auto Parallel] Implement the assign operator for OpDesc explicitly
-
- 13 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* add cached_serialize_str_ * support program hash * add sha * add ut * use hash_str only for new_exe * fix attr order
-
- 10 8月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* fix proto bug * add ut * reset need_update for var_desc * refine code * fix var desc order issue
-
由 Aurelius84 提交于
* [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute * add unittest for inference predictor
-
- 19 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Rename BOOST_GET macros * Fix conflicts
-
- 04 7月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
- 28 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* boost::variant -> paddle::variant * boost::variant.apply_visit -> paddle::visit * Update pybind_boost_hraders.h * Fix CINN compilation errors * Revert FetchResultType
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 27 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* unify utils naming style * polish details
-
- 26 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log
-
- 25 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* init commit * remove two hash impl * fix bug * polish details * fix compile failed * fix compile failed * fix compile failed * add default kernel sig cache * fix get kernel arg defs error * remove kernel arg defs cache * fix origin op execute
-
- 17 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* split phi and fluid infermeta context * resolve conflict * fix type error * optimize scheduling perf * spec small vector size * replace all grad var name * fix test failed * move init defalut signature * polish details * polish details * fix no init bug * init sig for tests * add init sig for infer * fix infrt error * fix infrt failed * fix kunlun error * fix infrt failed
-
- 13 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove stack_grad infershape * fix bug of output with null * fix bug
-
- 04 4月, 2022 1 次提交
-
-
由 hong 提交于
* add dropout slice yaml * remove useless code * fix infer shape error * skip infrt compile for dropout
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 14 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add has_attr for arg map context * skip useless attr now * skip attr if not exists * fix typo
-
- 13 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* fix mkldnn invalid infershape * add unittest for mkldnn in new executor * add import os
-
- 30 12月, 2021 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto parallel] Make the id of var and op unique * [Auto Parallel] Rename back dist_context to distop_context
-
- 14 12月, 2021 1 次提交
-
-
由 Aurelius84 提交于
-
- 15 9月, 2021 1 次提交
-
-
由 王明冬 提交于
* clip op extra information when export model,test=ocr * rename clip_extra parameter to kwargs in save_inference_model, test=ocr
-
- 24 8月, 2021 1 次提交
-
-
由 Yulong Ao 提交于
* add auto_parallel dir * mv to paddle.distributed * add shard_xx api * add distributed attrs for var * add ut, test=develop * add dist * update * update * update * update * update * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update * update * update * update * update * update, test=develop * update, test=develop * update * update * delete unused proto * resotre op_desc * restore type_defs * update var_desc * remove dimss_mapping for proto_pybind * update interface.py * update framework.py * update * update * add auto_parallel dir * mv to paddle.distributed * add shard_xx api * add distributed attrs for var * add ut, test=develop * [WIP] Add the auto completion feature and related codes * [WIP] Improve the auto completion and related codes * [WIP] Make the auto completion to support data-parallel * [WIP] Make the completion support mp and dp+mp * [WIP] Refactor auto completion unit test for MLP * [WIP] Refactor the implementation of DistributedOperatorImpl * [WIP] Improve dims_mapping update rule and fix a bug * [WIP] Support auto completion for one transformer decoder layer * [WIP] Add a minor change * [WIP] Fix a bug within the uint test * Shard XShape tensor, add embedding completion and refactor code * Add the distributed_operators dir to setup.py.in * Improve the completion process and add the unittest for gpt * fix process_mesh ut * fix process_mesh ut * update * update, test=develop * Add support for automatically completing distributed attrs of special ops * update * update * update * fix doc sample codes, test=develop * improve coverage, test=develop * add static_mode check, test=develop * Model the cluster for cost model and physical mapping * update, test=develop * add set_placement, test=develop * Add the check to make sure the candidate tensors' size is great than zero * update doc, test=develop * update doc, test=develop * update doc, test=develop * update doc, test=develop * update, test=develop * Auto mark dist attrs annotated by user * update ndarray to nested list, test=develop * update, test=develop * Add auto-completion module for auto-parallel (based on PR#33804) * Remove unnecessary files * Remove unrelated files for the auto completion pr * Update the unit test to improve the coverage * Modify codes based on reviews * Minor changes for CI * Improve some codes based on new comments * Fix bugs caused by shallow copy in attributes.py * Imporve amend_distributed_attr_for_program in context.py * Other changes for weihang's comments Co-authored-by: Nsandyhouse <lilong12@baidu.com>
-
- 26 4月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
* Unset ReserveSpace for inference program. * Support training from an inference program.
-
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-
- 11 1月, 2021 1 次提交
-
-
由 liym27 提交于
Support vector<double> as type of op attribute and op set_value suppport vector<double> as value (#30126)
-
- 20 8月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* polish framework error msg part 5 * revert enforce change * refine error type * trigger ci check * polish details by review comment
-
- 13 8月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* add unchaged infershape function * add broadcast infershape function * fix bug * rename infershape functions * add UnaryOpUnchangedInferShapeCheckAxis * add error message * add test for common infer shape functions * dont update existed ops * dont update op_desc.h * add more test * add error check, refine error message
-
- 30 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* refine error stack format * polish compile traceback format * polish detail format
-
- 23 6月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* Paddle-TensorRT support slim QAT. test=develop * add comments. test=develop * use RenameInput instead of ResetInputs. test=develop
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 12 4月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 23 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 14 1月, 2020 1 次提交
-
-
由 Zhen Wang 提交于
-
- 06 12月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* polish infer shape registry, test=develop * modify some operators registry, test=develop
-