- 20 3月, 2023 1 次提交
-
-
由 Sonder 提交于
* update codes about pad3d * add codes about Tensor type Padding * update * 更新单测文件 * format code style * update and to &&' * rewrite codes about pad3d * add codes about converting paddle pad format to tensorrt pad format * fix some errors * 指定trt版本范围 * 修正dims初始化方式 * fix code style * update test pad values * 指定pad3d trt版本 * 更新 单测 文件范围 * 更新单测文件 * update pad3d paddings convert codes * update pad3d * add static mode support * update test file * fix bugs about dynamic mode test codes * fix bug and add limite in op_teller * use a new padding convert method[ITensor* padding with using Slice to split the pre_pad and the post pad] * fix PADDLE_THROW grammaly error * update test codes * 添加对于Tensor padding 的 size 判断
-
- 06 2月, 2023 1 次提交
-
-
由 zyfncg 提交于
* remove extra input of conv2d * fix bug * fix unittest bug * adjust conv2d.pbtxt * fix cpu_quantize_pass_tester * revert use_addto of conv2d * fix runtime attribute * fix bug * recover force_fp32_output in conv2d * refine error info * fix bug
-
- 21 12月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor conv_activation_mkldnn_fuse_pass * refactor conv_affine_channel_mkldnn_fuse_pass * fix conv_activation_mkldnn_fuse_pass * fix mkldnn unittest * refactor int8_scale_calculation_mkldnn_pass and params_quantization_mkldnn_pass * refactor conv_elementwise_add_mkldnn_fuse_pass * fix quant * refactor conv_bn_fuse_pass * fix conv_bn_fuse_pass * refactor depthwise_conv_bn_fuse_pass * fix unittest * fix conv_bn_fuse_pass * remove redundant conv2d in params_quantization_mkldnn_pass * fix params_quantization_mkldnn_pass_tester
-
- 06 12月, 2022 1 次提交
-
-
由 zyfncg 提交于
* delete Bias and ResidualData in OpMaker of conv2d * delete extra input of conv3d * refactor pass of conv_bias_fusion * fix mkldnn dependency * fix mkldnn compile * fix test_conv_bias_mkldnn_fuse_pass * police some code * remove useless log * fix analyzer_vit_ocr_tester * fix conv_activation_mkldnn_fuse_pass * fix test_analyzer_ocr * add fused_conv_sig * fix performence regression * fix performance regression
-
- 01 11月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * Support custom stream for standalone executor Co-authored-by: NYulong Ao <aoyulong@baidu.com>
-
- 31 10月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * [Auto Parallel] Add the missiong import
-
- 26 10月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove using lodtensor part2 * resolve code format error * resolve conflict * resolve conflict * replace added frameworrk tensor
-
- 12 10月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for unary api * fix CI
-
- 10 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta * WIP: move rnn kernrl to phi * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types. This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types. * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results. * fix ci bugs * fix ci bugs * fix ci bugs * modify code according comment Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
-
- 19 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* clear extra attr of some ops in opmaker * revert clear use_cudnn for pool * fix test_operator_desc * fix Attr interface of OperatorBase
-
- 15 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* clear extra attrs of elementwise op in opmaker * fix op_debug_string_test * fix bug of grad_add * fix sort of runtime attrs
-
- 08 9月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* [OpAttr]Refine Teller logic if encounter OpDesc with Variable type Attribute * fix iterator * fix typo * fix lambda expr * fix ptr
-
- 06 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* set use_cudnn=true for conv2d * clear opmaker of matmul_v2 * fix bug of set_attr * add extra attr checker in infer_shape
-
- 05 9月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove extra attr of opmaker * remove extra attr of gru * revert scale * fix bug of create_op * add extra attr checker in infer_shape * fix gelu
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 25 8月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
-
- 15 8月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Move the distributed info from python to c++ * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc * [Auto Parallel] Add the lost file * [Auto Parallel] Make the dist attr be unique_ptr * [Auto Parallel] Add the proto conversion * [Auto Parallel] Improve the proto support * [Auto Parallel] Fix the bugs for adding a device or a link * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper * [Auto Parallel] Improve the impl of these dist attrs * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h * [Auto Parallel] Remove unused codes * [Auto Parallel] Check whether the dist attr is null * [Auto Parallel] Implement the assign operator for OpDesc explicitly
-
- 13 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* add cached_serialize_str_ * support program hash * add sha * add ut * use hash_str only for new_exe * fix attr order
-
- 10 8月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* fix proto bug * add ut * reset need_update for var_desc * refine code * fix var desc order issue
-
由 Aurelius84 提交于
* [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute * add unittest for inference predictor
-
- 19 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Rename BOOST_GET macros * Fix conflicts
-
- 04 7月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
- 28 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* boost::variant -> paddle::variant * boost::variant.apply_visit -> paddle::visit * Update pybind_boost_hraders.h * Fix CINN compilation errors * Revert FetchResultType
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 27 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* unify utils naming style * polish details
-
- 26 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log
-
- 25 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* init commit * remove two hash impl * fix bug * polish details * fix compile failed * fix compile failed * fix compile failed * add default kernel sig cache * fix get kernel arg defs error * remove kernel arg defs cache * fix origin op execute
-
- 17 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* split phi and fluid infermeta context * resolve conflict * fix type error * optimize scheduling perf * spec small vector size * replace all grad var name * fix test failed * move init defalut signature * polish details * polish details * fix no init bug * init sig for tests * add init sig for infer * fix infrt error * fix infrt failed * fix kunlun error * fix infrt failed
-
- 13 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove stack_grad infershape * fix bug of output with null * fix bug
-
- 04 4月, 2022 1 次提交
-
-
由 hong 提交于
* add dropout slice yaml * remove useless code * fix infer shape error * skip infrt compile for dropout
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 14 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add has_attr for arg map context * skip useless attr now * skip attr if not exists * fix typo
-
- 13 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* fix mkldnn invalid infershape * add unittest for mkldnn in new executor * add import os
-
- 30 12月, 2021 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto parallel] Make the id of var and op unique * [Auto Parallel] Rename back dist_context to distop_context
-
- 14 12月, 2021 1 次提交
-
-
由 Aurelius84 提交于
-
- 15 9月, 2021 1 次提交
-
-
由 王明冬 提交于
* clip op extra information when export model,test=ocr * rename clip_extra parameter to kwargs in save_inference_model, test=ocr
-
- 24 8月, 2021 1 次提交
-
-
由 Yulong Ao 提交于
* add auto_parallel dir * mv to paddle.distributed * add shard_xx api * add distributed attrs for var * add ut, test=develop * add dist * update * update * update * update * update * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update, test=develop * update * update * update * update * update * update, test=develop * update, test=develop * update * update * delete unused proto * resotre op_desc * restore type_defs * update var_desc * remove dimss_mapping for proto_pybind * update interface.py * update framework.py * update * update * add auto_parallel dir * mv to paddle.distributed * add shard_xx api * add distributed attrs for var * add ut, test=develop * [WIP] Add the auto completion feature and related codes * [WIP] Improve the auto completion and related codes * [WIP] Make the auto completion to support data-parallel * [WIP] Make the completion support mp and dp+mp * [WIP] Refactor auto completion unit test for MLP * [WIP] Refactor the implementation of DistributedOperatorImpl * [WIP] Improve dims_mapping update rule and fix a bug * [WIP] Support auto completion for one transformer decoder layer * [WIP] Add a minor change * [WIP] Fix a bug within the uint test * Shard XShape tensor, add embedding completion and refactor code * Add the distributed_operators dir to setup.py.in * Improve the completion process and add the unittest for gpt * fix process_mesh ut * fix process_mesh ut * update * update, test=develop * Add support for automatically completing distributed attrs of special ops * update * update * update * fix doc sample codes, test=develop * improve coverage, test=develop * add static_mode check, test=develop * Model the cluster for cost model and physical mapping * update, test=develop * add set_placement, test=develop * Add the check to make sure the candidate tensors' size is great than zero * update doc, test=develop * update doc, test=develop * update doc, test=develop * update doc, test=develop * update, test=develop * Auto mark dist attrs annotated by user * update ndarray to nested list, test=develop * update, test=develop * Add auto-completion module for auto-parallel (based on PR#33804) * Remove unnecessary files * Remove unrelated files for the auto completion pr * Update the unit test to improve the coverage * Modify codes based on reviews * Minor changes for CI * Improve some codes based on new comments * Fix bugs caused by shallow copy in attributes.py * Imporve amend_distributed_attr_for_program in context.py * Other changes for weihang's comments Co-authored-by: Nsandyhouse <lilong12@baidu.com>
-