- 12 Aug, 2022 11 commits
-
-
Committed by Sławomir Siwek
* remove v2_transpose_reshape * matmul_transpose_reshape * reshape_transpose_matmul * Add int8 support for matmulV2 * restore ut * adjust old ut * restore parallel UT rules * remove mkldnn code from base ops * move enforces to pass * remove duplicated functions * delete duplicated enforces * feedback from review * add comments to variables * enable eltwise support * dynamic attribute * remove fusepass tests from op test * remove fuse pass cases from op test * revert introduction of dynamic attributes * style Co-authored-by: Nwozna <joanna.wozna@intel.com>
-
Committed by HongyuJia
* support optional<vector<Tensor>> in yaml and eager * delete useless comments in eager_gen.py * fix api_base.py support optional<vector<TTensor>> * python_c_gen.py support optional<vector<tensor>> * transfer linear_interp_v2 yaml from fluid to phi * fix op_test typo error * change linear_interp_v2 testcase * fix args in final_state_linear_interp_v2 * fix zeropad2d typo. test=document_fix
-
Committed by Allen Guo
-
Committed by kangguangli
* transfer memcpy_h2d from fluid to phi * use UnchangedInferMeta instead * restore test_standalone_executor * add newline to fix codestyle check * rename pt -> phi * simplify logic and add check * make the comment more clear * remove useless comment * refine code
-
Committed by Yuanle Liu
* trt engine input data type should be consistent with trt input bindings type * fix some bugs * fix some bugs * fix some bugs
-
Committed by zyfncg
* remove some custom_impl api and make them generated by yaml completely * delete useless code * fix adamw bug * fix infermeta * revert adamw * polish code * fix bug
-
Committed by zhiboniu
-
Committed by Yulong Ao
* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto dependency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
-
Committed by duanyanhui
* enhance grid_sampler to support 3d input
-
Committed by zyfncg
-
Committed by Siming Dai
* add init file * add op definition and infermeta * add kernel definition funcs * add broadcast infer shape * add gpu forward kernel * delete SUB and DIV * add x_grad * add template * add e_grad for min and max * fix small bug * temp commit * temp commit * add e_grad for sum and mean * fix some compile bug * fix compile bugs * fix compile problem * add sum forward unittest * fix broadcast error, add kernel sig, register e_grad, change unit test * fix grad * add temp grad fix * temp commit * add min max unittest * add max, min unittest, fix mul bug * add cpu forward sum and mean * add forward min max, fix mean unittest * add cpu backward min max * fix code-style * add backward sum mean * fix rocm ci * set unittest timeout * fix bug of x broadcast to e, gpu grad * fix bug of x broadcast to e, cpu grad * rename BOOST_GET_CONST macro * fix rocm ci * mv graph_send_e_recv to graph_send_ue_recv * move out_size to IntArray * add eager op test * fix max pool type bug, add unittest for api * revise api doc * add fp16 for atomic min and max, add unittest * add unittest * add fp16 support for graph_send_recv * fix unittest fp16 bug * change OutSizeTensor to Out_size * move E to Y * add copyright, fix comment * review code * fix thread block size * fix thread block size * change api attribute name: pool_type to reduce_op, compute_type to message_op * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
-
- 11 Aug, 2022 6 commits
-
-
Committed by carryyu
* make affine_grid_op support 5d_input on cpu and gpu
-
Committed by zhaocaibei123
* first refine * second refine * remove some useless code
-
Committed by chenjian
* fix * add control flag and input shapes for new dygraph * fix file mode * improve code coverage * fix a bug in statistic * fix according to review * optimize performance * fix
-
Committed by zhangkaihuo
* fix submanifold conv
-
Committed by whs
-
Committed by Weilong Wu
* Polish black_ops_list logic in eager_gen * update black_ops_list
-
- 10 Aug, 2022 11 commits
-
-
Committed by Wangzheee
* disable_skip_layernorm_fp16
-
Committed by yeliang2258
-
Committed by Chen Weihang
-
Committed by Feiyu Chan
1. change the codegen code to avoid conversion from heterogeneous 'initializer list' to tuple, which fails on gcc 5.4; (#45036) 2. add a template CheckTensorHasNanOrInf to handle arbitrary tuple of supported types.
-
Committed by duanboqiang
* add class center sample infershape * add yaml * modify unittest * modify unittest * remove comment
-
Committed by zhangxiaoci
* add macro control in enforce_xpu.h, test=kunlun * minor bugfix * minor bugfix
-
Committed by z8hanghuan
* fix bug of adaptive pool2d_grad, *test=kunlun * fix bug of adaptive pool2d_grad, *test=kunlun * fix bug of adaptive pool2d_grad, *test=kunlun
-
Committed by xiaoxiaohehe001
* cuda_graph * cuda_graph_ * cuda_graph_ * cuda_graph_
-
Committed by Leo Chen
* set cuda device before run * add header file * fix compile
-
Committed by Leo Chen
* fix proto bug * add ut * reset need_update for var_desc * refine code * fix var desc order issue
-
Committed by Aurelius84
* [OpAttr]Support VarDesc* and vector<VarDesc*> in Attribute * add unittest for inference predictor
-
- 09 Aug, 2022 12 commits
-
-
Committed by Siming Dai
-
Committed by Ruibiao Chen
* Fix copy bug for same src and dst Tensor * Improve code design * Fix errors
-
Committed by WangZhen
* Rename class type Name2XX * Fix return type * Remove EngineMap function in layer
-
Committed by yeliang2258
-
Committed by Weilong Wu
* [Eager] use final_state_fill_constant_ * fill_constant use str_value * add fill_constant_ to no_amp_list * use float(value) as input * support final state full_ same as fill_constant
-
Committed by Siming Dai
* change out_size to INTArray * fix out_size eager bug * add unittest for out_size tensor * add deprecated for paddle.incubate.graph_send_recv, add paddle.geometric.send_u_recv and unittests * fix lowest bug * fix according to review comment * add default value in yaml * change api file name * change name
-
Committed by Charles-hit
* move api(erfinv) from legacy_api.yaml to api.yaml * change inplace_map key
-
Committed by duanboqiang
* migrate class center sample kernel * fix Resize ddim error * set buffer ptr * add header * add header * remove comment * remove header
-
Committed by yeliang2258
-
Committed by Yulong Ao
* [Auto Parallel] Add the c++ dist attrs * [Auto Parallel] Remove some codes to be less than 1000 lines
-
Committed by WangZhen
* Pybind Function and hide ExecutorEngine and PEEngine * Remove FunctionNames in compilation_unit
-
Committed by z8hanghuan
* add phi empty,*test=kunlun * support empty op in xpu, *test=kunlun * support empty op in xpu, *test=kunlun
-