- 20 9月, 2022 1 次提交
-
-
由 niuliling123 提交于
cherry-pick from #45826 LayoutAutotune 支持 inplace 类型的OP 根据 Add eager layout autotune #45409 修改意见调整UseAutotune 将LayoutAutotune判断放到controller中,与AMP 判断保持一致
-
- 19 9月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* unify core_avx and core_noavx * fix except error * revert mac compile logic * revert dylib to so * add core_noavx branch * remove core_noavx * replace paddle_core by lib paddle * polish var name * replace paddle_core by libpaddle * update custom device commit * polish code by comments
-
- 14 9月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
-
- 09 9月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add dy2static support * update
-
- 08 9月, 2022 1 次提交
-
-
由 OccupyMars2025 提交于
-
- 06 9月, 2022 2 次提交
-
-
由 chenjian 提交于
* update protobuf format * fix protobuf content * fix file mode * fix compiling error when gpu not exists * fix compiling error when gpu not exists * fix compiling error when gpu not exists * fix compiling error when gpu not exists * support rocm
-
由 Chen Weihang 提交于
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 29 8月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add interpretercore * refine backward program id * add code * refine program * refine code * create forward/backward_program by prog2graph2prog method * test, do not care * refine code * refine code * refine code * test, do not care * add interpretorcore * add scope * refine scope create method * add jit for new_exe * solve conflict * delete unused code * polish code * polish code * refine scope in inplace * refine for datatransfer * refine _rebuild_from_desc * refine control eager deletion attr * refine used_for_jit * refine jit for infer * op size0 use ori program * polish code * refine jit * refine run_program_op ut * refine inplace * refine control * refine graph helper * refine control * refine inplace * refine buffer_share_inplace_pass * polish code * polish code * refine usage for compilerProgram * refine control * test * test core cache * refine code * refine io.py * increase test_seq2seq timeout * refine convert program * refine interpretercore_cache release * delete buildinplace * refine partial_program && io * refine code for io * test * test * test
-
- 18 8月, 2022 1 次提交
-
-
由 OccupyMars2025 提交于
-
- 15 8月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Move the distributed info from python to c++ * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc * [Auto Parallel] Add the lost file * [Auto Parallel] Make the dist attr be unique_ptr * [Auto Parallel] Add the proto conversion * [Auto Parallel] Improve the proto support * [Auto Parallel] Fix the bugs for adding a device or a link * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper * [Auto Parallel] Improve the impl of these dist attrs * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h * [Auto Parallel] Remove unused codes * [Auto Parallel] Check whether the dist attr is null * [Auto Parallel] Implement the assign operator for OpDesc explicitly
-
- 12 8月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
-
- 11 8月, 2022 1 次提交
-
-
由 chenjian 提交于
* fix * add control flag and input shapes for new dygraph * fix file mode * improve code coverage * fix a bug in statstic * fix according to review * optimize performance * fix
-
- 09 8月, 2022 1 次提交
-
-
由 Allen Guo 提交于
-
- 05 8月, 2022 1 次提交
-
-
由 OccupyMars2025 提交于
-
- 01 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 26 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 21 7月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* add property.proto, can compiled * property get and deserilize * support get float * format code * format code * add unittest * add more set method * fix grammar error * Update paddle/fluid/jit/property.h Co-authored-by: NAurelius84 <zhangliujie@baidu.com> * Update paddle/fluid/jit/property.cc Co-authored-by: NAurelius84 <zhangliujie@baidu.com> * Update paddle/fluid/jit/property.cc Co-authored-by: NAurelius84 <zhangliujie@baidu.com> * Update paddle/fluid/jit/property.cc Co-authored-by: NAurelius84 <zhangliujie@baidu.com> * fix comment * fix error throw * fix property save unit test * fix error info * fix copyright and header import * reorder jit property tensor datatype Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
-
- 19 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Rename BOOST_GET macros * Fix conflicts
-
- 14 7月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* Compilation optimization
-
- 08 7月, 2022 1 次提交
-
-
由 WangZhen 提交于
* Pybind JitLayer VarBase Function and add python UT * Add multi program load UT * Fix UT place error * Update jit.save param name * Remove some comments * Polish cmakelists * Polish JitLayer in Python * Fix comments
-
- 06 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* not run startup program in constructor of StandaloneExecutor * clear interface of standalone executor * clean debug code
-
- 02 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix init() * delete test_device_context * replace CPUDeviceContext with CPUContext * fix test_scalar * remove dot_op.cc * fix compile
-
- 30 6月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* support scope_guard * fix test
-
由 Ruibiao Chen 提交于
* Remove boost::variant for FetchResultType * Fix pybind errors
-
- 29 6月, 2022 1 次提交
-
-
由 ronnywang 提交于
-
- 28 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* boost::variant -> paddle::variant * boost::variant.apply_visit -> paddle::visit * Update pybind_boost_hraders.h * Fix CINN compilation errors * Revert FetchResultType
-
- 27 6月, 2022 2 次提交
-
-
由 Aganlengzi 提交于
* [CustomDevice]add custom place supports * sync format
-
由 Chen Weihang 提交于
* add get_op_names api * Update pybind.cc
-
- 24 6月, 2022 1 次提交
-
-
由 chenjian 提交于
* record memory and op supplement info * update * update * fix a bug * fix memory recording * fix a bug * update * update * fix a bug * update * fix a bug * fix a bug * fix a bug * Revert "fix a bug" This reverts commit c1d4df52762ba9ae7c7e27cd2ba4fc3a7ed9c7a5. * fix a bug * fix format * fix
-
- 16 6月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomKernel] add custom kernel c api * update * update * fix unable to export capi Co-authored-by: Nronny1996 <524019753@qq.com>
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 02 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* support CUDAGraph for partial graph * add ut * fix ci * fix ut again because of eager mode * fix kunlun ci * fix win ci
-
- 27 5月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support memory stats for CPU * Add UTs * Fix typos * Fix typos
-
- 16 5月, 2022 1 次提交
-
-
由 jiangcheng 提交于
* optimize cinn find graph by graph address * graph_key use int64_t instead of program string * fix framework _to_readable_code python code * rename get_readable_comile_key to get_serialize_comile_key
-
- 11 5月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* update to popart v2.5.0 * use a specific version of sdk2.5.0
-
- 05 5月, 2022 2 次提交
- 27 4月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [DO NOT MERGE] test op_test * update with more related modifications * split op_test.py to use test=allcases for testing * split op_test.py to use test=allcases for testing
-
- 26 4月, 2022 1 次提交
-
-
由 seemingwang 提交于
* extract sub-graph * graph-engine merging * fix * fix * fix heter-ps config * test performance * test performance * test performance * test * test * update bfs * change cmake * test * test gpu speed * gpu_graph_engine optimization * add dsm sample method * add graph_neighbor_sample_v2 * Add graph_neighbor_sample_v2 * fix for loop * add cpu sample interface * fix kernel judgement * add ssd layer to graph_engine * fix allocation * fix syntax error * fix syntax error * fix pscore class * fix * change index settings * recover test * recover test * fix spelling * recover * fix * move cudamemcpy after cuda stream sync * fix linking problem * remove comment * add cpu test * test * add cpu test * change comment * combine feature table and graph table * test * test * pybind * test * test * test * test * pybind * pybind * fix cmake * pybind * fix * fix * add pybind * add pybind * optimize pybind * test * fix pybind * fix Co-authored-by: NDesmonDay <908660116@qq.com>
-