- 14 9月, 2022 2 次提交
-
-
由 JingZhuangzhuang 提交于
* merge python lib * Update third_party.cmake * Update CMakeLists.txt
-
由 wanghuancoder 提交于
-
- 09 9月, 2022 5 次提交
-
-
由 zhangbo9674 提交于
* add scope cache & reuse * add gc scope for end of each train step * del scope reuse for jit * refine code * test
-
由 Matsumoto Ruko 提交于
* add paddle.incubate.sparse.is_same_shape * add paddle.incubate.sparse.is_same_shape * add paddle.incubate.sparse.is_same_shape * add paddle.incubate.sparse.is_same_shape * fix doc
-
由 Leo Chen 提交于
* add operator<< for BuildStrategy * add fake_coalesce * fit allreduce mode for new_exe * remove dubeg code * follow comments
-
由 ronnywang 提交于
* [CustomDevice] add dy2static support * update
-
由 WangZhen 提交于
* Enhance slice to support 0 dims Tensor * Add UT
-
- 08 9月, 2022 1 次提交
-
-
由 OccupyMars2025 提交于
-
- 06 9月, 2022 4 次提交
-
-
由 YuanRisheng 提交于
* add tensor array * fix ci bugs * fix ci bugs * fix ci bugs * fix ci bugs * update by comment * update code
-
由 chenjian 提交于
* update protobuf format * fix protobuf content * fix file mode * fix compiling error when gpu not exists * fix compiling error when gpu not exists * fix compiling error when gpu not exists * fix compiling error when gpu not exists * support rocm
-
由 OccupyMars2025 提交于
-
由 Chen Weihang 提交于
-
- 05 9月, 2022 3 次提交
-
-
由 yeliang2258 提交于
* support onnx format quantized model * update code * add test * add test * fix * fix test * fix cmake * update code * change scale file path to calibration file path * update code * update code * fix build bug * fix build bugs * fix * fix
-
由 denglin-github 提交于
* add config param for enable_dlnne and support calibration mode * remove useless file * refine code and add annotation * refine code of Warnning tips
-
由 niuliling123 提交于
-
- 31 8月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 30 8月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 29 8月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add interpretercore * refine backward program id * add code * refine program * refine code * create forward/backward_program by prog2graph2prog method * test, do not care * refine code * refine code * refine code * test, do not care * add interpretorcore * add scope * refine scope create method * add jit for new_exe * solve conflict * delete unused code * polish code * polish code * refine scope in inplace * refine for datatransfer * refine _rebuild_from_desc * refine control eager deletion attr * refine used_for_jit * refine jit for infer * op size0 use ori program * polish code * refine jit * refine run_program_op ut * refine inplace * refine control * refine graph helper * refine control * refine inplace * refine buffer_share_inplace_pass * polish code * polish code * refine usage for compilerProgram * refine control * test * test core cache * refine code * refine io.py * increase test_seq2seq timeout * refine convert program * refine interpretercore_cache release * delete buildinplace * refine partial_program && io * refine code for io * test * test * test
-
- 26 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
-
- 25 8月, 2022 3 次提交
-
-
由 Feiyu Chan 提交于
-
由 danleifeng 提交于
* update brpc version;test=develop
-
由 Sing_chan 提交于
* make full_like support double_max in dygraph * fix bug
-
- 24 8月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* fix utest * fix utest * fix utest * fix log * fix random utest
-
- 23 8月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 22 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
some python c api use final state
-
- 20 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* pylayer detach output tensor if it is equal with input * pylayer detach output tensor if it is equal with input
-
- 18 8月, 2022 2 次提交
-
-
由 Weilong Wu 提交于
* [Eager] add get_tensor_from_selected_rows * add PADDLE_ENFORCE to check SelectedRows * use _ prefix in temp
-
由 OccupyMars2025 提交于
-
- 17 8月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* fix_stop_gradient
-
由 fwenguang 提交于
-
- 16 8月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
-
- 15 8月, 2022 2 次提交
-
-
由 Hui Zhang 提交于
* rm useless pybind * rm useless ut
-
由 Yulong Ao 提交于
* [Auto Parallel] Move the distributed info from python to c++ * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc * [Auto Parallel] Add the lost file * [Auto Parallel] Make the dist attr be unique_ptr * [Auto Parallel] Add the proto conversion * [Auto Parallel] Improve the proto support * [Auto Parallel] Fix the bugs for adding a device or a link * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper * [Auto Parallel] Improve the impl of these dist attrs * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h * [Auto Parallel] Remove unused codes * [Auto Parallel] Check whether the dist attr is null * [Auto Parallel] Implement the assign operator for OpDesc explicitly
-
- 13 8月, 2022 2 次提交
-
-
由 Leo Chen 提交于
* add cached_serialize_str_ * support program hash * add sha * add ut * use hash_str only for new_exe * fix attr order
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk * write trainer_desc file * support split sparse params in local & remote * fix import paddle.fluid.core.PSGPU * fix import paddle.fluid.core.PSGPU * add remote_sparse & local_sparse config * fix unittest * fix test_dist_fleet_geo table error * fix PADDLE_ENFORCE error * fix other's pr conflict
-
- 12 8月, 2022 3 次提交
-
-
由 HongyuJia 提交于
* support optional<vector<Tensor>> in yaml and eager * delete useless comments in eager_gen.py * fix api_base.py support optional<vector<TTensor>> * python_c_gen.py support optional<vector<tensor>> * transfer linear_interp_v2 yaml from fluid to phi * fix op_test typo error * change linear_interp_v2 testcase * fix args in final_state_linear_interp_v2 * fix zeropad2d typo. test=document_fix
-
由 Yulong Ao 提交于
* [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto depedency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
-
由 Siming Dai 提交于
* add init file * add op definition and infermeta * add kernel definition funcs * add broadcast infer shape * add gpu forward kernel * delete SUB and DIV * add x_grad * add template * add e_grad for min and max * fix small bug * temp commit * temp commit * add e_grad for sum and mean * fix some compile bug * fix compile bugs * fix compile problem * add sum forward unittest * fix broadcast error, add kernel sig, register e_grad, change unit test * fix grad * add temp grad fix * temp commit * add min max unittest * add max, min unittest, fix mul bug * add cpu forward sum and mean * add forward min max, fix mean unittest * add cpu backward min max * fix code-style * add backward sum mean * fix rocm ci * set uniitest timeout * fix bug of x broadcast to e, gpu grad * fix bug of x broadcast to e, cpu grad * rename BOOST_GET_CONST macro * fix rocm ci * mv graph_send_e_recv to graph_send_ue_recv * move out_size to IntArray * add eager op test * fix max pool type bug, add unittest for api * revise api doc * add fp16 for atomic min and max, add unittest * add unittest * add fp16 support for graph_send_recv * fix unittest fp16 bug * change OutSizeTensor to Out_size * move E to Y * add copyright, fix comment * review code * fix thread block size * fix thread block size * change api attribute name: pool_type to reduce_op, compute_type to message_op * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
-
- 11 8月, 2022 1 次提交
-
-
由 zhaocaibei123 提交于
* first refine * second refine * remove some code unuseful
-