- 17 Aug, 2022 3 commits
-
-
Committed by HongyuJia
* transfer bicubic_interp op to phi, change name from bicubic_interp_v2 to bicubic_interp * test final_state_bicubic_interp api * testcase match imperative case
-
Committed by sneaxiy
* fix squared_l2_norm bug * update buffer.h
-
Committed by Zhang Zheng
-
- 16 Aug, 2022 22 commits
-
-
Committed by Chen Weihang
* move check finite and unscale kernel into phi * move infershape into phi * move update_loss_scaling kernel into phi * remove original kernels * move update loss scaling infershape into phi * add header for xpu and npu * solve coverage failed * fix npu test failed * remove mutable data in cu file * fix new executor failed * add valid check for meta tensor output
-
Committed by Siming Dai
* initial commit * fix op maker bug * fix mul grad bug * add unittest * fix add grad bug, add cpu kernel * add paddle.geometric.message_passing * add paddle.geometric.send_uv api, add unittest * add fp16 judgement * fix file typo, move compute_type to message_op * add impl file * fix unittest timeout time * add review revise
-
Committed by caozhou
* update reshard cost and cost estimator * add unittest * add dropout cost * fix import error * fix reshard code style error * improve unittest coverage
-
Committed by Weilong Wu
* [Eager draft] forward_only interface migrate to autograd_api * strings api add dygraph forward function * rm useless comments * draft version for check CI * fix ci * forward-only no need compute_require_grad and pass stop_gradient, rm useless comments * polish yaml and using CPUPlace = phi::CPUPlace * rm useless comments * polish yaml and update some test case * rm useless funcs * polish eager_gen code * polish code
-
Committed by feng_shuai
* convert multihead to oss * fix:bug * fix:delete const cast * fix:don't support bias_qk * add vit pass * fix:convert bug and add preln_residual_bias * support length=-1 * add UT for convert * add no_bias_qk support for gpu_multihead_op * delete infer_shape depends on bias_qk * oss just can be used in T4 and A* * fix:change api for ROCM CI
-
Committed by Aganlengzi
-
Committed by feifei-111
* fix_shape * code style * fix assert * fix to_tensor badreturn
-
Committed by Yiqun Liu
-
Committed by wawltor
* fix the api for the add * update the document for the api add * update add docs; test=document_fix Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
-
Committed by Yulv-git
* Add CI for assert-allclose. * Update CI script. * Update check_approval. * Specify the destination path for the git diff. * Add test samples. * Add CI for assert-allclose with \n. * Update test samples. * Update ALL_ADDED_LINES_IN_TARGET_PATH. * update GitHub username to userid, test=document_fix * add rfc as a specification, test=document_fix * try to integrate single and multiple rows together, test=document_fix * remove duplicate dirs, test=document_fix * add anchor `#background`, test=document_fix * remove original scripts, test=document_fix * remove test files, test=document_fix Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
Committed by Haohongxiang
* reconstruct_of_fleet_api * update
-
Committed by HongyuJia
-
Committed by Wangzheee
-
Committed by zhangkaihuo
-
Committed by Siming Dai
-
Committed by houj04
-
Committed by JZ-LIANG
* prune d2h memcpy for fp16 pass
-
Committed by Wilber
-
Committed by Sing_chan
* add select_p * fix bugs * add custom test for select_p; modify select_p primrules * modify according to xiaoxu's comment * add eq_p, select_p, pow_p, use autograd to test grad of high order * add requirement of autograd, modify expected type of eq * modify according to xiaoxu's comment * import primops to use primops.pow
-
Committed by Feiyu Chan
-
Committed by Charles-hit
-
Committed by Yulong Ao
-
- 15 Aug, 2022 15 commits
-
-
Committed by Charles-hit
-
Committed by wuhuachaocoding
* refactor fleet. * refactor fleet.py. * update fleet/__init__.py. * update fleet.py * update code style. * update fleet * update fleet * update fleet * update fleet * update model.py * update fleet. * update __init__.py * update fleet. * update fleet. * update fleet * update fleet * update fleet * update fleet. * update optimizer.py * update optimizer * update fleet.py * update scaler.py * update setup.py.in
-
Committed by RichardWooSJTU
Co-authored-by: minghaoBD <liminghao03@baidu.com>
-
Committed by HongyuJia
* change name linear_interp_v2 to linear_interp * fix deprecated_op_names * deprecated_op_names add linear_interp_grad
-
Committed by Yuanle Liu
-
Committed by zlsh80826
* Reduce pool2d test configuration * Reduce depthwise_conv2d test configuration * Reduce trt_convert_conv2d_fusion test configuration * Reduce trt_convert_conv2d test configuration * Reduce trt_convert_conv2d_transpose test configuration * Reduce trt_convert_hard_swish test configuration * Enhance trt auto scan test error message and mechanism * Increase FP16 trt ut tolerance
-
Committed by wanghuancoder
* fix sync batch norm to inplace
-
Committed by duanyanhui
* fix compile error
-
Committed by zhangyikun02
-
Committed by zhaoyingli
* add collate_fn * fix number of inputs
-
Committed by Hui Zhang
* rm useless pybind * rm useless ut
-
Committed by Yulong Ao
* [Auto Parallel] Move the distributed info from python to c++ * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc * [Auto Parallel] Add the lost file * [Auto Parallel] Make the dist attr be unique_ptr * [Auto Parallel] Add the proto conversion * [Auto Parallel] Improve the proto support * [Auto Parallel] Fix the bugs for adding a device or a link * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper * [Auto Parallel] Improve the impl of these dist attrs * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh * [Auto Parallel] Fix the unittest problem * [Auto Parallel] Explicitly add the src file for auto_parallel target * [Auto Parallel] Add the proto dependency explicitly * [Auto Parallel] Fix the cmake bug on windows and mac * [Auto Parallel] Remove the pybind11 header file in process_mesh.h * [Auto Parallel] Remove unused codes * [Auto Parallel] Check whether the dist attr is null * [Auto Parallel] Implement the assign operator for OpDesc explicitly
-
Committed by houj04
* [XPU] add some collective ops. test=kunlun * use XPUOpTestWrapper. test=kunlun * skip kl1 for collective ops. fix typo: deivce -> device. test=kunlun
-
Committed by Ruibiao Chen
* Update FLAGS for standalone executor * Update FLAGS_FORCE_USE_PROGRAM_CACHE
-
Committed by Wilber
* convert_fp16 support multi block * update * update
-