- 26 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
-
- 03 8月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add use_hierarchical_allreduce * support hierarchical allreduce for more cases
-
- 27 7月, 2022 1 次提交
-
-
由 Wang Bojun 提交于
* move average_accumulates op to phi kernel
-
- 09 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add nproc_per_node for DistributedFusedLamb * fix nproc_per_node communicator bug * fix ring_id = 1 init bug * fix ci * fix test_parallel_executor_mnist.py
-
- 07 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add use_master_acc_grad * add ut
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 10 5月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* improve introduction of bfgs args; test=document_fix * modify according to zhouwei's comment; test=document_fix
-
- 28 4月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add gradient merge for DistributedFusedLamb * use master acc gradient * fix CI ut * polish * remove math_function_impl.h change * fix test_update_loss_scaling_op.py * try to fix XPU/NPU CI * add gm ut
-
- 14 4月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* fix bfgs_doc; test=document_fix * add parameter name; test=document_fix * modify according to chenlong's comments;test=document_fix
-
- 08 4月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* fix image cv2 import * fix lamb
-
- 07 4月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add Output(Step) to distributed fused lamb op * add _set_step
-
- 04 4月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* cut off relation between xk and initial_position's graph * fix_bug * add detach to cut off with original graph
-
- 01 4月, 2022 2 次提交
-
-
由 Sing_chan 提交于
* change vjp to paddle.grad * use grad and gradients api * fix preprocess for x * fix a bug, val_and_grad should return a Tensor * detach value and grad to avoid assign error Co-authored-by: Nlevi131 <limaolin01@baidu.com>
-
由 Sing_chan 提交于
-
- 31 3月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* [New API]: miminize_bfgs and miminize_lbfgs * modify for python module call correctly * add functional package, add error raise in static_graph, change assign to set_value * unify static_graph and dygraph, fix bug when x or H0 is float64 * now only accept input is tensor, put check args in utils.py, put exception test together * temp * add more detailed algorithm illustration and comment, reduce test case to limit test time in 15s * change in_dygraph_mode to in_dynamic_mode * fix bug of sample code; reduce test case to reduce test time * change dir to incubate
-
- 25 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
- 01 3月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* vectorize lamb kernel * remove flags, add ut * remove useless codes * refine code, add param order
-
- 25 2月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add multi tensor apply l2 norm * add multi_tensor_apply code * make sizeof(TensorMeta) smalller * move code to distributed_fused_lamb_op.cu * remove useless FLAGS
-
- 19 2月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add DistributedFusedLamb op * polish code * fix compile error * compatible with pten changement * fix rocm compile error * improve converage * update upstream/develop * fix cast_with_ptr.h * add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1 * fix clip before allreduce * add use_master_param_norm * code polish * fix bug * fix ROCM ci
-
- 15 7月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* cache core.ops, test=develop * refine, test=develop
-
- 11 6月, 2021 1 次提交
-
-
由 zhiboniu 提交于
* update 2.0 public api in all left files * reverse device.py all list; fix some flake8 errors
-
- 25 1月, 2021 1 次提交
-
-
由 123malin 提交于
* test=develop, fix test_lookahead
-
- 07 1月, 2021 1 次提交
-
-
由 123malin 提交于
* test=develop, add model_average and lookahead
-