- 11 7月, 2022 1 次提交
-
-
由 houj04 提交于
* rmsprop for xpu. test=kunlun * minor fix (follow comments). test=kunlun
-
- 02 7月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix init() * delete test_device_context * replace CPUDeviceContext with CPUContext * fix test_scalar * remove dot_op.cc * fix compile
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 13 6月, 2022 1 次提交
-
-
由 qipengh 提交于
-
- 10 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 09 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add nproc_per_node for DistributedFusedLamb * fix nproc_per_node communicator bug * fix ring_id = 1 init bug * fix ci * fix test_parallel_executor_mnist.py
-
- 07 6月, 2022 2 次提交
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 27 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor the optional tensor * remove optiona<MetaTensor> in InferMeta * fix bug * fix optional<vector<Tensor>> * fix bug * fix rmsprop * fix amp of eager_gen * polish code * fix deleted code * fix merge conflict * polish code * remove is_nullopt_ * fix merge conflict * fix merge conflict
-
- 16 5月, 2022 1 次提交
-
-
由 wbn 提交于
* Add the new XDNN implementation. test=kunlun * Add the new XDNN implementation. test=kunlun * Modify the code based on review, test=kunlun
-
- 11 5月, 2022 1 次提交
-
-
由 taixiurong 提交于
-
- 10 5月, 2022 1 次提交
-
-
由 qipengh 提交于
-
- 29 4月, 2022 1 次提交
-
-
由 Aurelius84 提交于
-
- 28 4月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add gradient merge for DistributedFusedLamb * use master acc gradient * fix CI ut * polish * remove math_function_impl.h change * fix test_update_loss_scaling_op.py * try to fix XPU/NPU CI * add gm ut
-
- 20 4月, 2022 1 次提交
-
-
由 fwenguang 提交于
-
- 15 4月, 2022 1 次提交
-
-
由 fwenguang 提交于
-
- 13 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add sgd yaml * change python api * open eager mode in sgd * fix bug
-
- 07 4月, 2022 2 次提交
- 03 4月, 2022 1 次提交
-
-
由 hong 提交于
* add some infer meta * fix bug * fix bugs; * fix bug and add set data type * revert infer shape of lookup table * recover test
-
- 28 3月, 2022 1 次提交
-
-
由 hong 提交于
* move momentum, rmsprop to phi; test=develop * update * update * update * update * udpate; test=develop * fix xpu npu bugs; test=develop * fix npu bug; test=develop * fix windows compile error; test=develop * fix windows compile error; test=develop * polish code; test=develop * fix conflict; test=develop * add meshgrid; * update * polish code * polish code; * fix bug * format; remove useless code * fix npu bug * fix bug
-
- 25 3月, 2022 3 次提交
-
-
由 duanboqiang 提交于
* fix lars optitmizer bug * Update optimizer.py
-
由 Aurelius84 提交于
* [Phi] Migrate Adam and Adamw into Phi * fix compile error and unittest ok * fix compile error and unittest ok * fix undefined reference to fLI::FLAGS * test depend on operator * fix cmake * fix xpu compile * fix infrt * fix amp_type_traits * fix amp_type_traits * modify according reviewer * modify according reviewer * fix dtype float16 * fix typo * fix Cmake * fix code style
-
由 Aganlengzi 提交于
* [NPU] add merged_momentum * fix * fix device
-
- 14 3月, 2022 1 次提交
-
-
由 fwenguang 提交于
-
- 07 3月, 2022 2 次提交
-
-
由 Aurelius84 提交于
-
由 Aurelius84 提交于
* [Phi]Migrate Adamax into phi * Add adadelta kernel
-
- 04 3月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* clean distribution_helper, index_impl, aligned_vector code in fluid * fix conflicts
-
- 02 3月, 2022 2 次提交
- 01 3月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* vectorize lamb kernel * remove flags, add ut * remove useless codes * refine code, add param order
-
- 25 2月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add multi tensor apply l2 norm * add multi_tensor_apply code * make sizeof(TensorMeta) smalller * move code to distributed_fused_lamb_op.cu * remove useless FLAGS
-
- 22 2月, 2022 1 次提交
-
-
由 xiongkun 提交于
* change Vector to std::vector and provide MixVector class as a helper wrapper class * solve the multi-gpu hang problem * remove the duplicate template instantialize * Copy vector to cpu * add CopyToCPU * xxx * final version: fix the problem of all reduce * remove mixvector dependence * fix * merge * fix code * fix by CI
-
- 21 2月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 2月, 2022 2 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
由 sneaxiy 提交于
* add DistributedFusedLamb op * polish code * fix compile error * compatible with pten changement * fix rocm compile error * improve converage * update upstream/develop * fix cast_with_ptr.h * add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1 * fix clip before allreduce * add use_master_param_norm * code polish * fix bug * fix ROCM ci
-
- 15 2月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
Move paddle/fluid/operators/math/algorithm.h to paddle/pten/kernels/funcs and rename all references to symbols in it.
-