- 14 6月, 2023 1 次提交
-
-
由 pangengzheng 提交于
* support sharding stage1 * fix unittest * format * pass sharded sharding params_and_grads to inner_opt apply_pptimize * change sharding gradient allreduce to reduce * support save state_dict adptively and support sharding with mp * fix sharding test * test set_state_dict * add more unit test * fix global norm of mp case * polish * hack to calculate global norm in order to remove diff in calculating global norm values in HybridParallelClipGrad compared to dp * remove print
-
- 13 4月, 2023 1 次提交
-
-
由 HongyuJia 提交于
* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h * Add logging.h for profiler.cc * Add logging.h for gloo_utils.h * Add logging.h for addmm_kernel_impl.h * Add logging.h for addmm_grad_kernel_impl.h * Add logging.h for p_send_kernel.cu * Add logging.h for determinant_grad_kernel_impl.h * Add logging.h for p_recv_kernel.cu * Add logging.h for elementwise_grad_base.h * Add logging.h for transfer_layout_kernel.cc * Add logging.h for eigvals_kernel.cc and index_select_impl.h * Add logging.h for all files in kernel directory * Add logging.h for xpu_info.cc * Add logging.h for xpu
-
- 11 4月, 2023 2 次提交
-
-
由 zhangyuqin1998 提交于
-
由 wangzhen38 提交于
-
- 04 4月, 2023 1 次提交
-
-
由 zhangyuqin1998 提交于
* rename_bilinear_tensor_product * fix
-
- 30 3月, 2023 2 次提交
-
-
由 zhouweiwei2014 提交于
-
由 pangengzheng 提交于
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict * align infer auc with DistPsArch pre-stable * async and multi thread data feed * rewrite dense tensor intialization * async infer shape and reuse memory
-
- 27 3月, 2023 2 次提交
-
-
由 ZhangDY-6483 提交于
-
由 risemeup1 提交于
* fix_gcc12_error * patch on eigen3 for fixing gcc12 error * Update multiary.cc
-
- 24 3月, 2023 2 次提交
-
-
由 PuQing 提交于
* fix momentum dtype infer * fix momentum datatype * fix on cpu * add momentum
-
由 ZhangDY-6483 提交于
* first version, notest * return final rst, notest * use infinity() instead of max * ut structure * start up of ut * generate lse * update * add depense * reconstruct cmake * move file * add memory efficient attention and fix blasimpl * update * update cmake * add namespace * update cmake * use .cu * update for pad3d * bug fix * bug fix * update * bug fix * update enforce * add test case * merge the lse pad * fix kernel_fn of backward * fix PADDLE_ENFORCE_EQ and phi_api * fix PADDLE_ENFORCE * fix PADDLE_ENFORCE * rerun coverage * fix memory efficient attention test * rerun ci * add cuda version condition * add cuda version condition * delete WIP test * replace PADDLE_ENFORCE * edit the namespace of datatype in multiple.cc * rerun * rerun --------- Co-authored-by: Nliuyuang <liuyuang@baidu.com>
-
- 23 3月, 2023 1 次提交
-
-
由 chenjian 提交于
* add meshgrid composite rule * add meshgrid composite rule * update * add into CMakeLists * fix * update * update * optimize code * fix meshgrid op * update test
-
- 22 3月, 2023 1 次提交
-
-
由 sneaxiy 提交于
* add fused_linear_param_grad_add_kernel * fix compile error * remove flag * fix ci compile error * fix ci compile error * revert pylayer revision * fix ci ut * improve performance
-
- 21 3月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
- 09 3月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* Add output defs for sgd kernel * add datatype infer for sgd * add infer logic
-
- 08 3月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 06 3月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 03 3月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 01 3月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 17 2月, 2023 1 次提交
-
-
由 yuehuayingxueluo 提交于
* rename multi_tensor_adam to fused_adam * fix some bugs * fix CI coverage * rename test_fused_adam.py * fix some bug * add test_fused_adam_op.py * fix some bugs * fix fused_adam_op.cc * fix CI bugs * fix CI bug * fix CI bug
-
- 16 2月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* add logspace yaml * update by comments * resolve test framework conflicct
-
- 09 2月, 2023 1 次提交
-
-
由 yuehuayingxueluo 提交于
* add multi_tenosr_adam * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py * fix adam.py optimizer.py * fix adamw.py * fix test_multi_tensor_adam.py * fix CI bug * fix CI coverage * fix ci bug * fix betapow * fix some bugs * fix test_adamw_op.py * fix CI coverage * fix multi_tensor_adam_kernel.cc * fix CI bug * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py * fix code style * update C++ parts * remove python parts modification temporarily * add C++ ut * update betapow copy code logic * fix ci ut * fix windows ci * fix coverage ci * improve coverage rate --------- Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
- 31 1月, 2023 2 次提交
-
-
由 RedContritio 提交于
* fix incorrect output shape of broadcast * add unittest
-
由 xiaoting 提交于
* support 0d tensor for interpolate * support 0d tensor for interpolate * add xpu unittest for interp * update unittest for interpolate * fix coverage * fix code style * fix for coverage * fix coverage
-
- 16 1月, 2023 1 次提交
-
-
由 wawltor 提交于
-
- 28 12月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
-
- 26 12月, 2022 1 次提交
-
-
由 Roc 提交于
* revert concat and change concat to stack * let stack kernel support int8, uint8 and bool type
-
- 23 12月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* add warp transducer code
-
- 22 12月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
-
- 09 12月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* move share_buffer kernel to phi * fix ut * add source file * fix window links
-
- 05 12月, 2022 1 次提交
-
-
由 Roc 提交于
-
- 17 11月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* standard api * fix xpu bugs
-
- 11 11月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 02 11月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* Standardise batch norm * standardize conv3d and depwise_conv2d * fix ci bugs
-
- 01 11月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* standard_api * add hardtanh
-
- 31 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* standard api * fix ci bugs * fix ci bugs * fix ce bugs
-
- 17 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
- 12 10月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for unary api * fix CI
-
- 10 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta * WIP: move rnn kernrl to phi * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types. This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types. * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results. * fix ci bugs * fix ci bugs * fix ci bugs * modify code according comment Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
-
- 09 10月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
-