- 13 4月, 2023 6 次提交
-
-
由 Yuanle Liu 提交于
-
由 Kim Yann 提交于
* rem cncl in new test * rem cncl in build sh * rem cncl in old test
-
由 csy0225 提交于
-
由 RedContritio 提交于
-
由 risemeup1 提交于
* fix ninja error * fix_ninja_error_qa
-
由 gaoziyuan 提交于
--------- Co-authored-by: NZhang Jun <ewalker@live.cn>
-
- 12 4月, 2023 34 次提交
-
-
由 Wang Xin 提交于
* add autogen code for clip_by_norm op * bug fixed
-
由 zqw_1997 提交于
* slight modify * support cuda12+ arch, Hopper arch and discard 30 arch * add arch 90 for each paddle_known_gpu_archs12 * for comments
-
由 ShenLiang 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Speedup the completion process * [Auto Parallel] Skip the property of dist_context when deepcopying * [Auto Parallel] Remove the unnecessary print * [Auto Parallel] Move some changes from 2.4 branch to develop * Update engine.py * [Auto Parallel] Fix a bug
-
由 zhangbo9674 提交于
* add Value OpResult OpOperand class * add Value OpResult OpOperand class * fix bug * fix bug * add utils * refine code * add ptr offset and reset method * add value impl * fix bug * refine comment of ValueImpl * refine code of OpResult * refine code of Value * add some comment * fix cpu compile bug * refine code * add op * add method for op & test value * refine unittest * refine code by comment * refine code * refine code * refine code * refine code
-
由 Zhang Zheng 提交于
* Optimize performance of unique kernel * fix ci
-
由 gouzil 提交于
* [phi] mv sequence_pooling_test * [test] fix include
-
由 zhupengyang 提交于
-
由 WangZhen 提交于
* Eval during train for ResNet
-
由 xiaoguoguo626807 提交于
-
由 liuruyan 提交于
-
由 YepKong 提交于
* add autogen code support for squared_l2_norm_op * Update ops.yaml
-
由 jiangcheng 提交于
* [CINN] add python.version.cinn_commit api * update cinn version get function * fix cinn_commit in setup.py also need len>0 check bug
-
由 RedContritio 提交于
-
由 cyber-pioneer 提交于
-
由 张春乔 提交于
* remove c_comm_init_hccl_op.cc and c_gen_hccl_id_op.cc * remove gen_hccl_id_op.cc
-
由 CHANGer 提交于
-
由 Aurelius84 提交于
-
由 qizhaoaoe 提交于
* fix dtype cast in amp. * add test case and update docs. * remove set_prim.
-
由 gaoziyuan 提交于
-
由 WangZhen 提交于
-
由 Wei Shengyu 提交于
* add bf16 support and bf16/fp16 unittest for pool2d * add include files * dbg * reformat * reformat * modify code according to review comment * remove duplicate code * remove dup code * remove useless include * dbg
-
由 xiongkun 提交于
-
由 Galaxy1458 提交于
* test, test=develop * test, test=develop * test, test=develop
-
由 wangzhen38 提交于
* [DO NOT MERGE] adadelta lr support * [DO NOT MERGE] gpu support * [test] follow torch * fix acc update order * for ci * [bug fix] update master para * [bug fix] update test * [bug fix] for ci test * for ci * fix xpu * [adadelta fix] del fluid head file * for ci * del notes
-
由 jiangcheng 提交于
-
由 Yuanle Liu 提交于
-
由 Yuanle Liu 提交于
* update save model * update
-
由 RedContritio 提交于
* move python/paddle/fluid/tests/unittests/xpu to test/xpu * update CMakeLists.txt * remove xpu in fluid/tests/unittests/ * add path to op_test_xpu * fix incorrect path * update test script * fix test_adadelta_op_xpu error
-
由 Guoxia Wang 提交于
* [AMP OP&Test] support bf16 for batchnorm * codestyle * Update batch_norm_grad_kernel.cu * Update batch_norm_kernel.cu * fix codestyle * fix * fix * fix * fix * fix * Update batch_norm_kernel.cc
-
由 Huihuang Zheng 提交于
* [Do NOT merge] Expr PR on Composite * Expr PR on Composite * Revert some compsite experiment * Remove unnecessary composite code * Add rsqrt as sub primitives
-
由 chenjian 提交于
* fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * isamp * gpu * cpu * noamp * fix instance_norm * fix * fix unit test * fix unit test * add unit test * fix * add big data tests * fix * fix * fix * fix * fix * fix * fix * add test case * fix * fix * fix * fix * fix * remove amp test --------- Co-authored-by: Nheyanru01 <429520051@qq.com>
-
由 megemini 提交于
* 【Hackathon 4th No.13】为 Paddle 新增 Bernoulli API * [Change]change unittest_py scipy version * [Change]修改BernoulliNumpy的类型参数;优化静态图测试流程 * [Change]优化类的初始化及逻辑;增加0D相关测试用例
-
由 wanghuancoder 提交于
-