- 30 Mar, 2023 3 commits
-
-
Committed by jiangcheng
* [CINN] pass global seed to CINN
* fix cu not include cinn/runtime/flags.h bug
* fix DefaultCUDAGenerator should has device id bug
-
Committed by cyberslack_lee
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140)
* codestyle c416 c417
* fix error
* fix inc
* unify all C4 rules into one
* fix inc
---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
Committed by yuehuayingxueluo
* add gloo send_recv
* fix code_stype
* fix CI bug
* fix send_recv.cc
* add send_recv without sync_op
* fix send_recv test
* fix gather.cc
-
- 29 Mar, 2023 26 commits
-
-
Committed by zengshao0622
* pad3d add unittests of fp16 and bf16
* pad3d add unittests of fp16 and bf16
* fix cuda place
* fix random to uniform
* fix class name
* fix fp16 max relative error to 1.5e-3
* add dytpe register for onednn
* add pad uint16 check of common.py
* remove check_eager
* test_check_grad --> test_check_grad_normal
-
Committed by jjyaoao
* Clear the infrt-related code
* remove tools/infrt
-
Committed by Yichen Zhang
* add group_norm composite rule
* add test for scale_grad and bias_grad
* resolve conflicts
* remove amp in composite_rule.py
* add float16 test
* deal with NHWC format
* keep the composite rule in float16 identical as original kernel
* resolve conflicts
-
Committed by wanghuancoder
* delete old dygraph op test
-
Committed by hjyp
* regist output type for GraphSampleNeighbors and GroupNorm
* Update return type
* fix return type
* update
* fix detail
-
Committed by chenxujun
-
Committed by HongyuJia
* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output
* delete dtype,shape func of multi_inplace op
* [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output
* [CustomOP Inplace] Auto-generate python API for inplace vector<Tensor> output
-
Committed by Galaxy1458
-
Committed by 张春乔
* add up34
* modify var name in loop
* revert changes in test_slice
* Revert "modify var name in loop"
  This reverts commit 6d748e371afb417054ed0c6b36fd11e87959a90d.
* temporarily ignore test_slice.py
* add comment
* empty commit, re-trigger all ci
* fix inc
---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
Committed by ShenLiang
* fix bg
* add utest
* add utest
-
Committed by zhouweiwei2014
-
Committed by xiaoguoguo626807
* tanh_double_grad_rules
* delete log got api_base
* modify composite yaml
* optimize rules
-
Committed by zhupengyang
-
Committed by QingshuChen
-
Committed by Zheng-Bicheng
* update
* update
-
Committed by jjyaoao
-
Committed by yuehuayingxueluo
* add fuse adamw pass
* fix some bugs
* fix CIbug
* change chunk_size
* fix CI bug
* rm test_fused_adam_op.py
* fix CI bugs
* fix fuse_adamw_op_pass.cc
* change code style
* fix CI bug
* fix ut bug and use_adamw_op_pass.cc
* fix test_fuse_adamw_pass.py
* fix CI bug
* remove fluid
* fix ci bug
* fix CI bug
-
Committed by Ghost Screaming
* Support ignore_index for c_softmax_with_cross_entropy_op.
* Polish code. Remove useless comments and add Testcase.
* Polish code for TestCase.
* Polish code.
* Polish code style.
* Polish code.
* Change loss calculation formula and ignore_index dtype.
* Polish TestCase.
-
Committed by niuliling123
-
Committed by zhangyikun02
-
Committed by jameszhang
* [kunlun] support min/max in dygraph mode
* update xccl to 1.0.13
-
Committed by ronnywang
-
Committed by huangjiyi
* fix kp compile
* test
* Revert "test"
  This reverts commit 3a1cbfaa0f23e6e06d3dcd8d0b0c28aa63a98e70.
* update copyright
* update cmake
* update cmake
* update cmake
* update cmake
-
Committed by 傅剑寒
-
Committed by YuhangLi
-
Committed by sneaxiy
* fix generate_kernels.py in CUDA 12.0
* fix attrs bug
-
- 28 Mar, 2023 11 commits
-
-
Committed by Feiyu Chan
Add basic functionalities to support Scalar & Scalars in operator attribute.
1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types;
2. Scalar enhancement, add formatting, equality;
3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc;
4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`;
5. add functionality to canonicalize attribute map according to OpProto (if the op the attribute map used for has an OpProto);
6. add code to manipulate Scalar proto message via protobuffer python API;

Add unittests.
1. add test cases for formatting, equality for Scalars, and WrapAsScalars;
2. add test cases for 'casting' between different morphs of attributes;
3. add test cases for extracting scalar & scalars from attribute;
4. add test cases for CanonicalizeScalarAttrs (and fix a bug in type index offset);
5. fix gmock's library filename on windows platform.
6. clean code: use canonicalize_attrs instead of inlining the function;
7. add test cases for libpaddle.Scalar in python code.
8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.
-
Committed by Zhang Jun
* Remove log about fluid
* Remove useless forward declarations
* Fix uninitialization warning (trt onehot)
-
Committed by cyberslack_lee
-
Committed by 张春乔
* mv cumprod
* add attrs
* Update backward.yaml
* Update backward.yaml
-
Committed by wanghuancoder
* delete old dygraph op test
-
Committed by xiaoguoguo626807
* support layer_norm prim and cinn test
* enable cinn test
* fix merge conflict
* polish input for check_output_with_place
* fix merge conflict
* add more test case
* fix merge conflict
* polish test case
* polish op_test
* change ln_g rules
* modify scale is none case
* modify scale is none case
* add public_python_api for check prim
* modify setoutputgrad and fp64bug
* add todo & delete log
* delete Single***varname
* delete get varname
* modify FP64 bug
* delete op test
* recover
* fix conflict
---------
Co-authored-by: Weilong Wu <veyron_wu@163.com>
-
Committed by Leo Chen
* add flag to set chunk size
* use the flag
* add vlog
* add ut
* rename ut
-
Committed by sneaxiy
* add overflow check in memory efficient attention
* fix ci compile error
* fix ci compile error
-
Committed by houj04
* fix int8 support for full kernel
* fix ut.
-
Committed by cyberslack_lee
* fix huber_loss
* fix
* fix ops.yaml add intermediate
* fix
* fix test
-
Committed by RedContritio
* support auto generate static for one_hot_v2
* format
-