- 27 Feb, 2023 3 commits
-
-
Committed by zhouweiwei2014
-
Committed by wangshengxiang
* [XPU] bind op scatter_nd_add
* [XPU] add more data type for op: clip, transpose2 & assign_value
-
Committed by shaojie_wang
* register bfloat16 datatype for squared l2 norm
* register bfloat16 datatype for softmax with upper triangular mask
* register bfloat16 for tril triu cuda kernel
-
- 26 Feb, 2023 2 commits
-
-
Committed by limingshu
* implement of matmul using cublasLt instead of cublas
* Update matmul_kernel_impl_via_blasLt.h
---------
Co-authored-by: zhangbopd <1299246947@qq.com>
Co-authored-by: Bo Zhang <105368690+zhangbopd@users.noreply.github.com>
Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
-
Committed by Yiqun Liu
* Enable matmul + bias fusion in fused_gat_attention.
* Add a variable to control whether using fused matmul + bias.
-
- 25 Feb, 2023 1 commit
-
-
Committed by zyfncg
* rename elementwise_heaviside to heaviside
* delete __init__.py
* fix bug
-
- 24 Feb, 2023 8 commits
-
-
Committed by yunyaoXYY
-
Committed by niuliling123
-
Committed by Yuanle Liu
-
Committed by HappyHeavyRain
* support 'backend' in static ops
* change bitwise_xx comment in python
* change bitwise_xxx comment in python
* change 'backend' and 'data_type' in GetExpectedKernelType
-
Committed by YuanRisheng
-
Committed by xiaoguoguo626807
* support prim test in OpTest
* fix cmake
* fix op test
* fix test_input_spec
* disable cinn in reduce_sum unit test
* add bfloat16 dtype for sum
* add approve rules
* polish code
* add clear jit program function
* convert grad out from tensor to numpy
* remove unnecessary code
* add only_prim flag
* fix flag
* fix op test
* add attr
* fix optest comp inplace error
* fix op test
* fix op test with guard
* add initialization of check_comp flag
* fix comp inplace error in op test
* rename check_comp with check_prim and add bfloat16 dtype convert
* rename comp_op_type to prim_op_type
* rename comp to prim
* remove useless code
* skip ci check for only prim
* add no_grad_vars and grad_outputs in prim test
* fix var_dict
* fix op test for only_prim
* fix dy2static bugs
* polish some code
* temp
* modify op test
* except cinn test
* modify bfp16
* modify pad grad
* add pad_grad dtype
* start cinn part
---------
Co-authored-by: Charles-hit <wanghao107@baidu.com>
-
Committed by HongyuJia
-
Committed by ronnywang
* [XPU] add expand_grad, isnan, meshgrid kernels
* update
-
- 23 Feb, 2023 12 commits
-
-
Committed by limingshu
-
Committed by csy0225
-
Committed by HongyuJia
* change phi tensor_gen->tensor_operants_gen
* [Tensor API] Support multiple Tensor C++ api
* [Tensor API] Unsupport prob Tensor API
* accept reviewers comment of #50731
* delete tensor_api.yaml
-
Committed by Huang Jiyi
* move fluid generator to phi
* move fluid generator to phi
* update .gitignore
* fix bugs
* fix cannot find "glog/logging.h" in "generator.h"
* fix bugs
-
Committed by limingshu
* first commit
* main codes has been developed
* fix all bugs
* add vectorize input&output
* a test for optimization_of_layer_norm_fwd
* add some changes
* fix memory coalesced access for more optimization.
* fix addition ctest error
* fix according to ci-approval
* remove change on slice
-
Committed by risemeup1
-
Committed by RuohengMa
* fix accuracy diff issue when XPU op batch_norm is added to XPU blacklist
* remap op output tensor to input tensor when the op has fallen back to CPU
* rename function name and fix bug caused by InplaceCounter
-
Committed by HongyuJia
* polish namespace
* change static_tensor_operants
* polish namespace
* support add, subtract, divide
* add unit test
* polish unittest
* fix cmake error
* solve conflicts, merge auto code-gen
* add scalar operator in tensor.h
* tensorbase
* static prim full support more datatype
* fix prim unittest
* polish codes
* fix cmake error
-
Committed by YuanRisheng
* move profiler
* fix compile bugs
-
Committed by HappyHeavyRain
* support 'complex promote' in yaml
* change the complex_promote
* change 'kron' in math.py
* change 'kron' comment in python
* change kron comment in python
* change kron comment in python
-
Committed by Jiabin Yang
* tmp gather vjp
* support gather
* remove useless code
* fix compiling error
* fix ut
* add eager test
* add eager test
* add seed
* fix cpu error
* fix transpose op compat
* remove tensor index case
* fix prim_cinn
* fix ut
* add gather composite
-
Committed by jameszhang
* kunlun support c_softmax_with_cross_entropy
* fix grad calc error
* replace mutable_data() and ShareDataWith()
* update xdnn
* update xpu toolchain to 20230215
* remove fluid from test file
-
- 22 Feb, 2023 9 commits
-
-
Committed by HongyuJia
* change phi tensor_gen->tensor_operants_gen
* [Tensor API] Support multiple Tensor C++ api
-
Committed by zhouweiwei2014
-
Committed by risemeup1
-
Committed by Shuangchi He
* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>
* pre-commit
Signed-off-by: Yulv-git <yulvchi@qq.com>
---------
Signed-off-by: Yulv-git <yulvchi@qq.com>
-
Committed by zhupengyang
-
Committed by HongyuJia
* fix py::array_t calling bug
* fix typo, implemention->implementation, test=document_fix
-
Committed by zhangkaihuo
-
Committed by Jiabin Yang
* tmp gather vjp
* support gather
* remove useless code
* fix compiling error
* fix ut
* add eager test
* add eager test
* add seed
* fix cpu error
* fix transpose op compat
* remove tensor index case
* fix prim_cinn
* fix ut
-
Committed by houj04
-
- 21 Feb, 2023 5 commits
-
-
Committed by HappyHeavyRain
* support bw invoke fw
* fix scale in static_backward.yaml
* fix the bug in tensorrt/convert
* move 'scale','sign' into ops.yaml
* add scale_grad of scale in op_compat.yaml
* change generated_static_op in CMakeLists.txt
-
Committed by cyber-pioneer
* fix flatten op map
* remove prim op all list
* add op map info of full_like
* polish code
-
Committed by QingshuChen
-
Committed by YuanRisheng
* decouple_memory
* perfect memory utils
* fix ci bugs
* fix inference bugs
* fix custom test bugs
* fix coverage bugs
* modify code according to comment
* modify namespace
* deal with compile bugs
-
Committed by Huang Jiyi
* move sequence_padding to phi
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* revert and update phi::XPUContext
-