- 31 10月, 2022 5 次提交
-
-
由 xiongkun 提交于
* add unittest for einsum-v2-trace and diagonal * repeat labels. * einsum support repeated labels. * forward is ok for diagonal and undiagonalized. TODO: check backward is ok by our theorem. * backward is ok! * fix by PR suggestions. * fix ci error * fix ci error * fix ci warning
-
由 ronnywang 提交于
* [CustomDevice] GetCCLComm add custom device support * update * update * update
-
由 kangguangli 提交于
* replace executor in conditional_block_op.run with standalone_executor * add block_id as the argument of standalone executor's method run; add print for program * fix scope bug about conditional block op * fix bug: unnecessary return of fetch value * fix typo * fix: quantization will set variable persistable, and these variables must exist in global scope * add interpretercore cache for conditional block op but not activate in default * fix bug: local scope reuse for conditional block op * reset scope when conditional block op runs * fix typo * fix typo and code style * add build scope for conditional block op * add skip for transfer_layout kernel * refind code * fix reset_scope * fix reset_scope * refine code * refine code * refine code 1. remove flag use in conditional_block_op 2. pass execution_config to BuildOpFuncList instead of individual parameter * refine code * remove the use of FLAGS_control_flow_use_new_executor_cache * change FLAGS_control_flow_use_new_executor to false
-
由 zhouweiwei2014 提交于
-
由 Wang Xin 提交于
-
- 28 10月, 2022 1 次提交
-
-
由 zyfncg 提交于
-
- 27 10月, 2022 2 次提交
-
-
由 Jacek Czaja 提交于
* - halfway transforming transpose grad - Fixes - buildable * - lint * rerunning the process
-
由 Bo Zhang 提交于
* fix reduce_any kernel data race on sharedMem * use bit operation instead of div & mod * unbranch * modified according to PR comments
-
- 26 10月, 2022 3 次提交
-
-
由 Lin Manhui 提交于
* Fix paddle.pow() bugs * Add unittest cases * Fix ut cases * Add ut cases on multiple devices
-
由 zhangkaihuo 提交于
-
由 Wang Xin 提交于
fix uninitialized, tautological-constant-out-of-range-compare and literal-conversion warning on macos (#47341)
-
- 25 10月, 2022 2 次提交
-
-
由 jakpiase 提交于
-
由 zhouweiwei2014 提交于
-
- 24 10月, 2022 4 次提交
-
-
由 zyfncg 提交于
* support selected_rows kernel for multiply in dygraph * delete useless code of slice in fluid * fix complie bug * move slice_array from fluid to phi * fix strided_slice_op_npu
-
由 Yiqun Liu 提交于
-
由 Zhang Ting 提交于
-
由 Yiqun Liu 提交于
-
- 21 10月, 2022 1 次提交
-
-
由 zhangyikun02 提交于
-
- 20 10月, 2022 2 次提交
-
-
由 JingZhuangzhuang 提交于
* Add infer prune function * Update phi.cmake * Update operators.cmake * add fusion op
-
由 thunder95 提交于
-
- 19 10月, 2022 4 次提交
-
-
由 Chen Weihang 提交于
-
由 Yiqun Liu 提交于
Enable to record whether the conv algo is got by exhaustive search to fix autotune cache bug. (#47065)
-
由 will-jl944 提交于
-
由 xiongkun 提交于
* [Dy2Static] Remove GradTransformer 1. fix einsum infershape bugs. 2. remove grad_transformer and unify paddle.grad and paddle.static.gradient. 3. add dygraph_and_dy2static_only decorator for dy2static. * fix bugs * rename
-
- 18 10月, 2022 3 次提交
-
-
由 seemingwang 提交于
* add embedding range check * change head file * change head file * fix
-
由 liu zhengxi 提交于
-
由 houj04 提交于
* [XPU] update xpu cmake to 1016. test=kunlun * fix special case of transpose op. test=kunlun
-
- 17 10月, 2022 4 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
由 OccupyMars2025 提交于
* add sparse reshape * change the dtype in all test cases to int64 * just one test case * modify comments * Update test_sparse_reshape_op.py * chang the type of "shape" from vector<int64_t> to IntArray * check whether sp_out.to_dense() is the cause of error * print sp_out * Update reshape_kernel.cc * use numpy to generate the equal paddle tensor * just check dense_tensor.numpy() * check cpu and cuda versions * Update test_sparse_reshape_op.py * supply all test cases for cpu forward coo kernel * test forward coo cuda kernel * change configuration of cuda kernel * keep only one test case * test coo cpu kernel (forward and backward) * row major or column major ??? * test cuda coo forward kernel * complete declaration and registration * Update __init__.py * rebuild * retrigger CI * add cudaMalloc and cudaMemcpy in ReshapeCooKernel and change back to row major order in a cuda dense tensor * midify minor error * test only cpu coo forward kernel * add all test cases for coo forward kernel (both cpu and gpu) * test all forward kernels (coo, csr; cpu, gpu) * add all test cases for all kinds of kernels * just retrigger CI * Update sparse_ops.yaml * Update sparse_ops.yaml * Update sparse_ops.yaml * resolve conflicts * Update sparse_ops.yaml * don't specify tensor place * new shape has -1 or 0 in it * Update unary_grad_kernel.h * correct lvalue error * code style * Update sparse_backward.yaml * Update sparse_ops.yaml * Update unary_kernel.h * Update unary.py * Update sparse_backward.yaml * Update unary.py * code style * code style * code style * Update unary.py * specify tensor place explicitly * do not use numpy array * use numpy array in unit test again * modify example code in docstring
-
由 Leo Guo 提交于
test=kunlun
-
- 14 10月, 2022 2 次提交
-
-
由 Rayman 提交于
-
由 Wang Bojun 提交于
* draft with debug print
-
- 13 10月, 2022 5 次提交
-
-
由 xiaohemaikoo 提交于
-
由 zhouweiwei2014 提交于
-
由 carryyu 提交于
-
由 Zhang Ting 提交于
* Revert "【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111)"
-
由 Zhang Zheng 提交于
* Correct the logic and remove unnecessary template param * fix error throw * fix print format * fix ci
-
- 12 10月, 2022 2 次提交
-
-
由 Zhang Ting 提交于
This reverts commit 8a5f17e8.
-
由 zhangkaihuo 提交于
-