- 24 2月, 2022 4 次提交
-
-
由 jakpiase 提交于
* added nearest interp v2 bf16 * disabled bilinear interp nhwc test * added skipping UT for gpu * added NHWC support * removed unnecessary statements * minor change * CI fix * added appropriate changes to interpolate_v1 * fix after review * minor change * minor change * revert unwanted deletions * CI fix
-
由 huangxu96 提交于
* Optimize the where_op by the elementwise_op funtion * Modified where_op & abs_grad_op by elementwise interface
-
由 niuliling123 提交于
-
由 Li Min 提交于
* optimize block config and fp16 atomicAdd perf for lookup_table_v2_grad.
-
- 23 2月, 2022 11 次提交
-
-
由 Sing_chan 提交于
* move trunc_op's infere shape * modify according to risheng's comment
-
由 Leo Chen 提交于
* move randperm to phi * fix npu * fix memory::Copy
-
由 Yang 提交于
-
由 zhouweiwei2014 提交于
* change CUDA implementaion of bernoulli OP * fix CI
-
由 ronnywang 提交于
-
由 Leo Chen 提交于
* move unbind to phi * revert infer shape * add header file * move concat_and_split to phi
-
由 Liu-xiandong 提交于
* [KP] Add elementwise add xpu, test=develop * modify the File Permissions * modify the copyright time * modify code style * modify code style
-
由 Aurelius84 提交于
* [Phi] Migrate lable_smooth_op into Phi * fix PT->PD
-
由 zhangbo9674 提交于
* add elementwise_div * refine rocm * refine code * refine op register * solve conflict * refine unittest * refine unittest precision * add rocm
-
由 chenjian 提交于
* fix RecordEvent interface * modify default level to 4 * update interface use * add const default trace level * update record event interface using * update record event interface using * update record event interface using * update operator.cc * update part2 * update part1 * update part3 * fix include profiler.h header in ps server * fix include profiler.h header in ps server * fix profiler.h header * fix profiler.h header * fix merge buf * update * fix bug * fix bug
-
由 zyfncg 提交于
* remove fill_any_like kernel in fluid and fix data transform bug * support scalar in infershpe * recover infershape in fill_and_like
-
- 22 2月, 2022 7 次提交
-
-
由 From00 提交于
* Move Real OP to phi * Move Imag OP to phi * Move Real and Imag InferShape to phi * Move Real and Imag to complex_kernel * Change PT_REGISTER_XXX to PD_REGISTER_XXX
-
由 jakpiase 提交于
-
由 Leo Guo 提交于
* Adapt to batch_norm_grad op and add align function in roi_align op for kunlun, *test=kunlun * Adapt to batch_norm, batch_norm_grad op api for kunlun, and add unit-tests of batch_norm, roi_align. *test=kunlun
-
由 xiongkun 提交于
* change Vector to std::vector and provide MixVector class as a helper wrapper class * solve the multi-gpu hang problem * remove the duplicate template instantialize * Copy vector to cpu * add CopyToCPU * xxx * final version: fix the problem of all reduce * remove mixvector dependence * fix * merge * fix code * fix by CI
-
由 Aurelius84 提交于
* [Phi] Migrate unfold_op into phi * fix im2col CPUContext template instantial * fix unfold_op.h header include problem * fix unittest * fix PT->PD
-
由 houj04 提交于
-
由 niuliling123 提交于
* Modified RandomKernel with Kernel Primitive API * update pten.h to phi.h * update * update fullKernel
-
- 21 2月, 2022 9 次提交
-
-
由 From00 提交于
* Move Abs InferShaper to phi * Fix CI error
-
由 Aurelius84 提交于
* migrate huber_loss into phi * migrate infershape * modify pten into phi
-
由 0x45f 提交于
* fix cond grad error when handle tensor array * add UT
-
由 chentianyu03 提交于
* rm reduce_sum raw kernel * remove reduce_mean kernel * remove reduce_mean kernel * reduce support int and int64_t * mean support int and int64_t type
-
由 zhangbo9674 提交于
* add elementwise_max & unittest * refine cuda register and unittest * refine unittest * refine uinttest for bf16 * refine optest * refine code * refine unittest * refine unittest
-
由 YuanRisheng 提交于
* remove infershape and Xshape * add xshape * fix bugs when run ci * fix bugs when run ci * fix bugs when run infrt test * pass converage
-
由 zmxdream 提交于
* fix. test=develop * fix. test=develop * fix code style. test=develop * fix. test=develop * fix. test=develop
-
由 sneaxiy 提交于
-
由 ShiningZhang 提交于
-
- 20 2月, 2022 4 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
由 FlyingQianMM 提交于
add index initialization in the block loop for index_sample kernel when dealing with a input tensor whose shape is larger than block_dim * grid_dim (#39736) * add block and grid loop for index_sample kernel to deal with a large-shape tensor * fix code format * limit grid dim * fix the omissive initialization of index_i in the second cycle for index_sample kernel * fix conflicts
-
由 Yiqun Liu 提交于
-
由 sneaxiy 提交于
* add more op int16 support * fix xpu ci
-
- 19 2月, 2022 5 次提交
-
-
由 Aurelius84 提交于
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
由 zyfncg 提交于
* Add selected_rows kernel for full * remove fill_constant register in fluid * fix bug without GPU * add jit_kernel_helper dependency for fc * do some refactor * add unittest for ops signatures * add coverage unittest * fix merge conflict * fix full selectew_rows bug
-
由 chenjian 提交于
* fix RecordEvent interface * modify default level to 4 * update interface use * add const default trace level * update operator.cc
-
由 zyfncg 提交于
* remove manual_api * change sig map of full and empty * fix fill_any_like_xpu_op * fix fill_any_like_xpu_op * fix problem of fill_any_like_xpu_op * fix conflict * polish code
-
由 sneaxiy 提交于
* add DistributedFusedLamb op * polish code * fix compile error * compatible with pten changement * fix rocm compile error * improve converage * update upstream/develop * fix cast_with_ptr.h * add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1 * fix clip before allreduce * add use_master_param_norm * code polish * fix bug * fix ROCM ci
-