- 01 4月, 2022 5 次提交
-
-
由 From00 提交于
* Fix compilation error for gcc-54 * Remove const for gpuStream_t
-
由 Leo Chen 提交于
* move WaitEvent/RecordEvent into try-catch * refine supportNpu
-
由 zhiboniu 提交于
-
由 Chen Weihang 提交于
* add several yaml * polish strided slice kernel & add yaml * reorder yaml * add several yaml * revert yaml config change * resolve conflict * Update test_strided_slice_op.py
-
由 hong 提交于
* fix error; test=develop * update * close some yaml * fix backward attrite error; test=develop * add div test * polish code; test=develop * update * update * fix bug * update bitwise code; test=develop * update * update * fix some bug * update * revert cmakelist * fix optional bug; * fix bug * fix bug; * add backward test * open bn * update * update * revert eager_gen * polish code * fix topk error * update * update * fix bug; * move label smooth, nll loss * revert topk * fix topk label smooth bug; * remove batch_norm * remove topk * change flip infer meta * fix flip bug * update yaml * close abs * fix histogram bug * fix histogram bug * add abs * fix histogram kernel * remove expand
-
- 31 3月, 2022 35 次提交
-
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * .
-
由 Chen Weihang 提交于
-
由 Sing_chan 提交于
* add CUDA_TOOLKIT_ROOT_DIR option in cmake command * make sure dir in cmake use / rather than \
-
由 0x45f 提交于
-
由 csy0225 提交于
-
由 zyfncg 提交于
* rename scalar_array to int_array * update cmake * fix conflict * remove useless log
-
由 Aurelius84 提交于
* [Yaml] Migrate sqrt/square/reciprocal yaml * clean file * fix unittest error
-
由 tianshuo78520a 提交于
-
由 0x45f 提交于
-
由 zlsh80826 提交于
* Enhance test_eigh_op * Use eigen decomposition to validate eigen values and vectors * Fix that TestEighBatchAPI didn't run the batched input * Enhance test_eigvalsh_op * Align cusolver tolerance to validate eigenvalues * Fix that BatchAPI didn't run the batched input * Add abs for |d_ref| * Remove comment
-
由 wanghuancoder 提交于
-
由 zhangbo9674 提交于
-
由 wangxinxin08 提交于
* add multiclass_nms3 converter
-
由 Leo Chen 提交于
* fix bug that some op has no op_role attr * add mkldnn support for new executor * fit for mkldnn data_transfer * fit for mkldnn data_transfer
-
由 TeslaZhao 提交于
* Pass compat of conv_transpose_bias_mkldnn_fuse_pass * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of transpose op, about accessing memory out of bounds of the perm param * op:transpose_op supports bool type * op:transpose_op supports bool type * Keep strided_slice op behavior consistent with slice op when starts input is less than -rank * Using DistConfig in inference
-
由 chenjian 提交于
* no * maintain old profiler * exclude new python record events for old profiler * maintain old profiler * maintain * maintain old profiler * maintain * fix cmakes
-
由 heliqi 提交于
* add flatten2,reshape2,squueze2_trt_fuse_pass test cast * add flatten2,reshape2,squueze2_trt_fuse_pass test cast * add flatten2,reshape2,squueze2_trt_fuse_pass test cast
-
由 Sing_chan 提交于
* [New API]: miminize_bfgs and miminize_lbfgs * modify for python module call correctly * add functional package, add error raise in static_graph, change assign to set_value * unify static_graph and dygraph, fix bug when x or H0 is float64 * now only accept input is tensor, put check args in utils.py, put exception test together * temp * add more detailed algorithm illustration and comment, reduce test case to limit test time in 15s * change in_dygraph_mode to in_dynamic_mode * fix bug of sample code; reduce test case to reduce test time * change dir to incubate
-
由 LiYuRio 提交于
-
由 0x45f 提交于
-
由 Chen Weihang 提交于
* fix kps compile failed * remove useless cond * add xpu for xpu_kp
-
由 wuyefeilin 提交于
* mv yolov3_loss op to phi * fix as review * update operator.h
-
由 wangguanqun 提交于
* fix load bug and add distributed strategy from pslib * add unittest * use cvm config * trainer and worker config * add unittest * add unittest * add test * code style
-
由 Leo Chen 提交于
* fix dependency of fused optimizer * add ut
-
由 chenjian 提交于
* no * fix operator summary table * update unit test
-
由 Xiaoxu Chen 提交于
* add random varaiable transformations API for paddle's distribution package * add TransformedDistribution API for paddle's probability distribution package * add random variable transformation unitests for static graph * replace math.prod which not support python3.7 with functools.reduce * add Independent and TransformedDistribution distribution * add unittests for constraint * fix typo and AffineTransform sample code error * add mean,variance,rsample abstract method for Distribution
-
由 Zhang Ting 提交于
-
由 chenjian 提交于
* no * fix bugs * fix doc according to review * fix api doc format * fix api doc according to review * fix bug and add unit test * fix record event bug * optimize chrome tracing display * fix bug * add comment * add unit test * fix a bug * fix * fix * fix format
-
由 zhangkaihuo 提交于
-
由 Liu-xiandong 提交于
* [KP] fix bug in phi kp * delete useless comment * update * update * choose the xpu kp kernel in phi
-
由 limingshu 提交于
* for 1st time interface combine. * modification with kernel factory * first auto_tune version. * first version. * basic version * add warm up step. * a debug version. * optimize the functionality of class auto_tuner. * add some quotes for optimized auto_tuner class. * add some quotes for optimized auto_tuner class. * add namespace. * modification according to the advices * replace fluid header with phi header. * replace fluid header with phi header.
-
由 Zhang Zheng 提交于
* Restrict compilation conditions of optimized topk kernel * fix
-
由 ykkk2333 提交于
-
由 Wilber 提交于
* add weight unfold pass and handle trt fc op * update * add kernel * update * update
-
由 wenbin 提交于
-