- 05 Apr, 2022 1 commit
-
-
Committed by Zhang Ting
* switch autotune * implement AutoTuneCache * implement AutoTuneCache class * add pybind api * add dygraph test * support static mode and eager mode and improve unittests * rename the SwitchAutoTune Class and improve tests * improve AutoTuneStatus and reduce the cost of tests
-
- 04 Apr, 2022 4 commits
-
-
Committed by 0x45f
-
Committed by FlyingQianMM
-
Committed by hong
* update * fix bug
-
Committed by From00
* Add yaml for flatten_contiguous_range OP * update * Fix typos Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 03 Apr, 2022 4 commits
-
-
Committed by chentianyu03
* add concat_grad kernel * fix error * remove comment code * fix outs nullptr error * change to phi header * add concat_grad declare for standalone_executor_test
-
Committed by FlyingQianMM
* limit grid dim for index select * mv LimitGridDim into gpu_launch_config.h * fix conflicts * fix conflicts * fix code style * set block to 256 * fix grid setting * set dtype of block_dim to unsigned int
-
Committed by zyfncg
* add randperm and range yaml * add eager test for randperm
-
Committed by From00
* Add yaml config * Add yaml for flatten_contiguous_range_op * Remove h_sigmoid yaml * Fix CI errors * Fix code format * Fix flatten OP errors * Fix conflicts * Fix CI errors * Remove flatten_contiguous_range OP * Remove redundant code * Fix typos
-
- 02 Apr, 2022 9 commits
-
-
Committed by Siming Dai
* Add graph_reindex API * add graph_sample_neighbors api * Add buffer * delete VLOG * delete thrust::copy for output * add ShareDataWith * delete graph_reindex hashtable output * add graph_reindex dispensable * add reindex unittest, move memset to cuda kernel, change api * fix conflict * add reindex buffer for gpu version note * fix conflicts for op_func_generator * Add fisher_yates sampling, add dispensable, change infermeta * add dtype for edge_id * fix rocm ci and static check ci * add unittest * fix unittest * fix unittest * fix bug
-
Committed by xiongkun
* add yaml for ele_max ele_min * add yaml for: mish / logexpsum / expm1 / elementwise_pow / elementwise_floordiv
-
Committed by wuyefeilin
* move clip op to phi * fix as review * update hierarchical_sigmoid_kernel.cc * update selected_rows * update clip_kernel.cu * fix as review
-
Committed by Leo Chen
* enable new-executor on windows to test it * add message * fix ut
-
Committed by zhangkaihuo
-
Committed by niuliling123
-
Committed by Zhang Zheng
Co-authored-by: root <root@yq01-sys-hic-k8s-v100-box-a225-0186.yq01.baidu.com>
-
Committed by xiongkun
* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic * add python api or close the check_eager=True * fix the compatibility
-
Committed by zhangkaihuo
-
- 01 Apr, 2022 7 commits
-
-
Committed by hong
-
Committed by wanghuancoder
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
Committed by Chen Weihang
* add cross_entropy_with_softmax phi kernel * remove softmax_with_cross_entropy kernel * add softmax_with_cross_entropy grad kernel * remove original op kernel * refine cross entropy impl * fix pointer error * revert kernel cu change * fix xpu failed * fix cinn failed * fix npu failed * add forward sig * add check_nan_inf for pt kernel * remove repeat cmake item * fix unittest error
-
Committed by chentianyu03
* add interpolate cpu kernel * fix nullptr bug * add interpolate gpu kernel * fix unit test error * remove raw kernels * add cuda kernel impl * add infermeta * recover accidentally deleted kernels in interpolate op * fix grad x_grad name error * remove interpolate_v2_op.h * rm unused codes * fix xpu build error * fix build error * fix namespace error * add register header for npu * fix infermeta error * modify by review * add the missing args in test_trt_convert_nearest_interp_v2
-
Committed by zhangkaihuo
-
Committed by Chen Weihang
* add several yaml * polish strided slice kernel & add yaml * reorder yaml * add several yaml * revert yaml config change * resolve conflict * Update test_strided_slice_op.py
-
Committed by hong
* fix error; test=develop * update * close some yaml * fix backward attrite error; test=develop * add div test * polish code; test=develop * update * update * fix bug * update bitwise code; test=develop * update * update * fix some bug * update * revert cmakelist * fix optional bug; * fix bug * fix bug; * add backward test * open bn * update * update * revert eager_gen * polish code * fix topk error * update * update * fix bug; * move label smooth, nll loss * revert topk * fix topk label smooth bug; * remove batch_norm * remove topk * change flip infer meta * fix flip bug * update yaml * close abs * fix histogram bug * fix histogram bug * add abs * fix histogram kernel * remove expand
-
- 31 Mar, 2022 6 commits
-
-
Committed by csy0225
-
Committed by zyfncg
* rename scalar_array to int_array * update cmake * fix conflict * remove useless log
-
Committed by wuyefeilin
* mv yolov3_loss op to phi * fix as review * update operator.h
-
Committed by Zhang Ting
-
Committed by zhangkaihuo
-
Committed by limingshu
* for 1st time interface combine. * modification with kernel factory * first auto_tune version. * first version. * basic version * add warm up step. * a debug version. * optimize the functionality of class auto_tuner. * add some quotes for optimized auto_tuner class. * add some quotes for optimized auto_tuner class. * add namespace. * modification according to the advices * replace fluid header with phi header. * replace fluid header with phi header.
-
- 30 Mar, 2022 9 commits
-
-
Committed by zyfncg
* move rnn kernel to phi * move infershape of rnn to phi * fix HIP bug * rename function * fix HIP bug * fix hip bug
-
Committed by Haohongxiang
* fix uncontrolled randomness of op * fix bugs
-
Committed by Chen Weihang
Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993)" (#41065)" (#41110) This reverts commit 3a6f1135.
-
Committed by Chen Weihang
This reverts commit 054fc997.
-
Committed by YuanRisheng
-
Committed by Zhang Zheng
* Optimize the perf of top_k when k is too large * fix rocm compile * fix * only compile in cuda * fix log info
-
Committed by pangyoki
* support view strategy in eager_final state * perfect reshape kernel * fix bugs of sig * add unittest for reshape_sig * fix bugs when run coverage * fix inplace bug in final_state eager_gen * fix python_c_gen * support view strategy for final state * fix order of out and xshape in reshape * fix Coverage_CI unittest timeout error * support reshape view * fix reshape_sig * fix yml and api_base Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
-
Committed by wawltor
* fix the argsort cpu * add the test case for the paddle.argsort
-