- 20 Apr, 2022 1 commit
-
-
Committed by BrilliantYuKaimin
* Add operator description for logspace * Add shape inference for logspace * Add kernel implementation for logspace * Add logspace interface in Python * Add unit tests for logspace * Add logspace * Update logspace_kernel.cu * Update logspace_op.cc * Adjust code formatting * Update doc of logspace * Update tensor.py * Update logspace_op.cc * Update logspace_kernel.cc * Update logspace_kernel.cu * Update test_logspace.py * Adjust the position of logspace * Adjust code formatting
-
- 19 Apr, 2022 1 commit
-
-
Committed by YuanRisheng
[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from ElementwiseKernel(Part1) (#41806) * separate add/div/sub/mul from elementwise * delete code * fix compile bugs * deal with conflict * fix bugs when compile * fix windows unit test bug * fix ci coverage bugs
-
- 18 Apr, 2022 2 commits
-
-
Committed by Lijunhui
-
Committed by Siming Dai
* add eids result for graph_sample_neighbors * fix bug * move fisher_yates sample to warp * add cpu eid output * delete comment * delete comment * change nullptr placeholder * optimize sample kernel * fix mutable_data
-
- 15 Apr, 2022 2 commits
-
-
Committed by chentianyu03
* split reduce_kernel * rm reduce_kernel in cmake * split reduce_grad kernels * fix cmake build error * format code * fix standalone_executor_test error
-
Committed by hong
* try to fix batch norm memory issue * fix batch norm memory alloc bug * polish some code
-
- 14 Apr, 2022 3 commits
-
-
Committed by Lijunhui
* register elementwise_xxx
-
Committed by Aurelius84
-
Committed by Chen Weihang
* change dispatch to visit * resolve conflict
-
- 12 Apr, 2022 6 commits
-
-
Committed by hong
* add layer norm infermeta * add layer norm yaml * polish layer norm infer meta * add layer norm to black list
-
Committed by Lijunhui
* init commit no push * collect compile errors * bitwise UT * fix compile problem * cancel comments * restore miss deletion * fix compilation * fix UT * NO stash in multiple branch at the same times * fix error * combine .cu from gpu and kps * replace gpu by kps * fix by Chen-weihang * Revert "Fix kps compile error in Junhui logic compare bitwise" * fix backend test * rm comments Co-authored-by: Chen Weihang <chenweihang@baidu.com>
-
Committed by wuyefeilin
-
Committed by Zhanlue Yang
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures
-
Committed by Aurelius84
* [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw * fix xpu unittest failed
-
Committed by FlyingQianMM
add an inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563) * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op * use CUDA_KERNEL_LOOP_TYPE * fix code style * replace index_select_grad_init with SetConstant
-
- 11 Apr, 2022 1 commit
-
-
Committed by sneaxiy
-
- 07 Apr, 2022 4 commits
-
-
Committed by zhouweiwei2014
-
Committed by Siming Dai
* add one_hot gpu hint * move allow_out_of_range judgement * delete useless unittest
-
Committed by zhiboniu
-
Committed by Chen Weihang
* polish truncated normal kernel * add yaml * add truncated normal kernel and add yaml * polish unittests and yaml * import dygraph method
-
- 06 Apr, 2022 3 commits
-
-
Committed by YuanRisheng
* add graph_send_recv yaml * deal with conflict * fix compile bugs
-
Committed by hong
* update * add conv yaml * add backward * remove useless code * fix bug * fix bug * revert fluid dygraph conv2d * remove useless infermeta function * fix meta fn duplicate error * conv using custom impl * remove amp include * fix bug * use cudnn = true * fix test mkldnn caching bug
-
Committed by xiongkun
* fix less than * fix some tests * fix additional 3 unittest case
-
- 05 Apr, 2022 2 commits
-
-
Committed by zyfncg
* fix bug of data transform in inference executor * fix bug
-
Committed by Guanghua Yu
-
- 04 Apr, 2022 2 commits
-
-
Committed by FlyingQianMM
-
Committed by hong
* update * fix bug
-
- 03 Apr, 2022 3 commits
-
-
Committed by chentianyu03
* add concat_grad kernel * fix error * remove comment code * fix outs nullptr error * change to phi header * add concat_grad declare for standalone_executor_test
-
Committed by FlyingQianMM
* limit grid dim for index select * mv LimitGridDim into gpu_launch_config.h * fix conflicts * fix conflicts * fix code style * set block to 256 * fix grid setting * set dtype of block_dim to unsigned int
-
Committed by zyfncg
* add randperm and range yaml * add eager test for randperm
-
- 02 Apr, 2022 5 commits
-
-
Committed by Siming Dai
* Add graph_reindex API * add graph_sample_neighbors api * Add buffer * delete VLOG * delete thrust::copy for output * add ShareDataWith * delete graph_reindex hashtable output * add graph_reindex dispensable * add reindex unittest, move memset to cuda kernel, change api * fix conflict * add reindex buffer for gpu version note * fix conflicts for op_func_generator * Add fisher_yates sampling, add dispensable, change infermeta * add dtype for edge_id * fix rocm ci and static check ci * add unittest * fix unittest * fix unittest * fix bug
-
Committed by wuyefeilin
* move clip op to phi * fix as review * update hierarchical_sigmoid_kernel.cc * update selected_rows * update clip_kernel.cu * fix as review
-
Committed by Leo Chen
* enable new-executor on windows to test it * add message * fix ut
-
Committed by Zhang Zheng
Co-authored-by: root <root@yq01-sys-hic-k8s-v100-box-a225-0186.yq01.baidu.com>
-
Committed by xiongkun
* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic * add python api or close the check_eager=True * fix the compatibility
-
- 01 Apr, 2022 5 commits
-
-
Committed by hong
-
Committed by wanghuancoder
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
Committed by Chen Weihang
* add cross_entropy_with_softmax phi kernel * remove softmax_with_cross_entropy kernel * add softmax_with_cross_entropy grad kernel * remove original op kernel * refine cross entropy impl * fix pointer error * revert kernel cu change * fix xpu failed * fix cinn failed * fix npu failed * add forward sig * add check_nan_inf for pt kernel * remove repeat cmake item * fix unittest error
-
Committed by chentianyu03
* add interpolate cpu kernel * fix nullptr bug * add interpolate gpu kernel * fix unit test error * remove raw kernels * add cuda kernel impl * add infermeta * recover accidentally deleted kernels in interpolate op * fix grad x_grad name error * remove interpolate_v2_op.h * rm unused codes * fix xpu build error * fix build error * fix namespace error * add register header for nup * fix infermeta error * modify by review * add the missing args in test_trt_convert_nearest_interp_v2
-
Committed by Chen Weihang
* add several yaml * polish strided slice kernel & add yaml * reorder yaml * add several yaml * revert yaml config change * resolve conflict * Update test_strided_slice_op.py
-