- 12 4月, 2022 5 次提交
-
-
由 Lijunhui 提交于
* init commit no push * collect comile errors * bitwise UT * fix compile problem * cancel comments * restore miss deletion * fix compilation * fix UT * NO stash in multiple branch at the same times * fix error * combine .cu from gpu and kps * replace gpu by kps * fix by Chen-weihang * Revert "Fix kps compile error in Junhui logic compare bitwise" * fix backend test * rm comments Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
由 wuyefeilin 提交于
-
由 Zhanlue Yang 提交于
* [DoubleGrad] Enabled double grad test cases in eager_mode for test_imperative_double_grad * Fixed elementwise issue * Addressed CI failures
-
由 Aurelius84 提交于
* [Phi]Fix beta1_pow/beta2_pow/skip_update data transform problem in adam/adamw * fix xpu unittest failed
-
由 FlyingQianMM 提交于
add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563) * replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op * use CUDA_KERNEL_LOOP_TYPE * fix code style * replace index_select_grad_init with SetConstant
-
- 11 4月, 2022 3 次提交
-
-
由 YuanRisheng 提交于
* add multi_dot,maxout,multiplex yaml * add code converage
-
由 chentianyu03 提交于
* add assign yaml * add assign api * add assign backward api * add assign * add assign yaml * add assign * assign yaml * add assign raw kernel and use assign_raw in yaml * merge develop branch * add missing python_api
-
由 sneaxiy 提交于
-
- 10 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
-
- 09 4月, 2022 2 次提交
-
-
由 hong 提交于
-
由 limingshu 提交于
* Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode. * Use the system cudaMalloc and cudaFree to allocate workspace during searching. * Enable switch of two kind of workspace setting methods. Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
- 08 4月, 2022 1 次提交
-
-
由 Jack Zhou 提交于
-
- 07 4月, 2022 8 次提交
-
-
由 zhouweiwei2014 提交于
-
由 YuanRisheng 提交于
* add yaml * perfect converage
-
由 zhouweiwei2014 提交于
-
由 zhangkaihuo 提交于
-
由 Siming Dai 提交于
* add one_hot gpu hint * move allow_out_of_range judgement * delete useless unittest
-
由 zhiboniu 提交于
-
由 Chen Weihang 提交于
* polish truncated normal kernel * add yaml * add truncated normal kernel and add yaml * polish unittests and yaml * import dygraph mehtod
-
由 YuanRisheng 提交于
-
- 06 4月, 2022 4 次提交
-
-
由 YuanRisheng 提交于
* add graph_send_recv yaml * deal with confict * fix compile bugs
-
由 Sing_chan 提交于
-
由 hong 提交于
* update * add conv yaml * add backward * remove useless code * fix bug * fix bug * revert fluid dygraph conv2d * remove useless infermeta function * fix meta fn deluplicat error * conv using custom impl * remove amp include * fix bug * use cudnn = true * fix test mkldnn caching bug
-
由 xiongkun 提交于
* fix less than * fix some tests * fix additional 3 unittest case
-
- 05 4月, 2022 4 次提交
-
-
由 zyfncg 提交于
* fix bug of data transform in inference executor * fix bug
-
由 Zhanlue Yang 提交于
* [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * [DoubleGrad #4] Bug Fixes to Double Grad Node Generation * Fixed yaml typo * Fixed yaml typo * fixed minor issues * [DoubleGrad PR #5] Enabled gradient computations for grad_tensors passed to paddle.grad() * Fixed minor issue * Fixed CI-Inference issue * Fixed CI-inference issues * [DoubleGrad PR #7] paddle.grad() to copy backward graph before backward run * Fixed minor issues * Fixed issue with backward graph construction logic * Fixed implementation issues with backward graph reconstruction * Fixed unittest issue * Fixed issues * [DoubleGrad PR #8] Enabled triple grads for sigmoid and matmul * Fixed issues with phi kernel * Added triple grad test case * Fixed minor issue
-
由 Guanghua Yu 提交于
-
由 Zhang Ting 提交于
* switch autotune * implement AutoTuneCache * implement AutoTuneCache class * add pybind api * add dygraph test * support static mode and eager mode and improve unittests * rename the SwitchAutoTune Class and improve tests * improve AutoTuneStatus and reduce the cost of tests
-
- 04 4月, 2022 4 次提交
-
-
由 0x45f 提交于
-
由 FlyingQianMM 提交于
-
由 hong 提交于
* update * fix bug
-
由 From00 提交于
* Add yaml for flatten_contiguous_range OP * update * Fix typos Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 03 4月, 2022 4 次提交
-
-
由 chentianyu03 提交于
* add concat_grad kernel * fix error * remove comment code * fix outs nullptr error * change to phi header * add concat_grad declare for standalone_executor_test
-
由 FlyingQianMM 提交于
* limit grid dim for index select * mv LimitGridDim into gpu_launch_config.h * fix conflicts * fix conflicts * fix code style * set block to 256 * fix grid setting * set dtype of block_dim to unsigned int
-
由 zyfncg 提交于
* add randperm and range yaml * add eager test for randperm
-
由 From00 提交于
* Add yaml config * Add yaml for flatten_contiguous_range_op * Remove h_sigmoid yaml * Fix CI errors * Fix code format * Fix flatten OP errors * Fix conflicts * Fix CI errors * Remove flatten_contiguous_range OP * Remove redundant code * Fix typos
-
- 02 4月, 2022 4 次提交
-
-
由 Siming Dai 提交于
* Add graph_reindex API * add graph_sample_neighbors api * Add buffer * delete VLOG * delete thrust::copy for output * add ShareDataWith * delete graph_reindex hashtable output * add graph_reindex dispensable * add reindex unittest, move memset to cuda kernel, change api * fix conflict * add reindex buffer for gpu version note * fix conflicts for op_func_generator * Add fisher_yates sampling, add dispensable, change infermeta * add dtype for edge_id * fix rocm ci and static check ci * add unittest * fix unittest * fix unittest * fix bug
-
由 xiongkun 提交于
* add yaml for ele_max ele_min * add yaml for: mish / logexpsum / expm1 / elemenwise_pow / elementwise_floordiv
-
由 wuyefeilin 提交于
* move clip op to phi * fix as review * update hierarchical_sigmoid_kernel.cc * update selected_rows * update clip_kernel.cu * fix as review
-
由 Leo Chen 提交于
* enable new-executor on windows to test it * add message * fix ut
-