- 23 3月, 2022 12 次提交
-
-
由 From00 提交于
* Performance optimize * Optimize GetAllocator, RWLock and ProcessUnfreedAllocation * Remove test file * Fix CI error * Fix CI errors * Fix CI errors
-
由 chenjian 提交于
* add event record for model profiling * fix format * fix format * fix code example bug * no * add profiler statistic * add profiler feature * fix bug * fix bug * fix bug * fix bug * required: gpu * required: gpu * fix bug * required: gpu * fix ci bug * fix ci error * fix ci error * upgrade document * fix doc * fix ci bug * add doc and fix bug * nothing * fix bug * fix format bug * modify format * add deprecated description for old profiler * fix bug * fix bug * fix * add load_profiler_reuslt doc * add load_profiler_reuslt doc * add load_profiler_reuslt doc * help fix old profiler sample code * add api doc * fix format * fix api doc * fix api doc format * fix api doc format * fix api doc c format * fix api doc format
-
由 zhouweiwei2014 提交于
-
由 niuliling123 提交于
-
由 jiangcheng 提交于
-
由 Weilong Wu 提交于
-
由 xiongkun 提交于
* transfer unsqueeze to phi * fix conflict * add squeeze * add infershape * fix xpu and npu error
-
由 YuanRisheng 提交于
* move activation * fix bugs when run ce
-
由 Chen Weihang 提交于
* add full_batch_size_like phi kernel * remove fill constant bs like * update year
-
由 Zhang Ting 提交于
* add kernel profiler * add gpu timer tool * remove warmup * fix rocm complilation error
-
由 Zhanlue Yang 提交于
* Removed redundant use of declarations.h * Fixed minor bug
-
由 tianshuo78520a 提交于
* Fix test case timeout * test=document_fix * test=document_fix
-
- 22 3月, 2022 16 次提交
-
-
由 Leo Chen 提交于
* async prepare deps * fix bug that std::future is not set * add ut * refine code * fix standalone ut * disable prof
-
由 hong 提交于
* move mutable_data to context alloc * move mutable_data to context alloc * remvoe duplicate code
-
由 huzhiqiang 提交于
-
由 xiongkun 提交于
* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic * add python api or close the check_eager=True * fix the compatibility * matmul * disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op
-
由 Chen Weihang 提交于
-
由 hong 提交于
* move embeding to phi; * update sig; test=develop * move reset impl to phi; test=develop * remove old register; test=develop * fix cpu bf16 bug; test=develop * fix lookup speed error * polish code * fix paddle throw type
-
由 Chen Weihang 提交于
* add reverse phi kernel * add reverse infermeta * remove original reverse op kernl & infershape
-
由 Chen Weihang 提交于
* fix context pool sync init * add lock for insert
-
由 Jiabin Yang 提交于
-
由 zyfncg 提交于
* optimize performance of C++ API * remove stop_data_transform flag temparily
-
由 zyfncg 提交于
* replace Backend by Place in C++ API * fix left code * fix test_to_api bug
-
由 Siming Dai 提交于
* add out_size shape for graph_send_recv * fix bug in register kernel: no const int& support * add out_size in infermeta * change unittest * fix unittest * fix out_size default value * fix doc * delete arg mapping * add sig * move -1 to 0 * move -1 to 0
-
由 Chen Weihang 提交于
-
由 chentianyu03 提交于
* modify out and out_grad order in reduce_grad_kernel * delete unsed boolReduceKernel * fix conflict
-
由 pangyoki 提交于
* support inplace in eager_final state * little change * little bug
-
由 crystal 提交于
* fix group_norm address misalignment * fix vectorize * fix code * fix vectorize length * optimize code
-
- 21 3月, 2022 12 次提交
-
-
由 0x45f 提交于
* Refine to_tensor for eager mode * support gpu_pinned * refine code * support gpu_pinned copy_to * fix layer.__setattr__ * support to_tensor for gpu_pinned * fix unit test * refine gpu_pinned * restore the original code * add is_gup_pinned() and refine eager.Tensor._copy_to()
-
由 niuliling123 提交于
* Support MaskedSelectGrad op with Kernel Primitive API
-
由 sneaxiy 提交于
-
由 hong 提交于
* remove duplicate code; * add some line; test=document_fix
-
由 hong 提交于
* switch eager mode and change it * set default is eager * set default is eager * fix error; test=develop * fix some error; test=develop * update * upd * update code; test=develop * update * fix some bug; test=develop * fix bug; test=develop * fix bug; test=develop * fix bug; test=develop * fix error; test=develop * format; test=develop Co-authored-by: NJiabinYang <360788950@qq.com>
-
由 yaoxuefeng 提交于
-
由 chentianyu03 提交于
* add pad3d kernel into phi * add pad3d infermeta * fix build error * remove raw pad3d infershape function
-
由 Chen Weihang 提交于
* add phi device context pool * change year * fix compile error * fix operator = error * refine init impl * polish details * refine init impl
-
由 zhangyikun02 提交于
-
由 From00 提交于
* Move conv-transpose OPs to phi * Fix CI errors * Fix CI errors
-
由 石晓伟 提交于
-
由 Siming Dai 提交于
* gpu ps graph engine * remove logs * Add neighbor sampling method * Add actual_sample_size and offset for sampling * Delete Chinese comment * Fix code style Co-authored-by: seemingwang <zsasuke@qq.com>
-