- 24 3月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
由 Wilber 提交于
* infrt add trt engine * fix register * file generate * fix ci error * fix conflict * add copyright * update * update * update * update engine name * refactor trt code * update * update * update * update * fix conflict * update * refactor code * first commit * update pdtensor to denseTensor * code * style * code * code style * add the tensor map, test=develop * update * update * update * trt engine * update trt mlir and runtime * update mlir test * update * update * update Co-authored-by: NDannyIsFunny <912790387@qq.com> Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
由 niuliling123 提交于
-
- 23 3月, 2022 29 次提交
-
-
由 Leo Chen 提交于
-
由 王明冬 提交于
-
由 jakpiase 提交于
* added missing BF16 activations * added softplus bf16 * minor change * disabled tests for GPU
-
由 furnace 提交于
* [NPU] add npu support for conv3d and conv3d_grad * [NPU] delete failed unittests due to Ascend not support * [NPU] delete debug codes * [NPU] optimize codes, notest * [NPU] remove const_cast * [NPU] optimize for remove const_cast * [NPU] fix written errors
-
由 Zhanlue Yang 提交于
-
由 zhaocaibei123 提交于
* fix benchmark and communicator config * fix bugs of the_one_ps * multi program and fix bug in optimizer * multi program in the_one_ps * public commcontext * ps optimizer multi programs * cvm & datanorm backend * fix dim * fix unittest * fix * the one ps merge * remove comm * add DownpourLiteWorker * all * fix * fix * device worker downpour lite * fix * fix bug in global shuffle * save inference model * fix & add log * fix * remove log * fix * fix save summary * fix * fix pscore * fix * fix * fix * fix * fix * remove logs * fix * fix * fix * fix * fix * add some comments * fix Co-authored-by: Nesythan <esythan@126.com>
-
由 YuanRisheng 提交于
-
由 Weilong Wu 提交于
* disable scatter case in test_inplace_eager_fluid * Update register_hook logic * Add register_hook test cases Co-authored-by: Npangyoki <pangyoki@126.com>
-
由 Jiabin Yang 提交于
* suppor sharding api * support multi api for sharding in eager * support multi api for sharding in eager * fix test * fix test coverage
-
由 liutiexing 提交于
* AddAwaitableTask for WorkQueue Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 zyfncg 提交于
* move deformable_conv_grad to phi * move infershape of deformable_conv to phi * adjust some code format * move deformable_conv_v1 to phi
-
由 YuanRisheng 提交于
* remove flatten infermeta * fix bugs when run inference ci * fix bugs when run inference ci * fix bugs when run ci * support infrt * inplace infershape code'
-
由 hong 提交于
* fix error; test=develop * update * close some yaml * fix backward attrite error; test=develop * add div test * polish code; test=develop * remove none gbk charactor; * remove some yaml; * fix optional bug * recover yaml config * resolve confilct; test=develop * close div; test=develop
-
由 wanghuancoder 提交于
* fix some slice bug, test=develop * eager slice, test=develop * eager slice, test=develop * refine, test=develop * refine, test=develop * fix bug, test=develop * refine, test=develop * rename function name, test=develop
-
由 Zhanlue Yang 提交于
* Supported Complex2Real Conversion for Eager Dygraph * Supported Complex2Real Conversion for Eager Dygraph * Enabled complex type promotion test for matmul_v2 * Fix CI issues * Support initializing specific grad tensors to zero for selected operators * Merged adj_edges_ with GradSlotMeta * Fixed monir issue * Adjusted num runs * Recovered Eager performance tests configurations * Recovered Eager performance tests configurations * Adjusted performance tests configurations * Fixed Minor Issues with performance tests * Moved out Edge from GradSlotMeta * Fixed issues from merge * Fixed typo * Addressed review comments * Fixed merge issues * Fixed minor issues * Fixed minor issue * Fixed major issues and enabled auto_prune test cases * Fixed issues from merge
-
由 whs 提交于
-
由 KP 提交于
* Add stft_op. * Add stft_grad_op. * Add stft_op unittest. * [DLTP-45176] Add complex compatibility in static mode for stft api. * [DLTP-45176] Add complex compatibility in static mode for stft api. * Add doc. * Update unitests of stft op. * Update spectral helper. * fix coding style.
-
由 From00 提交于
* Performance optimize * Optimize GetAllocator, RWLock and ProcessUnfreedAllocation * Remove test file * Fix CI error * Fix CI errors * Fix CI errors
-
由 chenjian 提交于
* add event record for model profiling * fix format * fix format * fix code example bug * no * add profiler statistic * add profiler feature * fix bug * fix bug * fix bug * fix bug * required: gpu * required: gpu * fix bug * required: gpu * fix ci bug * fix ci error * fix ci error * upgrade document * fix doc * fix ci bug * add doc and fix bug * nothing * fix bug * fix format bug * modify format * add deprecated description for old profiler * fix bug * fix bug * fix * add load_profiler_reuslt doc * add load_profiler_reuslt doc * add load_profiler_reuslt doc * help fix old profiler sample code * add api doc * fix format * fix api doc * fix api doc format * fix api doc format * fix api doc c format * fix api doc format
-
由 zhouweiwei2014 提交于
-
由 niuliling123 提交于
-
由 jiangcheng 提交于
-
由 Weilong Wu 提交于
-
由 xiongkun 提交于
* transfer unsqueeze to phi * fix conflict * add squeeze * add infershape * fix xpu and npu error
-
由 YuanRisheng 提交于
* move activation * fix bugs when run ce
-
由 Chen Weihang 提交于
* add full_batch_size_like phi kernel * remove fill constant bs like * update year
-
由 Zhang Ting 提交于
* add kernel profiler * add gpu timer tool * remove warmup * fix rocm complilation error
-
由 Zhanlue Yang 提交于
* Removed redundant use of declarations.h * Fixed minor bug
-
由 tianshuo78520a 提交于
* Fix test case timeout * test=document_fix * test=document_fix
-
- 22 3月, 2022 8 次提交
-
-
由 Leo Chen 提交于
* async prepare deps * fix bug that std::future is not set * add ut * refine code * fix standalone ut * disable prof
-
由 hong 提交于
* move mutable_data to context alloc * move mutable_data to context alloc * remvoe duplicate code
-
由 huzhiqiang 提交于
-
由 xiongkun 提交于
* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic * add python api or close the check_eager=True * fix the compatibility * matmul * disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op
-
由 Chen Weihang 提交于
-
由 hong 提交于
* move embeding to phi; * update sig; test=develop * move reset impl to phi; test=develop * remove old register; test=develop * fix cpu bf16 bug; test=develop * fix lookup speed error * polish code * fix paddle throw type
-
由 Chen Weihang 提交于
* add reverse phi kernel * add reverse infermeta * remove original reverse op kernl & infershape
-
由 Chen Weihang 提交于
* fix context pool sync init * add lock for insert
-