- 02 Apr, 2022 2 commits
-
-
Committed by Wilber
-
Committed by zhangkaihuo
-
- 01 Apr, 2022 25 commits
-
-
Committed by hong
-
Committed by Sing_chan
new cuda arch compile method;test=document_fix;test=windows_op;test=windows_ci_inference;test=windows_ci (#41259)
-
Committed by zyfncg
* add nll_loss yaml * fix nll loss * fix nll loss bug * fix bug * fix bug * fix infrt problem Co-authored-by: xiongkun <xiongkun03@baidu.com>
-
Committed by xiaoxiaohehe001
fix_reshape: for paddle-trt
-
Committed by wanghuancoder
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
Committed by pangyoki
* fix inplace fill_ and zero_ API * add eager unittest
-
Committed by pangyoki
* change inplace_version checking log * fix
-
Committed by pangyoki
* support C_ops assign * open unittest * fix clone
-
Committed by Aurelius84
* [Eager]Enhance eager_trace_op logic to support Optimizer Op * fix AsDispensable
-
Committed by liutiexing
* fix mac c++ version * update * fix apple systems
-
Committed by Chen Weihang
-
Committed by Chen Weihang
* add cross_entropy_with_softmax phi kernel * remove softmax_with_cross_entropy kernel * add softmax_with_cross_entropy grad kernel * remove original op kernel * refine cross entropy impl * fix pointer error * revert kernel cu change * fix xpu failed * fix cinn failed * fix npu failed * add forward sig * add check_nan_inf for pt kernel * remove repeat cmake item * fix unittest error
-
Committed by chentianyu03
* add interpolate cpu kernel * fix nullptr bug * add interpolate gpu kernel * fix unit test error * remove raw kernels * add cuda kernel impl * add infermeta * recover accidentally deleted kernels in interpolate op * fix grad x_grad name error * remove interpolate_v2_op.h * rm unused codes * fix xpu build error * fix build error * fix namespace error * add register header for nup * fix infermeta error * modify by review * add the missing args in test_trt_convert_nearest_interp_v2
-
Committed by Jacek Czaja
-
Committed by Liu-xiandong
* fix bug in activation xpu kp kernel * delete useless comment
-
Committed by z8hanghuan
* support multi_layer of bilstm,*test=kunlun * support multi_layer of bilstm, *test=kunlun * support multi_layer of bilstm, *test=kunlun * support multi_layer of bilstm, *test=kunlun
-
Committed by Zhanlue Yang
* [Refactor] refactored eager_gen.py PR #2 * [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes * Fixed minor issue * Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition * Fixed issues * Supported higher-order grad node generation * [DoubleGrad PR #4] Supported higher-order GradNode generation * [DoubleGrad #4] Bug Fixes to Double Grad Node Generation * Fixed yaml typo * Fixed yaml typo * fixed minor issues * Fixed minor issue
-
Committed by zmxdream
* fix cmake. test=develop * fix. test=develop * fix dep for graphs_ps_gpu. test=develop * update. test=develop * update. test=develop
-
Committed by zhangkaihuo
-
Committed by Aganlengzi
-
Committed by yaoxuefeng
* modify api name of ps accessor * update * code format
-
Committed by From00
* Fix compilation error for gcc-54 * Remove const for gpuStream_t
-
Committed by Leo Chen
* move WaitEvent/RecordEvent into try-catch * refine supportNpu
-
Committed by Chen Weihang
* add several yaml * polish strided slice kernel & add yaml * reorder yaml * add several yaml * revert yaml config change * resolve conflict * Update test_strided_slice_op.py
-
Committed by hong
* fix error; test=develop * update * close some yaml * fix backward attrite error; test=develop * add div test * polish code; test=develop * update * update * fix bug * update bitwise code; test=develop * update * update * fix some bug * update * revert cmakelist * fix optional bug; * fix bug * fix bug; * add backward test * open bn * update * update * revert eager_gen * polish code * fix topk error * update * update * fix bug; * move label smooth, nll loss * revert topk * fix topk label smooth bug; * remove batch_norm * remove topk * change flip infer meta * fix flip bug * update yaml * close abs * fix histogram bug * fix histogram bug * add abs * fix histogram kernel * remove expand
-
- 31 Mar, 2022 13 commits
-
-
Committed by ziyoujiyi
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * .
-
Committed by Sing_chan
* add CUDA_TOOLKIT_ROOT_DIR option in cmake command * make sure dir in cmake use / rather than \
-
Committed by csy0225
-
Committed by zyfncg
* rename scalar_array to int_array * update cmake * fix conflict * remove useless log
-
Committed by Aurelius84
* [Yaml] Migrate sqrt/square/reciprocal yaml * clean file * fix unittest error
-
Committed by tianshuo78520a
-
Committed by wanghuancoder
-
Committed by zhangbo9674
-
Committed by wangxinxin08
* add multiclass_nms3 converter
-
Committed by Leo Chen
* fix bug that some op has no op_role attr * add mkldnn support for new executor * fit for mkldnn data_transfer * fit for mkldnn data_transfer
-
Committed by TeslaZhao
* Pass compat of conv_transpose_bias_mkldnn_fuse_pass * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of transpose op, about accessing memory out of bounds of the perm param * op:transpose_op supports bool type * op:transpose_op supports bool type * Keep strided_slice op behavior consistent with slice op when starts input is less than -rank * Using DistConfig in inference
-
Committed by chenjian
* no * maintain old profiler * exclude new python record events for old profiler * maintain old profiler * maintain * maintain old profiler * maintain * fix cmakes
-
Committed by heliqi
* add flatten2,reshape2,squeeze2_trt_fuse_pass test case * add flatten2,reshape2,squeeze2_trt_fuse_pass test case * add flatten2,reshape2,squeeze2_trt_fuse_pass test case
-