- 30 8月, 2022 9 次提交
-
-
由 WangZhen 提交于
* Adapt tensor axis for argmin/max * Add UT * Polish UT
-
由 pangyoki 提交于
* move layer_norm xpu kernel to phi, test=kunlun * fix, test=kunlun
-
由 WangZhen 提交于
* [OpAttr]Adapt tensor axis for reduce_min/max/mean/sum/prod
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
由 haosicheng 提交于
fix missing keep_dim variable fix missing grad check in unittest add new test case
-
由 Aurelius84 提交于
* [OpAttr]padding_value of Pad support Tensor type * fix unittest * fix unittest * fix coverage
-
由 risemeup1 提交于
-
由 Leo Chen 提交于
* move xpu kernel to phi * delete fluid file * fix compile * add guard, test=kunlun * xpu set constant * fix xpu error, test=kunlun
-
由 WangZhen 提交于
-
- 29 8月, 2022 13 次提交
-
-
由 zhangbo9674 提交于
* add interpretercore * refine backward program id * add code * refine program * refine code * create forward/backward_program by prog2graph2prog method * test, do not care * refine code * refine code * refine code * test, do not care * add interpretorcore * add scope * refine scope create method * add jit for new_exe * solve conflict * delete unused code * polish code * polish code * refine scope in inplace * refine for datatransfer * refine _rebuild_from_desc * refine control eager deletion attr * refine used_for_jit * refine jit for infer * op size0 use ori program * polish code * refine jit * refine run_program_op ut * refine inplace * refine control * refine graph helper * refine control * refine inplace * refine buffer_share_inplace_pass * polish code * polish code * refine usage for compilerProgram * refine control * test * test core cache * refine code * refine io.py * increase test_seq2seq timeout * refine convert program * refine interpretercore_cache release * delete buildinplace * refine partial_program && io * refine code for io * test * test * test
-
由 Qi Li 提交于
* [MLU] fix compile error, test=develop * fix more compile error, test=develop
-
由 YuanRisheng 提交于
* mv elementwise add to xpu , test=kunlun * fix ci bugs, test=kunlun * fix ci bugs , test=kunlun
-
由 Sławomir Siwek 提交于
* abs relu6 fwd * abs bwd * gaussian_random_kernel and mkldnn-onednn renaming * scale kernel * whitespace * whitespace * revert scale migration * whitespaces * revert changes to gaussian kernel * whitespaces
-
由 Weilong Wu 提交于
* [XPU] migrate mul to phi;test=kunlun * rm fluid mul xpu op;test=kunlun
-
由 Chen Weihang 提交于
* migrate assign xpu kernel, test=kunlun * remove assign_value xpu, test=kunlun
-
由 wanghuancoder 提交于
* gather gather_grad gather_nd gaussian_random xpu to phi
-
由 cambriconhsq 提交于
-
由 Aurelius84 提交于
* [OpAttr]num_rows/num_colums of eye support Tensor type * fix attr cast with long type
-
由 zhangbo9674 提交于
-
由 zhangbo9674 提交于
-
由 zyfncg 提交于
* move matmul_v2 kernel of xpu from fluid to phi, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun * fix complie bug, test=kunlun
-
由 Weilong Wu 提交于
-
- 26 8月, 2022 8 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
-
由 zyfncg 提交于
* delete fill xpu op in fluid * delete fill_constant header, test=kunlun * fix npu header, test=kunlun
-
由 houj04 提交于
-
由 kangguangli 提交于
* remove fluid kernel and activate phi kernel * fix parameter error * transfer mkldnn part * modify header file path * fix compile error * transfer special case * fix lod setting and special case for layout setting * add testcase and refine code
-
由 haosicheng 提交于
* add temporal shift and grad *test=kunlun * fix reduce mean grad bug *test=kunlun
-
由 xiongkun 提交于
* while support for python container. It is convenient to convert more dynamic graph codes into static graphs. * cond support python container * 1. make select_input output shape = input[1] 2. add warning in while_loop risky assign * fix 2 problem in GPT export: 1. a bug in while_op no_need_copy_var, which causes gpu memory leakage 2. a bug in undefined_var where the stop_gradient should be False. * change name by code review * format
-
- 25 8月, 2022 7 次提交
-
-
由 Aurelius84 提交于
* [OpAttr]axis of Reverse Support Tensor type * fix coverage * fix unittest
-
由 Aurelius84 提交于
* [OpAttr]min/max of Uniform_rand support Tensor type * fix typo
-
由 kangguangli 提交于
* transfer memcpy_d2h from fluid to phi * refine arg check and add comment * fix cannot fallback to phi kernel * fix gpu_context host alloc when tensor size = 0 * add kernel for std::vector<DenseTensor> args * fix bugs in MemcpyD2HMultiIOKernel * remove useless header file * polish format * fix typo * add testcase for cudapinned place * refine check condition in test * polish error message * polish error message * remove header in fluid directory * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation * fix code style check
-
由 ronnywang 提交于
* [NPU] add run_program_op_npu * add run_program_op_npu ut
-
由 hong 提交于
* optimizer conv alog speed * code polish * remove useless code * fix compile error * fix cpu compile error * not use cudnn alog t * add search cache max number * polish code * fix cache test bug * add groups data format to conv args * fix cache test bug * fix cudnn_deterministic bug * fix test switch auto tune bug * fix test swith autotune bug; * fix conv cache bug * fix cache test error * fix cache test bug * fix windows mac compile error * fix workspace search error * update cudnn cache * fix cache test bug; test=develop * fix autotune swith test error * polish code * oplish code
-
由 Rayman 提交于
-
由 USTCKAY 提交于
-
- 24 8月, 2022 3 次提交
-
-
由 Leo Chen 提交于
* make tensor_util contains no cuda code * refine isfinite * revert ut * move isfinite function to its op * fix test * fix compile * std::isnan is not defined for int type on windows * fix windows compile * fix fp16 * fix rocm compile * revert gradient node
-
由 WangZhen 提交于
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun
-