- 25 8月, 2022 10 次提交
-
-
由 Aurelius84 提交于
* [OpAttr]min/max of Uniform_rand support Tensor type * fix typo
-
由 kangguangli 提交于
* transfer memcpy_d2h from fluid to phi * refine arg check and add comment * fix cannot fallback to phi kernel * fix gpu_context host alloc when tensor size = 0 * add kernel for std::vector<DenseTensor> args * fix bugs in MemcpyD2HMultiIOKernel * remove useless header file * polish format * fix typo * add testcase for cudapinned place * refine check condition in test * polish error message * polish error message * remove header in fluid directory * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation * fix code style check
-
由 ronnywang 提交于
* [NPU] add run_program_op_npu * add run_program_op_npu ut
-
由 Sing_chan 提交于
* make full_like support double_max in dygraph * fix bug
-
由 hong 提交于
* optimizer conv alog speed * code polish * remove useless code * fix compile error * fix cpu compile error * not use cudnn alog t * add search cache max number * polish code * fix cache test bug * add groups data format to conv args * fix cache test bug * fix cudnn_deterministic bug * fix test switch auto tune bug * fix test swith autotune bug; * fix conv cache bug * fix cache test error * fix cache test bug * fix windows mac compile error * fix workspace search error * update cudnn cache * fix cache test bug; test=develop * fix autotune swith test error * polish code * oplish code
-
由 Rayman 提交于
-
由 Wilber 提交于
-
由 USTCKAY 提交于
-
由 haosicheng 提交于
-
由 zhoutianzi666 提交于
-
- 24 8月, 2022 13 次提交
-
-
由 Wang Bojun 提交于
* fix bug fix
-
由 ShenLiang 提交于
* fix utest * fix utest * fix utest * fix log * fix random utest
-
由 Leo Chen 提交于
* make tensor_util contains no cuda code * refine isfinite * revert ut * move isfinite function to its op * fix test * fix compile * std::isnan is not defined for int type on windows * fix windows compile * fix fp16 * fix rocm compile * revert gradient node
-
由 Yuanle Liu 提交于
-
由 Jiabin Yang 提交于
-
由 HongyuJia 提交于
* add legacy_api.yaml * set merged_momentum inplace only * support inplace optional<vector<tensor>> * add dygraph_mode api * optimize TensorToConstDenseTensorPtr
-
由 WangZhen 提交于
-
由 Wilber 提交于
-
由 mengqingchun02 提交于
* support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support beam_search operator on xpu. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun * support fp16 of adam operator in xpu environment. test=kunlun
-
由 WangZhen 提交于
* Adapt minlength attr for bincount
-
由 wenbin 提交于
* fix * optimize
-
由 zhaoying9105 提交于
-
由 Wilber 提交于
-
- 23 8月, 2022 9 次提交
-
-
由 pangyoki 提交于
-
由 zhaoyingli 提交于
* add quant pass
-
由 LiYuRio 提交于
-
由 niuliling123 提交于
-
由 OccupyMars2025 提交于
-
由 LiYuRio 提交于
-
由 OccupyMars2025 提交于
* Update scope.h * typo * Update dense_tensor.inl
-
由 YuanRisheng 提交于
* move distribute_fpn_proposals * fix some code * fix yaml bugs * add set dtype * move proposal_impl to funcs * fix compile bugs
-
由 ronnywang 提交于
* [CustomDevice] add profiler apis * migrate CalculateEstOccupancy into cuda_tracer * update * add ut
-
- 22 8月, 2022 7 次提交
-
-
由 joanna.wozna.intel 提交于
* Add int8 support for matmul+elementwiae_add fuse * Corrections after review and ernie test fix
-
由 Sławomir Siwek 提交于
* merge conv_concat_relu to conv_act * fix typo * extend unit test * reuse existing gpd * codestyle * enforce mkldnn conv
-
由 zhoutianzi666 提交于
-
由 wanghuancoder 提交于
some python c api use final state
-
由 Yuanle Liu 提交于
-
由 Hui Zhang 提交于
* fix jit layer function * fix comment * fix comment
-
由 ronnywang 提交于
-
- 20 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* pylayer detach output tensor if it is equal with input * pylayer detach output tensor if it is equal with input
-