- 23 May, 2022 2 commits
-
-
Committed by onecatcn
The doc was edited based on the discussion in the issue "INT32 Failed on paddle.metric.accuracy": https://github.com/PaddlePaddle/Paddle/issues/42845
-
Committed by Sing_chan
cherry-pick PR #42700
-
- 19 May, 2022 1 commit
-
-
Committed by Aurelius84
* [Dy2Stat]Modify all jit.save path into tempfile * [Dy2Stat]Modify all jit.save path into tempfile
-
- 17 May, 2022 2 commits
-
-
Committed by Chen Weihang
-
Committed by chenjian
* put_record_event_in_python_on_timeline_python * fix
-
- 16 May, 2022 1 commit
-
-
Committed by wuhuanzhou
Fix an error in the example code for paddle.lerp.
-
- 11 May, 2022 1 commit
-
-
Committed by Aurelius84
-
- 10 May, 2022 4 commits
-
-
Committed by JingZhuangzhuang
* pdnode_compare * panode compare * pdnode_compare
-
Committed by fwenguang
* [MLU] add mlu new profiler (#41138) * [MLU] add mlu new profiler * fix format * [MLU] support add callback to stream (#41831) * [MLU] add gather mlu kernel (#41969) * [MLU] add mlu activation kernels (#41751)
-
Committed by Allen Guo
set attr ignoreIndex type to string for custom_nllloss_op; partial cherry-pick of #42534
-
Committed by zhangbo9674
-
- 09 May, 2022 1 commit
-
-
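Committed by Allen Guo
add class NameScopeHelper for adding namescope info; add mappings for more kinds of optimizer state; add a compilation_progress_logger option to IpuStrategy for reporting compilation progress; some code cleanup and miscellaneous optimizations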
-
- 07 May, 2022 3 commits
-
-
Committed by wawltor
-
Committed by FlyingQianMM
Reduce the number of threads per block of deformable_psroi_pooling to solve the bug where too many resources requested for launch (PaddlePaddle#42531) (#42533)
-
Committed by Ruibiao Chen
* Reduce time variation for cuda_managed_memory_test (#42458) * Disable standalone executor for test_tensordot (#42476)
-
- 06 May, 2022 2 commits
-
-
Committed by Leo Chen
* fix wrong place * skip bf16 test if not supported (#42503)
-
Committed by wawltor
* Fix the race condition in cumsum operator * Optimize cumsum operator Co-authored-by: Leo Chen <39020268+leo0519@users.noreply.github.com>
-
- 05 May, 2022 3 commits
-
-
Committed by xiongkun
-
Committed by wangxinxin08
-
Committed by wawltor
-
- 04 May, 2022 8 commits
-
-
Committed by seemingwang
* enable graph-engine to return all id (#42319) * enable graph-engine to return all id * change vector's dimension * change vector's dimension * enlarge returned ids dimensions * change sample result's structure to fit training (#42426) * enable graph-engine to return all id * change vector's dimension * change vector's dimension * enlarge returned ids dimensions * add actual_val * change vlog * fix bug * bug fix * bug fix * fix display test * singleton of gpu_graph_wrapper * change sample result's structure to fit training * recover sample code * fix * secondary sample * add graph partition * fix pybind Co-authored-by: DesmonDay <908660116@qq.com> Co-authored-by: DesmonDay <908660116@qq.com>
-
Committed by XiaoguangHu
* fix bug of batch_norm_grad kernel with fp16 * format code
-
Committed by heliqi
* fix paddle-ort python bug * fix paddle-ort python bug
-
Committed by Kaipeng Deng
-
Committed by Guanghua Yu
* fix PTQ unittest timeout * fix ut
-
Committed by cc
Co-authored-by: joanna.wozna.intel <joanna.wozna@intel.com>
-
Committed by XiaoguangHu
-
Committed by LielinJiang
* fix PIL sample mode deprecated warning * compatible with old pil version
-
- 03 May, 2022 1 commit
-
-
Committed by Huihuang Zheng
* Fix Release 2.3 Bug * Fix format
-
- 02 May, 2022 1 commit
-
-
Committed by Zhang Zheng
* Fix test_cudnn_norm_conv and test_cudnn_bn_add_relu in CUDA11.2 * no throw in V100 for some cases
-
- 01 May, 2022 1 commit
-
-
Committed by Chen Weihang
-
- 30 Apr, 2022 6 commits
-
-
Committed by Aurelius84
* [Dy2Stat]Fix losing pre/post hook from outermost layer while jit.save * fix kwargs * fix unittest
-
Committed by Weilong Wu
-
Committed by Weilong Wu
-
Committed by Weilong Wu
-
Committed by xiongkun
* Extend python einsum interface to make einsum_v2 support multi-operands and switch it to default. * add opt_einsum dependence * add yaml and support eager model * fix by code review
-
Committed by littletomatodonkey
* fix pad3d infer shape * fix pad3d * fix pad default value * fix order * add unit test * fix unittest for ci coverage * add ndhwc check
-
- 29 Apr, 2022 3 commits
-
-
Committed by WangXi
* fix FusedResidualDropoutBias nan in v100 (#42344) * fix lod_tensor_array gc (#42377)
-
Committed by WangXi
[cherry-pick 2.3] Add fused_multi_transformer op to optimize transformer generation performance (#42311) * Add fused_multi_transformer op to optimize transformer generation performance (#41814) * fix fused_multi_transformer compile failed in cuda arch < sm53 (#42315) * fix ci timeout
-
Committed by zyfncg
* fix bug of building InferMetaContext (#42211) * add unittest
-