- 11 11月, 2021 5 次提交
-
-
由 Weilong Wu 提交于
* Expose func for varbase * Expose func for varbase and enhance varbase init func * Change func name and add test case for _CopyGradientWith * Rename func * Add test cases to increase coverage * Refine the logic of _to func * Replace numel() with _numel(), Add test code
-
由 LiYuRio 提交于
-
由 Wilber 提交于
-
由 wanghuancoder 提交于
* fix 2 bug: 1.skip lodtensorarray; 2.delete feed op, test=develop * program clone, test=develop
-
由 Nyakku Shigure 提交于
* add wide resnet * update pretrained weights link
-
- 10 11月, 2021 5 次提交
-
-
由 jakpiase 提交于
* added stack oneDNN FP32 op * minor change * CI fix * added skipping for gpus * fix for stack op * CI fix * CI fix * Added comment * CI fix
-
由 Aurelius84 提交于
-
由 Li Min 提交于
att, bug fix
-
由 baoachun 提交于
-
由 Jack Zhou 提交于
* fix rnn grad bug when num_layers is set 2 and dropout_prob is set 0 * add more test for rnn
-
- 09 11月, 2021 5 次提交
-
-
由 zhangbo9674 提交于
* refine layer to * delete comment * refine logic * refine code * refine pure_fp16_init * refine comment
-
由 Aurelius84 提交于
-
由 wanghuancoder 提交于
* delete profiler.cuda_profiler, test=develop * delete nvprof, test=develop * add required: gpu, test=develop * remove cuda_profiler, test=develop
-
由 Zeng Jinle 提交于
* try to fix CUDA Graph H2D copy bug * remove useless code * fix ci * fix ROCM CI * fix CUDA_VERSION * improve CI coverage
-
由 TTerror 提交于
-
- 08 11月, 2021 7 次提交
-
-
由 wanghuancoder 提交于
* Use cuda virtual memory management and merge blocks, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * window dll, test=develop * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop * use autogrowthv2 for system allocator, test=develop * remove ~CUDAVirtualMemAllocator(), test=develop * refine, test=develop * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop * fix bug, test=develop * revert system allocator, test =develop * revert multiprocessing, test=develop * fix AutoGrowthBestFitAllocatorV2 mutxt, test=develop * catch cudaErrorInitializationError when create allocator, test=develop * fix cuMemSetAccess use, test=develop * refine cuda api use, test=develop * refine, test=develop * for test, test=develop * for test, test=develop * switch to v2, test=develop * refine virtual allocator, test=develop * Record cuMemCreate and cuMemRelease, test=develop * refine, test=develop * avoid out of bounds, test=develop * rename allocator, test=develop * refine, test=develop * use PADDLE_ENFORCE_CUDA_SUCCESS, test=develop * for test,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
由 Li Min 提交于
目前的fused_attention_op不支持attn_mask=None的输入,本PR对此进行了补充,并补充了相应的单测逻辑。
-
由 Wilber 提交于
-
由 kuizhiqing 提交于
-
由 0x45f 提交于
-
由 zyfncg 提交于
-
由 xiongkun 提交于
* add scope as membership * functions complete * fix bugs: garbage collectior * deal unknow variable holder * add * 1. add unittest for operator_base * code format
-
- 06 11月, 2021 1 次提交
-
-
由 Zhen Wang 提交于
* Update the batch size used in test_resnet50_with_cinn.py. * Enable more debug info.
-
- 05 11月, 2021 4 次提交
-
-
由 Weilong Wu 提交于
-
由 Jacek Czaja 提交于
* - WIP - compilation fix - fix - fixes - fix - fix - fix again - fix - another fix - another compilation fix - fix - fix - fix - lint * - pool2d partially stripped from cache - pool2d partially stripped of caching * - compilation fix * - compilation fix * - Fix to UT of caching * - Enabling test_conv3d_mkldnn * - conv_transpose stripped of cache * - compilation fix * - fix * - fix * - compilation fix * - fix * Reverted disabling caching of conv2d * - compilation fix * - ut reverted
-
由 Zeng Jinle 提交于
-
由 Zhen Wang 提交于
* Use a more appropriate `Compile` method in cinn_launch_op. * Update the VizGraph method of CinnCompiler. * Add resnet50 model training with CINN.
-
- 04 11月, 2021 4 次提交
- 03 11月, 2021 4 次提交
-
-
由 Zhen Wang 提交于
Add FLAGS_allow_cinn_ops & FLAGS_deny_cinn_ops for controlling op types used in training with CINN. (#36842) * Update UT test_parallel_executor_run_cinn.py. * Add FLAGS_allow_cinn_ops & FLAGS_deny_cinn_ops & FLAGS_cinn_ops_delim. * Use the custom StringSplit function and remove the FLAGS_cinn_ops_delim flag. * Add FlagController test. * Apply lock to the cache_ only in CinnCompiler. * Add VizGraph & ReadableKey method for CinnCompiler. * Update the dot style of VizGraph in CinnCompiler.
-
由 fuqianya 提交于
* add squeezenet
-
由 0x45f 提交于
* Support Py3 type annotations in @to_static * support type hint for args in func * support type hint assign * if annotation and value(Constant) are diffent type, we use value type * polish type_from_annotation() * code format * code format * remove useless commentary * fix review Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
-
由 LiYuRio 提交于
-
- 02 11月, 2021 5 次提交
-
-
由 Nyakku Shigure 提交于
add shufflenetv2 Co-authored-by: Ainavo <ainavo@163.com> Co-authored-by: Npithygit <pyg20200403@163.com> Co-authored-by: Ainavo <ainavo@163.com> Co-authored-by: Npithygit <pyg20200403@163.com>
-
由 jakpiase 提交于
* added conv + hard_sigmoid fuse pass * Removed IsOptional() statements * Reverted removing optional
-
由 yujun 提交于
* add randint like * rm .cc .cu * Update unity_build_rule.cmake * try to make test pass * use python * update * update randint_like * rename test_randint_like_op -> test_randint_like * update * update randint like docs * update randint like * update * update * add bool * update randint like test * update * update
-
由 Liu-xiandong 提交于
-
由 joanna.wozna.intel 提交于
* Refactor conv2d int8 unit test * Correct according to review and add int8 check
-