- 17 Jun, 2020 2 commits
-
-
Committed by Leo Chen
* fix bug of prelu when rank not equal 4, test=develop
* fix prelu inference, test=develop
* fix api, test=develop
* fix shape when mode is channel, test=develop
* remove debug code, test=develop
* add unittest, test=develop
-
Committed by zlsh80826
* blockReduce opt
* launch threads align to warpSize
* reduce unnecessary shared memory for broadcast reduced value
* vectorize SoftmaxKernelWithEltadd
* add fp16 constraint
* test=develop
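The "launch threads align to warpSize" item can be pictured as rounding a requested thread count up to a whole number of warps. The sketch below is only an illustration of that arithmetic: the helper name `align_to_warp` and the constant `WARP_SIZE = 32` are assumptions of this example, not names taken from the commit.

```python
# Assumption: a warp holds 32 threads, as on current NVIDIA GPUs.
WARP_SIZE = 32


def align_to_warp(num_threads, warp_size=WARP_SIZE):
    """Round num_threads up to the nearest multiple of warp_size.

    Hypothetical helper for illustration; the commit's actual kernel
    launch code is not shown in this log.
    """
    if num_threads <= 0:
        raise ValueError("num_threads must be positive")
    # Integer ceiling division, then scale back up to a thread count.
    return (num_threads + warp_size - 1) // warp_size * warp_size
```

A block sized this way has no partially filled warp, which is one common reason to align launch sizes.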
-
- 16 Jun, 2020 5 commits
-
-
Committed by Huihuang Zheng
As the title
-
Committed by hutuxian
* Add a StatValue class in the backend to represent a stat.
* Add a singleton StatRegistry to maintain the collection of stats.
* For the sake of code neatness, only int and float types are supported, which covers most scenarios.
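The design described above (one value object per stat, plus a single shared registry) can be sketched as follows. This is a minimal Python illustration of the pattern only: the commit adds these classes to the backend, and the method names used here (`instance`, `get_or_create`, `increase`, `get`, `reset`) are hypothetical rather than the commit's actual API.

```python
import threading


class StatValue:
    """One stat holding an int or a float, the only supported types."""

    def __init__(self, initial=0):
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(initial, bool) or not isinstance(initial, (int, float)):
            raise TypeError("only int and float stats are supported")
        self._value = initial
        self._lock = threading.Lock()

    def increase(self, delta):
        with self._lock:
            self._value += delta
            return self._value

    def get(self):
        with self._lock:
            return self._value

    def reset(self):
        with self._lock:
            # Reset to zero while keeping the current numeric type.
            self._value = type(self._value)(0)


class StatRegistry:
    """Single shared collection of stats, reached through instance()."""

    _instance = None
    _instance_lock = threading.Lock()

    def __init__(self):
        self._stats = {}
        self._lock = threading.Lock()

    @classmethod
    def instance(cls):
        # Create the registry lazily, exactly once.
        with cls._instance_lock:
            if cls._instance is None:
                cls._instance = cls()
            return cls._instance

    def get_or_create(self, name, initial=0):
        with self._lock:
            if name not in self._stats:
                self._stats[name] = StatValue(initial)
            return self._stats[name]
```

For example, `StatRegistry.instance().get_or_create("batches").increase(1)` would bump a counter that any other caller can read back through the same registry.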
-
Committed by Huihuang Zheng
Some big models can time out on Windows CPU machines, so this commit adds timeout properties.
-
Committed by T8T9
-
Committed by Leo Chen
-
- 15 Jun, 2020 6 commits
-
-
Committed by Yiqun Liu
-
Committed by Jeng Bai-Cheng
This commit fixes the compile error regarding unique_ptr of IOptimizationProfile. IOptimizationProfile has a protected destructor and is managed internally by TensorRT, so the application shouldn't delete the IOptimizationProfile pointer. See the TensorRT documentation: https://docs.nvidia.com/deeplearning/sdk/tensorrt-api/c_api/classnvinfer1_1_1_i_builder.html#a9ac47e100454151d8206ac91d543299a test=develop
-
Committed by zlsh80826
* parallel move shared data test=develop
* test=develop
-
Committed by tianshuo78520a
* update readme of 1.8.2;test=document_fix
* test=develop;test=document_fix
* test=develop;test=document_fix
-
Committed by Divano
-
Committed by Huihuang Zheng
Add TSM as a ProgramTranslator unit test. The TSM code is taken from PaddlePaddle/models#4229
-
- 14 Jun, 2020 1 commit
-
-
Committed by tianshuo78520a
* test=develop
* test=develop
* fix bug
* test=develop
* test=develop
-
- 12 Jun, 2020 7 commits
-
-
Committed by Leo Chen
-
Committed by lilong12
-
Committed by Aurelius84
* add MobileNet unittest test=develop
* fix cudnn random test=develop
-
Committed by tangwei12
* fix sync barrier with barrier monitor, test=develop
-
Committed by ceci3
-
Committed by Leo Chen
* add summary_env, test=develop
* update issue template, test=develop
* refine link, test=develop
Co-authored-by: root <root@yq01-gpu-255-129-15-00.epc.baidu.com>
-
Committed by hong
* enable load_program_state run in imperative mode; test=develop
* remove useless code; test=develop
-
- 11 Jun, 2020 2 commits
-
-
Committed by liym27
[Dy2Static] Convert var.shape statements, and convert the return variables of Tensor-dependent 'if' statements to Tensor if they are not already (#24911)
* Support int and long: int or long -> six.integer_types.
* Modify test_tensor_shape: fix bug and modify comment.
* Support convert_var_shape to convert var.shape statements.
* Modify code in ifelse_simple_func.py because returning a non-Tensor from a Tensor-dependent 'if' statement is not currently supported.
* Convert the return variables of Tensor-dependent 'if' statements to Tensor if they are not already. test=develop
-
Committed by Leo Chen
* use allow list instead of white list, test=develop
* reduce include, test=develop
-
- 10 Jun, 2020 9 commits
-
-
Committed by Zhang Ting
-
Committed by liu zhengxi
-
Committed by hutuxian
Support CMatchAucCalculator based on CMatchRankAucCalculator with a new parameter ignore_rank
-
Committed by Huihuang Zheng
[Dy2stat] decrease the batch size to decrease GPU usage.
-
Committed by Zhou Wei
* windows publish package scripts,test=develop
* windows publish package scripts,test=develop
* windows publish package scripts,test=develop
-
Committed by Zhou Wei
Fix bug in CUDA_NVCC_FLAGS and CMAKE_CUDA_FLAGS
-
Committed by Leo Chen
-
Committed by wangchaochaohu
-
Committed by silingtong123
* test=develop, add log message in the function UpdateDllFlag
* test=develop, add the test
-
- 09 Jun, 2020 8 commits
-
-
Committed by Chen Weihang
-
Committed by Sylwester Fraczek
* remove gmock from ut test=develop
* coverage enabled for r+t+m fuse pass test=develop
-
Committed by liym27
* Move function 'convert_len' to file convert_operators.py
* Support transforming for statements into while statements.
* Fix bug: raise None -> return None.
* Support variables loaded and created in loops.
* Use int64 in Py2 and Py3 in function to_static_variable.
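The for-to-while rewrite mentioned above can be pictured with a hand-written pair of equivalent loops. This is only a sketch of the idea, not the output of the actual transformer: the function names are hypothetical, and plain `len()` stands in for the `convert_len` helper named in the commit.

```python
def sum_with_for(values):
    """Original form: iterate directly over the sequence."""
    total = 0
    for v in values:
        total += v
    return total


def sum_with_while(values):
    """Equivalent while form with an explicit index and bound."""
    total = 0
    i = 0
    n = len(values)  # stand-in for a length helper such as convert_len
    while i < n:
        v = values[i]  # the loop variable becomes an indexed read
        total += v
        i += 1
    return total
```

Both functions return the same result for any indexable sequence; the while form makes the loop condition explicit, which is the shape a static-graph loop construct needs.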
-
Committed by wawltor
Fix a bug in the elementwise_div op when the first variable is a scalar; use 1 in place of -1 in the shape.
-
Committed by Huihuang Zheng
[Dy2stat] Add word2vec as unittest
-
Committed by wawltor
Add 5-D and 6-D tensor support for the reduce ops. At the same time, the compile time went from 22 minutes to 21 minutes after the fix.
-
Committed by liuwei1031
-
Committed by silingtong123
-