- 18 6月, 2020 8 次提交
-
-
由 Zhou Wei 提交于
* add new api (set_global_initializer/reset_global_initializer),test=develop * add new api (set_global_initializer/reset_global_initializer),test=develop * fix doc and example code of set_global_initializer,test=develop
-
由 Chen Weihang 提交于
* add for iter var list, test=develop * add enumerate unittest, test=develop
-
由 liym27 提交于
-
由 Jacek Czaja 提交于
* Start implementing int8 eltwise add test=develop * - Fix to Michal PR * - Fix test=develop * - Lint fixes test=develop * - Added checking if elementwise_mul can be used test=develop * - Added attribs to skip_attrs_set test=develop * - Improved broadcasting test=develop - fixes to compilation - fix - fix - Lint fixes test=develop * - removed redundant condition test=develop Co-authored-by: NMichal Gallus <michal.gallus@intel.com>
-
由 Zhaolong Xing 提交于
test=develop
-
由 LielinJiang 提交于
* add relu for lenet, test=develop * fix test model, test=develop
-
由 FlyingQianMM 提交于
-
由 石晓伟 提交于
-
- 17 6月, 2020 5 次提交
-
-
由 liym27 提交于
* The arg of append() can be not Tensor temporarily. * Add Seq2Seq as ProgramTranslator Unit Test. * set dtype of vocab_size_tensor to int64 to pass Windows-CI.
-
由 cc 提交于
-
由 石晓伟 提交于
-
由 Leo Chen 提交于
* fix bug of prelu when rank not equal 4, test=develop * fix prelu inference, test=develop * fix api, test=develop * fix shape when mode is chennel, test=develop * remove debug code, test=develop * add unittest, test=develop
-
由 zlsh80826 提交于
* blockReduce opt * launch threads align to warpSize * reduce unnecessary shared memory for broadcast reduced value * vectorize SoftmaxKernelWithEltadd * add fp16 constrain * test=develop
-
- 16 6月, 2020 5 次提交
-
-
由 Huihuang Zheng 提交于
As the title
-
由 hutuxian 提交于
* Add a StatValue class in the backend to represent a stat. * Add a singleton StatRegistry to maintain the collection of stats. * For the sake of code neatness, we only support type of int and float, which can cover most of the scenarios.
-
由 Huihuang Zheng 提交于
Some big models can timeout on Windows CPU machine. I added some timeout properties.
-
由 T8T9 提交于
-
由 Leo Chen 提交于
-
- 15 6月, 2020 6 次提交
-
-
由 Yiqun Liu 提交于
-
由 Jeng Bai-Cheng 提交于
This commit fixs the compiling bug regarding unique_ptr of IOptimizationProfile. IOptimizationProfile has protected dtor and is controlled by TensorRT internally. Application shouldn't delete the pointer of IOptimizationProfile. See TensorRT document: https://docs.nvidia.com/deeplearning/sdk/tensorrt-api/c_api/classnvinfer1_1_1_i_builder.html#a9ac47e100454151d8206ac91d543299a test=develop
-
由 zlsh80826 提交于
* parallel move shared data test=develop * test=develop
-
由 tianshuo78520a 提交于
* update readme of 1.8.2;test=document_fix * test=develop;test=document_fix * test=develop;test=document_fix
-
由 Divano 提交于
-
由 Huihuang Zheng 提交于
Add TSM as ProgramTranslator Unit Test. The TSM code is referred from PaddlePaddle/models#4229
-
- 14 6月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
* test=develop * test=develop * fix bug * test=develop * test=develop
-
- 12 6月, 2020 7 次提交
-
-
由 Leo Chen 提交于
-
由 lilong12 提交于
-
由 Aurelius84 提交于
* add MobileNet unittest test=develop * fix cudnn random test=develop
-
由 tangwei12 提交于
* fix sync barrier with barrier monitor, test=develop
-
由 ceci3 提交于
-
由 Leo Chen 提交于
* add summary_env, test=develop * update issue template, test=develop * refine link, test=develop Co-authored-by: Nroot <root@yq01-gpu-255-129-15-00.epc.baidu.com>
-
由 hong 提交于
* enable load_program_state run in imperative mode; test=develop * remove useless code; test=develop
-
- 11 6月, 2020 2 次提交
-
-
由 liym27 提交于
[Dy2Static]Convert var.shape stmt and Convert the return variables of Tensor-dependent 'if' staments to Tensor if it not (#24911) * Support int and long: int or long -> six.integer_types. * Modify test_tensor_shape: fix bug and modify comment. * Support convert_var_shape to convert var.shape stmt * Modify code in ifelse_simple_func.py because don't support return non-Tensor in Tensor-dependent 'if' stament currently. * Convert the return variables of Tensor-dependent 'if' staments to Tensor if it not. test=develop
-
由 Leo Chen 提交于
* use allow list instead of white list, test=develop * reduce include, test=develop
-
- 10 6月, 2020 6 次提交
-
-
由 Zhang Ting 提交于
-
由 liu zhengxi 提交于
-
由 hutuxian 提交于
Support CMatchAucCalculator based on CMatchRankAucCalculator with a new parameter ignore_rank
-
由 Huihuang Zheng 提交于
[Dy2stat] decrease the batch size to decrease GPU usage.
-
由 Zhou Wei 提交于
* windows publish package scripts,test=develop * windows publish package scripts,test=develop * windows publish package scripts,test=develop
-
由 Zhou Wei 提交于
fix bug in CUDA_NVCC_FALS and CMAKE_CUDA_FLAGS
-