- 13 7月, 2020 2 次提交
-
-
由 Huihuang Zheng 提交于
Add Similarity Net as unit test. During the unit test, we found three problems: 1. The run_program_op has memory optimization error when running dy2stat net multiple times. 2. The support for SelectedRows can cause problem in dy2stat. 3. The return grammar has problem. This PR fixes the 1. problem but modify codes for the 2. 3. problems to make PR smaller. I will fix those two problems in the next PR(s)
-
由 yaoxuefeng 提交于
-
- 11 7月, 2020 2 次提交
-
-
由 Zhen Wang 提交于
* Add the imperative quantization aware training. * This is the python part of Imperative QAT. test=develop
-
由 Chen Weihang 提交于
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 10 7月, 2020 2 次提交
-
-
由 zlsh80826 提交于
* add explicit specialization * add skiplayernorm vector load if available * test=develop
-
由 zhupengyang 提交于
-
- 09 7月, 2020 1 次提交
-
-
由 Zhen Wang 提交于
-
- 08 7月, 2020 4 次提交
-
-
由 Jacek Czaja 提交于
-
由 GaoWei8 提交于
* fix concat shape error test=develop
-
由 ceci3 提交于
* fix bn & in in dy, test=develop * update instance_norm,test=develop * fix bugs,test=develop * add more case in unittest,test=develop * fix,test=develop * fix,test=develop
-
- 07 7月, 2020 2 次提交
-
-
由 zhupengyang 提交于
-
由 zhupengyang 提交于
-
- 06 7月, 2020 2 次提交
-
-
由 WuHaobo 提交于
-
由 Zhang Ting 提交于
-
- 03 7月, 2020 1 次提交
-
-
由 Kaipeng Deng 提交于
* add mish op. test=develop
-
- 02 7月, 2020 1 次提交
-
-
由 Yang Zhang 提交于
* Add `matrix_nms_op` test=develop * Make ci happy test=develop * Exit early when no detection test=develop * Fix license year test=develop * Output index as well test=develop * Match nms2 lod behavior and add `return_index` flag test=develop * Make CI happy test=develop * Fix wording test=develop
-
- 01 7月, 2020 1 次提交
-
-
由 Chengmo 提交于
* test=develop, fix_embedding
-
- 30 6月, 2020 3 次提交
- 29 6月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 28 6月, 2020 1 次提交
-
-
由 liym27 提交于
-
- 26 6月, 2020 1 次提交
-
-
由 liu zhengxi 提交于
* fix beam_search infershape, test=develop * fix beam search op unittest, test=develop
-
- 23 6月, 2020 1 次提交
-
-
由 Adam 提交于
* Refactor of conv fp32 oneDNN operator test=develop * Formatting fix test=develop * Return Enforces test=develop * GetWeights improvements test=develop
-
- 21 6月, 2020 1 次提交
-
-
由 Shibo Tao 提交于
* don't re-generate header file if content doesn't change. test=develop * add copy_if_different function. test=develop
-
- 18 6月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
* Start implementing int8 eltwise add test=develop * - Fix to Michal PR * - Fix test=develop * - Lint fixes test=develop * - Added checking if elementwise_mul can be used test=develop * - Added attribs to skip_attrs_set test=develop * - Improved broadcasting test=develop - fixes to compilation - fix - fix - Lint fixes test=develop * - removed redundant condition test=develop Co-authored-by: NMichal Gallus <michal.gallus@intel.com>
-
- 17 6月, 2020 2 次提交
-
-
由 Leo Chen 提交于
* fix bug of prelu when rank not equal 4, test=develop * fix prelu inference, test=develop * fix api, test=develop * fix shape when mode is chennel, test=develop * remove debug code, test=develop * add unittest, test=develop
-
由 zlsh80826 提交于
* blockReduce opt * launch threads align to warpSize * reduce unnecessary shared memory for broadcast reduced value * vectorize SoftmaxKernelWithEltadd * add fp16 constrain * test=develop
-
- 16 6月, 2020 1 次提交
-
-
由 Leo Chen 提交于
-
- 14 6月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
* test=develop * test=develop * fix bug * test=develop * test=develop
-
- 12 6月, 2020 2 次提交
- 10 6月, 2020 2 次提交
-
-
由 Zhang Ting 提交于
-
由 wangchaochaohu 提交于
-
- 09 6月, 2020 2 次提交
-
-
由 Chen Weihang 提交于
-
由 wawltor 提交于
Add the support the 5d,6d tensor support for the reduce ops; Add the same time, the compile time, it was 22 minutes, it was 21 minutes after fixed.
-
- 08 6月, 2020 2 次提交
-
-
由 mapingshuo 提交于
fixes the CUDAPlace info in the Print op
-
由 Aurelius84 提交于
* Support LoDTensorArray in reverse_op test=develop * polish en doc and unittest code test=develop * refine sample code test=develop * add example of LoDTensorArray test=develop * fix typo test=develop
-
- 05 6月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* refine isfinite, test=develop * use namespace std of isfinite, test=develop, test=win_gpu
-