- 16 11月, 2020 3 次提交
-
-
由 Guo Sheng 提交于
* Fix gradients with ignore_idx in softmax_with_cross_entropy. test=develop * Fix gradients with ignore_idx in softmax_with_cross_entropy on cpu. Remove softmax_with_cross_entropy from op_threshold_white_list. test=develop * Fix test_softmax_cross_entropy_op.py. test=develop
-
由 Leo Chen 提交于
-
由 Leo Chen 提交于
* register op_version for addto * upgrade pass capability * change eq to le * change eq to le * fix merge
-
- 13 11月, 2020 3 次提交
- 12 11月, 2020 2 次提交
-
-
由 Shang Zhizhou 提交于
* skip_layernorm_op done * add unittest * slice op convertor support trt < 6 * skip_layernorm only work in ernie
-
由 joejiong 提交于
As the title
-
- 11 11月, 2020 2 次提交
-
-
由 wangchaochaohu 提交于
-
由 wangchaochaohu 提交于
* upgrade pass capability
-
- 10 11月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 09 11月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 08 11月, 2020 1 次提交
-
-
由 YUNSHEN XIE 提交于
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix * test for limiting ut exec time as 15S * fix an error caused by cannot find ut * fix some error * can not find test_transformer * fix error caused by ut not run in windows * fix error caused by Compiler Options * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt * setting timeout value to 120s for old ut * add the timeout value setting * fix error caused by ut only run in coverage_ci * add analyzer_transformer_profile_tester * fix some error * fix some error * fix error with inference option * fix error with inference option setting as ON_INFER * add some ut to set timeout * modified some option * fix error * fix some timeout error * fix error * fix error * fix timeout for test_analyzer_bfloat16_resnet50 * fix error * setting timeout properity for some ut * first pr for new ut timeout as 15S
-
- 06 11月, 2020 3 次提交
-
-
由 taixiurong 提交于
-
由 QingshuChen 提交于
*test=kunlun
-
由 joanna.wozna.intel 提交于
* Add bfloat16 softmax and gelu * Add pass attr bfloat16_enabled_op_types * Changes from review
-
- 05 11月, 2020 2 次提交
-
-
由 石晓伟 提交于
-
由 Jacek Czaja 提交于
* - Added sum bf16 oneDNN test=develop * - Fix to UT of sum bf16 test=develop
-
- 04 11月, 2020 2 次提交
-
-
由 Leo Chen 提交于
* add broadcast_shape api * add ut * follow comments * add example code, test=dodument_fix * update example code, test=document_fix
-
由 石晓伟 提交于
* enhance the op_version_registry, test=develop * add unittests, test=develop * enhance the op_version_registry, test=develop * fix bugs, test=develop * revert pybind_boost_headers.h, test=develop * fix a attribute bug, test=develop
-
- 03 11月, 2020 5 次提交
-
-
由 Shang Zhizhou 提交于
* fp16 result ok * change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS * auto detect special slice op converter for ernie with trt oss * ernie oss only support fp16 * fix special_slice_plugin serialize bug * matmul in tensorrt ok * ernie unittest ok * add matmul tensorrt unittest * remove demo code
-
由 Jacek Czaja 提交于
-
由 Wilber 提交于
-
由 Leo Chen 提交于
* pool2d cuda kernel supports fp16 * fix compile issue of template * add ut
-
由 Guo Sheng 提交于
* Add rnn_op. test=develop * Fix rnn_op grad maker's drop_empty_grad. test=develop
-
- 02 11月, 2020 1 次提交
-
-
由 wangguanzhong 提交于
* add generate_proposals_v2 op
-
- 29 10月, 2020 2 次提交
-
-
由 joanna.wozna.intel 提交于
-
由 Guanghua Yu 提交于
* Enhance multiclass_nms to support LoD for dygraph mode * fix some error in multiclass_nms * update GetLodFromRoisNum to GetNmsLodFromRoisNum
-
- 28 10月, 2020 5 次提交
-
-
由 Leo Chen 提交于
-
由 Tao Luo 提交于
-
由 Jacek Czaja 提交于
test=develop
-
由 Jacek Czaja 提交于
-
由 wangxinxin08 提交于
* update matrix nms op to api 2.0 * modify code according to review
-
- 27 10月, 2020 4 次提交
-
-
由 Leo Chen 提交于
-
由 wangchaochaohu 提交于
-
由 wangchaochaohu 提交于
-
由 Zhang Ting 提交于
* add fuse_bn_add_act pass
-
- 23 10月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add compile limit for paddle enforce * polish elementwise_op_function.cu.h * fix failed unittest * fix windows compile failed * detail polish * revert no type constructor
-
- 22 10月, 2020 2 次提交
-
-
由 Double_V 提交于
-
由 Feiyu Chan 提交于
* fix strided_slice_op's GetExpectedKernelType when input tensor is at CUDAPinnedPlace * add unittest for tensors in cuda pinned place * skip test for cuda pinned place on cpu machines
-