- 27 Nov, 2020 2 commits
-
-
Committed by Jack Zhou
Add eigen gru and fix the dropout bug in the rnn
-
Committed by Shang Zhizhou
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake * compile with cuda9 * add some unittest * notest;test=coverage * add unittest for trt plugin swish && split * update ernie unittest * fix some error message * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter * fix compile error when CUDA_ARCH_NAME < Pascal * fix compile error * update unittest timeout * compile with cuda9 * update error msg * fix code style * add some comments * add define IF_CUDA_ARCH_SUPPORT_FP16 * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
-
- 25 Nov, 2020 1 commit
-
-
Committed by wawltor
remove eigen threadpool for the speed up
-
- 20 Nov, 2020 2 commits
-
-
Committed by Jack Zhou
* add lstm, simple rnn op kernel * fix the test_lstm for the rnn op * change func name * fix forward postprocess bug * add gru forward, backward code * remove unittest.skipIf; use a big rnn op instead of combination op * fix input doesn't have gradient bug * add eigen lstm forward, backward Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
-
Committed by QingshuChen
* adjust kunlun header file * test=kunlun * update kunlun unittest * test=kunlun * update xpu unittest * test=kunlun * update xpu unittest * test=kunlun * update xpu unittest * test=kunlun
-
- 08 Nov, 2020 1 commit
-
-
Committed by YUNSHEN XIE
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix * test for limiting ut exec time as 15S * fix an error caused by cannot find ut * fix some error * can not find test_transformer * fix error caused by ut not run in windows * fix error caused by Compiler Options * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt * setting timeout value to 120s for old ut * add the timeout value setting * fix error caused by ut only run in coverage_ci * add analyzer_transformer_profile_tester * fix some error * fix some error * fix error with inference option * fix error with inference option setting as ON_INFER * add some ut to set timeout * modified some option * fix error * fix some timeout error * fix error * fix error * fix timeout for test_analyzer_bfloat16_resnet50 * fix error * setting timeout property for some ut * first pr for new ut timeout as 15S
-
- 03 Nov, 2020 2 commits
- 21 Oct, 2020 1 commit
-
-
Committed by Double_V
-
- 20 Oct, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 14 Oct, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 10 Oct, 2020 1 commit
-
-
Committed by Double_V
* fix pool3d bug, test=develop * fix unittest, test=develop * fix test and fix pool2d bug, test=develop
-
- 27 Sep, 2020 1 commit
-
-
Committed by Li Fuchen
* add float64 input to ctc_loss * modified error message of warpctc * update repo and tag of warpctc * add test for warpctc with float64 input * modified warpctc.cmake to make sure build always * resolved sample code bug of warpctc * add core.ops in warpctc dygraph * fix a bug of test
-
- 26 Sep, 2020 1 commit
-
-
Committed by Zhong Hui
fix cpplint error for the atomic max/min
-
- 25 Sep, 2020 1 commit
-
-
Committed by ShenLiang
* add fp16 for matmul
-
- 24 Sep, 2020 2 commits
-
-
Committed by wanghuancoder
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
Committed by Zhong Hui
Add GPU Kernels of Segment Ops, support sum, max, min, mean
-
- 22 Sep, 2020 1 commit
-
-
Committed by Zhong Hui
Add the cpu version of segment sum mean max min op
-
- 18 Sep, 2020 1 commit
-
-
- 17 Sep, 2020 1 commit
-
-
Committed by Jack Zhou
enhance reduce op which can reduce tensor with arbitrary rank
-
- 16 Sep, 2020 1 commit
-
-
Committed by Jack Zhou
Optimize error descriptions for the math dir
-
- 14 Sep, 2020 2 commits
- 10 Sep, 2020 1 commit
-
-
Committed by Steffy-zxf
update error info for selected_rows_functor
-
- 09 Sep, 2020 2 commits
-
-
Committed by wangchaochaohu
-
Committed by kinghuin
optimize the error message for math dir
-
- 08 Sep, 2020 1 commit
-
-
Committed by kinghuin
fix the error message for the unpooling.cc
-
- 03 Sep, 2020 1 commit
-
-
Committed by joanna.wozna.intel
-
- 28 Aug, 2020 1 commit
-
-
Committed by Leo Chen
* refine manual seed * fix ci problem * fix unittests * fix unittest * set is_init_py=false in manual_seed * fix unittest * fix bernoulli_op * fix(unittest): change random_seed to manual_seed * 🐞 fix(unittest): fix manual_seed * trigger ci * fix test_sentiment * fix test_imperative_save_load * fix test_uniform_random_op * fix test_uniform_random_op * fix test_jit_save_load * merge develop * fix manual_seed * fix manual_seed * use global engine * use shared_ptr * fix double free * fix bug * fix bug * fix bug * fix test bug * fix test bug * fix test bug * fix ci
-
- 26 Aug, 2020 1 commit
-
-
Committed by Bai Yifan
-
- 24 Aug, 2020 1 commit
-
-
Committed by yaoxuefeng
-
- 22 Aug, 2020 1 commit
-
-
Committed by ShenLiang
* add matmul_v2
-
- 21 Aug, 2020 1 commit
-
-
Committed by QingshuChen
* support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 05 Aug, 2020 1 commit
-
-
Committed by Pei Yang
* develop dynamic shape serialization * add test param for gelu * fix bugs * delete redundant comments * debug * fix conflict. test=develop * fix bug. test=develop * add trt dynamic shape serialized support * fix ernie serialized bug test=develop * fix codestyle test=develop * fix bug test=develop * fix bug.test=develop * modify cmakelist test=develop * fix bug test=develop * fix error message. test=develop * fix trt register plugin based on pr#25003 * add trt dynload * fix deserialization bug of not finding plugin registration * refine code style * recover engine key in tensorrt_subgraph_pass * for ci coverage * add unittest for deserialization Co-authored-by: haozech <chenhaoze94@gmail.com>
-
- 03 Aug, 2020 1 commit
-
-
Committed by Zhang Ting
-
- 31 Jul, 2020 1 commit
-
-
Committed by ShenLiang
* fix inverse bug, test=develop * fix the unittest, test=develop * add singular checking, test=develop * fix the unittest, test=develop * use memory::copy, test=develop * fix boost_get, test=develop * fix position, test=develop
-
- 23 Jul, 2020 1 commit
-
-
Committed by joanna.wozna.intel
test=develop
-
- 21 Jul, 2020 1 commit
-
-
Committed by Zhang Ting
-
- 11 Jul, 2020 1 commit
-
-
Committed by Chen Weihang
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 10 Jul, 2020 1 commit
-
-
Committed by zlsh80826
* add explicit specialization * add skiplayernorm vector load if available * test=develop
-