- 24 9月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
-
- 22 9月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
Add the cpu version of segment sum mean max min op
-
- 18 9月, 2020 1 次提交
-
-
- 17 9月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
enhance reduce op which can reduce tensor with arbitrary rank
-
- 16 9月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
Error description optimize for the math dir
-
- 14 9月, 2020 2 次提交
- 10 9月, 2020 1 次提交
-
-
由 Steffy-zxf 提交于
update error info for selected_rows_functor
-
- 09 9月, 2020 2 次提交
-
-
由 wangchaochaohu 提交于
-
由 kinghuin 提交于
optimize the error message for math dir
-
- 08 9月, 2020 1 次提交
-
-
由 kinghuin 提交于
fix the error message for the unpooling.cc
-
- 03 9月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 28 8月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* refine manual seed * fix ci problem * fix unittests * fix unittest * set is_init_py=false in manual_seed * fix unittest * fix bernoulli_op * fix(unittest): change random_seed to manual_seed *
🐞 fix(unittest): fix manual_seed * trigger ci * fix test_sentiment * fix test_imperative_save_load * fix test_uniform_random_op * fix test_uniform_random_op * fix test_jit_save_load * merge develop * fix manual_seed * fix manual_seed * use global engine * use shared_ptr * fix double free * fix bug * fix bug * fix bug * fix test bug * fix test bug * fix test bug * fix ci
-
- 26 8月, 2020 1 次提交
-
-
由 Bai Yifan 提交于
-
- 24 8月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 22 8月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add matmul_v2
-
- 21 8月, 2020 1 次提交
-
-
由 QingshuChen 提交于
* support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 05 8月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* develop dynamic shape serilization * add test param for gelu * fix bugs * delete redundant comments * debug * fix conflict. test=develop * fix bug. test=develop * add trt dynamic shape serialized support * fix ernie serialized bug test=develop * fix codestyle test=develop * fix bug test=develop * fix bug.test=develop * modify cmakelist test=develop * fix bug test=develop * fix error message. test=develop * fix trt register plugin based on pr#25003 * add trt dynload * fix deserialization bug of not finding plugin registration * refine code style * recover engine key in tensorrt_subgraph_pass * for ci coverage * add unittest for deserialization Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
-
- 03 8月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-
- 31 7月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* fix inverse bug, test=develop * fix the untest, test=develop * add singular checking, test=develop * fix the utest, test=develop * use memory::copy, test=develop * fix bost_get, test=develop * fix position, test=develop
-
- 23 7月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
test=develop
-
- 21 7月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-
- 11 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 10 7月, 2020 1 次提交
-
-
由 zlsh80826 提交于
* add explicit specialization * add skiplayernorm vector load if available * test=develop
-
- 07 7月, 2020 1 次提交
-
-
由 zhupengyang 提交于
-
- 17 6月, 2020 2 次提交
-
-
由 Leo Chen 提交于
* fix bug of prelu when rank not equal 4, test=develop * fix prelu inference, test=develop * fix api, test=develop * fix shape when mode is chennel, test=develop * remove debug code, test=develop * add unittest, test=develop
-
由 zlsh80826 提交于
* blockReduce opt * launch threads align to warpSize * reduce unnecessary shared memory for broadcast reduced value * vectorize SoftmaxKernelWithEltadd * add fp16 constrain * test=develop
-
- 12 6月, 2020 1 次提交
-
-
由 ceci3 提交于
-
- 03 6月, 2020 2 次提交
-
-
由 Chen Weihang 提交于
* remove REPLACE_ENFORCE_GLOG compile option & add ci rule prohibit LOG(FATAL) using, test=develop * remove ci test case, test=develop * replace all LOG(FATAL) & polish message, test=develop * fix typo, test=develop * polish error info detail, test=develop
-
由 Leo Chen 提交于
* support gradient accumulation of fp16 in imperative mode, test=develop * enhance coverage test, test=develop * follow comments, test=develop
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 27 4月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 24 4月, 2020 2 次提交
-
-
由 Guo Sheng 提交于
* Add cholesky_op forward part. test=develop * Complete cholesky_op forward part. test=develop * Add cholesky_op backward part. test=develop * Complete cholesky_op backward part. test=develop * Refine cholesky_op error check and docs. test=develop * Add grad_check unit test for cholesky_op. test=develop * Fix sample code in cholesky doc. test=develop * Refine some error messages of cholesky_op. test=develop * Refine some error messages of cholesky_op. test=develop * Remove unused input in cholesky_grad. test=develop * Remove unused input in cholesky_grad. test=develop * Fix stream for cusolverDnSetStream. test=develop * Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code. test=develop * Add CUSOLVER ERROR in enforce.h test=develop * Fix the missing return value in cholesky. test=develop
-
由 Zeng Jinle 提交于
-
- 20 4月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop
-
- 17 4月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-
- 10 4月, 2020 1 次提交
-
-
由 littletomatodonkey 提交于
add addmm op
-
- 26 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop
-
- 11 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop
-
- 28 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-