- 12 1月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* complex gradient matmul (#29966) * dot op support complex types * matmul support complex types * add test case * matmul broadcast gradient support complex * move conjFunctor to complex_functor.h * change the kron gradient when complex types (#29995) * type promotion for grad (#30177) * type promotion for grad * add type promotion for div op
-
- 07 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 29 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* [Complex] Add support for complex grad accumulated (#29889) * add support for complex grad accumulated * add unittest for coverage * update test dtype * remove useless blank line * [Complex] Handle complex to real after type promotion (#29855) * try to add fwd op input dtypes * refactor base impl * return tmp_ins after dygraph prepare data * fix typo found in debug * polish comment & add complex net test * revert detail change * fix unittest failed * add complex kernel condition control * fix xpu test failed & polish comment * polish details by review comments * Complex op test (#29753) * delete no need to calculate inputs in dygraph op_test * delete no need to calculate inputs in dygraph op_test * change grad elementwise_mul for complex types (#29757) * add conj op for complex types * add conj for complex types * add more test case * add conj_op test * modify conj api and impl * add complex type for fill_constant_op xpu * add setConstant for complex type * remove complex conj test file * user define grad for test_conj_op * add test case for static mode of conj api * modify conj doc * change input args name to x * remove useless codes * conj support real types * add conj test case for real number * delete no need to calculate inputs in dygraph op_test * delete no need to calculate inputs in dygraph op_test * modify grad of mul for complex types * fix the grads of inputs args order not match bug * change the grad of div when complex types (#29804) * change the grad of div when complex types * fix the grads of inputs args order not match bug Co-authored-by: Nchentianyu03 <chentianyu03@baidu.com>
-
- 18 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* Add complex dtype op (add) test example (#29603) * add op test case for complex * polish code details * add xpu set constant support * fix argument rror * remove useless pyc file * [Complex] Add real & imag op and api for complex tensor (#29672) * add complex real op & api & unittest * add imag op & api & unittest * refactor op impl * revert simplify writing due to complile failed * polish details * polish grad op code * add conj op for complex types (#29527) * add conj op for complex types * add conj for complex types * add more test case * add conj_op test * modify conj api and impl * add complex type for fill_constant_op xpu * add setConstant for complex type * remove complex conj test file * user define grad for test_conj_op * add test case for static mode of conj api * modify conj doc * change input args name to x * remove useless codes * conj support real types * add conj test case for real number Co-authored-by: Nchentianyu03 <chentianyu03@baidu.com>
-
- 01 12月, 2020 1 次提交
-
-
由 chentianyu03 提交于
* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types * add test cases for complex elementwise, matmul and getitem unittest * add test cases for complex types * add test cases for complex matmul unittest
-
- 30 11月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
fix gru gcc7.4 bug for the gru compile
-
- 27 11月, 2020 2 次提交
-
-
由 Jack Zhou 提交于
Add eigen gru and fix the dropout bug in the rnn
-
由 Shang Zhizhou 提交于
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake * comile with cuda9 * add some unittest * notest;test=coverage * add unittest for trt plugin swish && split * update ernie unittest * fix some error message * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter * fix comile errror when CUDA_ARCH_NAME < Pascal" * fix comile error * update unittest timeout * compile with cuda9 * update error msg * fix code style * add some comments * add define IF_CUDA_ARCH_SUPPORT_FP16 * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
-
- 25 11月, 2020 1 次提交
-
-
由 wawltor 提交于
remove eigen threadpool for the speed up
-
- 20 11月, 2020 2 次提交
-
-
由 Jack Zhou 提交于
* add lstm, simple rnn op kernel * fix the test_lstm for the rnn op * change func name * fix forward postprocess bug * add gru forward, backward code * remove unittest.skipIf; use a big rnn op instead of combination op * fix input doesn't have gradient bug * add eigen lstm forward, backward Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com>
-
由 QingshuChen 提交于
* adjust kunlun header file *test=kunlun * update kunlun unittest *test=kunlun * update xpu unitest * test = kunlun * update xpu unittest * test=kunlun * update xpu unitest * test=kunlun
-
- 08 11月, 2020 1 次提交
-
-
由 YUNSHEN XIE 提交于
* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix * test for limiting ut exec time as 15S * fix an error caused by cannot find ut * fix some error * can not find test_transformer * fix error caused by ut not run in windows * fix error caused by Compiler Options * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt * setting timeout value to 120s for old ut * add the timeout value setting * fix error caused by ut only run in coverage_ci * add analyzer_transformer_profile_tester * fix some error * fix some error * fix error with inference option * fix error with inference option setting as ON_INFER * add some ut to set timeout * modified some option * fix error * fix some timeout error * fix error * fix error * fix timeout for test_analyzer_bfloat16_resnet50 * fix error * setting timeout properity for some ut * first pr for new ut timeout as 15S
-
- 03 11月, 2020 2 次提交
- 21 10月, 2020 1 次提交
-
-
由 Double_V 提交于
-
- 20 10月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 14 10月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 10 10月, 2020 1 次提交
-
-
由 Double_V 提交于
* fix pool3d bug, test=develop * fix unitest, test=develop * fix test and fix pool2d bug, test=develop
-
- 27 9月, 2020 1 次提交
-
-
由 Li Fuchen 提交于
* add float64 input to ctc_loss * modified error message of warpctc * update repo and tag of warpctc * add test for warpctc with float64 input * modified warpctc.cmake to make sure build always * resolved sample code bug of warpctc * add core.ops in warpctc dygraph * fix a bug of test
-
- 26 9月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
fix cpplint error for the autmic max/min
-
- 25 9月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add fp16 for matmul
-
- 24 9月, 2020 2 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
由 Zhong Hui 提交于
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
-
- 22 9月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
Add the cpu version of segment sum mean max min op
-
- 18 9月, 2020 1 次提交
-
-
- 17 9月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
enhance reduce op which can reduce tensor with arbitrary rank
-
- 16 9月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
Error description optimize for the math dir
-
- 14 9月, 2020 2 次提交
- 10 9月, 2020 1 次提交
-
-
由 Steffy-zxf 提交于
update error info for selected_rows_functor
-
- 09 9月, 2020 2 次提交
-
-
由 wangchaochaohu 提交于
-
由 kinghuin 提交于
optimize the error message for math dir
-
- 08 9月, 2020 1 次提交
-
-
由 kinghuin 提交于
fix the error message for the unpooling.cc
-
- 03 9月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 28 8月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* refine manual seed * fix ci problem * fix unittests * fix unittest * set is_init_py=false in manual_seed * fix unittest * fix bernoulli_op * fix(unittest): change random_seed to manual_seed *
🐞 fix(unittest): fix manual_seed * trigger ci * fix test_sentiment * fix test_imperative_save_load * fix test_uniform_random_op * fix test_uniform_random_op * fix test_jit_save_load * merge develop * fix manual_seed * fix manual_seed * use global engine * use shared_ptr * fix double free * fix bug * fix bug * fix bug * fix test bug * fix test bug * fix test bug * fix ci
-
- 26 8月, 2020 1 次提交
-
-
由 Bai Yifan 提交于
-
- 24 8月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 22 8月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add matmul_v2
-
- 21 8月, 2020 1 次提交
-
-
由 QingshuChen 提交于
* support Baidu AI Accelerator * test=kunlun * minor * test=kunlun * support xpu op in separate file * test=kunlun * update XPU error message and remove duplicated code * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 05 8月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* develop dynamic shape serilization * add test param for gelu * fix bugs * delete redundant comments * debug * fix conflict. test=develop * fix bug. test=develop * add trt dynamic shape serialized support * fix ernie serialized bug test=develop * fix codestyle test=develop * fix bug test=develop * fix bug.test=develop * modify cmakelist test=develop * fix bug test=develop * fix error message. test=develop * fix trt register plugin based on pr#25003 * add trt dynload * fix deserialization bug of not finding plugin registration * refine code style * recover engine key in tensorrt_subgraph_pass * for ci coverage * add unittest for deserialization Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
-