- 31 5月, 2021 2 次提交
-
-
由 cc 提交于
-
由 wuhuanzhou 提交于
-
- 28 5月, 2021 3 次提交
-
-
由 chentianyu03 提交于
* modify to complex template types for fill_constant op * modify to complex template types for py_layer, strided_slice and reduce_sum_op.part
-
由 levi131 提交于
* run pre-commit * use HOST or DEVICE instead of HOSTDEVICE in implementation of lgamma op * add test for fp32 * add lgamma to op_threshold_white_list * add cuda kernel for lgamma kernel * modify numeric grad delta * fix small English issue * change LaunchElementwiseCudaKernel to LaunchSameDimsElementwiseCudaKernel
-
由 chentianyu03 提交于
-
- 27 5月, 2021 3 次提交
-
-
由 chentianyu03 提交于
* modify kron OP to complex template types * modify reshape, slice, trace, transpose OPs to complex template types * modify to complex template types in eigen slice files * change to complex template types for pad.cc and pac.cu * format code style
-
由 Jacek Czaja 提交于
-
由 Thunderbrook 提交于
* support ssd in PsCore * remove log * remove bz2 * defalut value * code style * parse table class * code style * add define
-
- 26 5月, 2021 7 次提交
-
-
由 Leo Chen 提交于
-
由 chentianyu03 提交于
* modify matmul Op to complex template types * remove complex64/128 head file
-
由 Leo Chen 提交于
* refine ~npuOpRunner * implement destructor and forbid copy * use reference to avoid copy * use const reference * relax adam precision * fix top_k
-
由 wuhuanzhou 提交于
* optimize OP's compilation time, test=develop * add more op and run ci test, test=develop * CUDA Kernel register in cc file, test=develop * fix macros, test=develop * fix undefined symbol error, test=develop * fix compilation error and undefined symbol, test=develop * fix compilation error on Windows, test=develop * fix compilation error on Windows, test=develop
-
由 Yuang Liu 提交于
-
由 Zhanlue Yang 提交于
Sigmoid: Out = Sigmoid(X) SigmoidGrad: DX = DOut*(1-Out)*Out [This Patch] Out DOut -> SigmoidGradGrad -> DOutNew DDX DDOut DDOut = (1-Out)*Out*DDX DOutNew = (1-2*Out)*DOut*DDX
-
由 jakpiase 提交于
* added op cast functionality for fp32/bf16 * added newline * added entries in static mode white list and unity build * fixed failing tests * changes after review * added formatting * upgraded tests file as reviewer suggested * changes after review * minor change
-
- 25 5月, 2021 5 次提交
-
-
由 chentianyu03 提交于
* modify complex template for elementwise ops * modify mul, div grad struct * add complex template for CudaShuffleDownSync CudaShuffleXorSync funcs and fix the bug when delete cuda<9000 * fix shuffle func args bug * fix shuffle func args bug * fix shuffle func args bug
-
由 石晓伟 提交于
* add the op def proto, test=develop * add while.pbtxt
-
由 chentianyu03 提交于
* modify conj, real, imag OP to complex template * replace with complex template to dot Op * replace with complex template to Abs Op * add support for complex64 and complex128
-
由 jakpiase 提交于
-
由 niuliling123 提交于
-
- 24 5月, 2021 1 次提交
-
-
由 limingshu 提交于
-
- 22 5月, 2021 1 次提交
-
-
由 jakpiase 提交于
* added support for most matmul cases * added more functionality * full functionality of matmul op, fp32 only * added bf16 tests and functionality * added formatting * changes after review * minor change * added reviewers suggestions
-
- 21 5月, 2021 3 次提交
-
-
由 chentianyu03 提交于
* replace complex in set tensor from and to numpy * replace complex template in cast op
-
由 Feng Xing 提交于
* optimize softmax with cross entropy hard label * label ignore_index cleaning
-
由 pangyoki 提交于
* cast indices and label if their type is not consistent * fix bug * add unittest
-
- 20 5月, 2021 4 次提交
-
-
由 TTerror 提交于
* fix gather op and add logsumexp op on kunlun * update xpu depence * update tests and fix elementwise_add
-
由 Baibaifan 提交于
-
由 chentianyu03 提交于
* add complex template file * add numtraits for complex template * add complex template type register * modify specify template of complex * modify specify template of complex * modify specify template of complex * modify specify template of complex * make TensorCheckerVisitor support complex type * fix operator= error * add complex template * add complex template type * add complex template type to pyarray transform * add complex template type to pyarray transform * remove complex type for dlpack register * set dlpack supprot complex type * set dlpack supprot complex type * set dlpack supprot complex type * remove explict for complex constructor * add complex unit test file
-
由 limingshu 提交于
-
- 19 5月, 2021 2 次提交
-
-
由 zhulei 提交于
* [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op & simple_rnn_op * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & logsumexp
-
由 Jacek Czaja 提交于
-
- 18 5月, 2021 4 次提交
-
-
由 pangyoki 提交于
* Output indices of top_k npu op change to int64 * fix accuracy npu bug * fix errors * change cast method to FillNpuTensorWithConstant * change cast method to FillNpuTensorWithConstant
-
由 liuyuhui 提交于
-
由 wuhuanzhou 提交于
-
由 wawltor 提交于
* fix the paddle compare op for the broadcast * fix compare op in for in the cuda device
-
- 14 5月, 2021 4 次提交
-
-
由 Kqnonrime 提交于
* fix two error message * fix two error message * fix error * fix error * fix error * fix error * fix some error message * fix some error * fix error * fix some error * fix some error * fix some error * fix one error * fix some error * fix seven error message * fix error * fix error * fix error * fix error * fix some error message * fix error * fix some error * fix some error * fix four error message * fix error * fix error
-
由 Jacek Czaja 提交于
-
由 Baibaifan 提交于
-
由 limingshu 提交于
-
- 13 5月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* add use_global_beta_pow * add use_global_beta_pow * update npu kernel * update python api * refine code * add ut for use_global_beta_pow * fix npu kernel * add ut for api * add ut for exception * add ut for save/load
-