- 03 6月, 2021 1 次提交
-
-
由 王明冬 提交于
-
- 02 6月, 2021 11 次提交
-
-
由 limingshu 提交于
-
由 Qi Li 提交于
-
由 wuhuanzhou 提交于
-
由 Pei Yang 提交于
-
由 Zhang Zheng 提交于
-
由 chentianyu03 提交于
-
由 wuhuanzhou 提交于
-
由 wuhuanzhou 提交于
-
由 Qi Li 提交于
-
由 wuhuanzhou 提交于
-
由 limingshu 提交于
-
- 01 6月, 2021 4 次提交
-
-
由 Wangzheee 提交于
* add trt convert op: reshape * reshape * fix * input: Shape,ShapeTensor * Add some comments
-
由 chentianyu03 提交于
-
由 Wangzheee 提交于
* download_verify * checksum
-
由 chentianyu03 提交于
* replace and remove complex64/128 types in custom OP and other files * fix custom_tensor_test fail bug * fix custom_conj_test fail bug * fix dispatch_test_op build fail bug
-
- 31 5月, 2021 6 次提交
-
-
由 wangguanzhong 提交于
* enhance error message for conv * fix ci coverage
-
由 whs 提交于
-
由 XiangGao 提交于
* improve group norm cpu precision and performance * add unit test to group norm
-
由 Leo Chen 提交于
-
由 cc 提交于
-
由 wuhuanzhou 提交于
-
- 28 5月, 2021 4 次提交
-
-
由 wenbin 提交于
* add more conditions * dynamic shape * ut * correct contidions * commnent * remove rebandadnt op type * remove rebandant if
-
由 chentianyu03 提交于
* modify to complex template types for fill_constant op * modify to complex template types for py_layer, strided_slice and reduce_sum_op.part
-
由 levi131 提交于
* run pre-commit * use HOST or DEVICE instead of HOSTDEVICE in implementation of lgamma op * add test for fp32 * add lgamma to op_threshold_white_list * add cuda kernel for lgamma kernel * modify numeric grad delta * fix small English issue * change LaunchElementwiseCudaKernel to LaunchSameDimsElementwiseCudaKernel
-
由 chentianyu03 提交于
-
- 27 5月, 2021 4 次提交
-
-
由 chentianyu03 提交于
* modify kron OP to complex template types * modify reshape, slice, trace, transpose OPs to complex template types * modify to complex template types in eigen slice files * change to complex template types for pad.cc and pac.cu * format code style
-
由 Jacek Czaja 提交于
-
由 Thunderbrook 提交于
* support ssd in PsCore * remove log * remove bz2 * defalut value * code style * parse table class * code style * add define
-
由 Zhou Wei 提交于
* Unify all external API error message mechanism and enhance third-party API error msg * fix some comment * fix some comment
-
- 26 5月, 2021 9 次提交
-
-
由 tangwei12 提交于
Change-Id: I2e82dfcee6a1d0512b94cebc32281123fa5bf597 * pretty print for datafeed error Change-Id: I056a8b6f03608e96679a83846c97aed289cef7e6 * fix fleet dist infer ut
-
由 Leo Chen 提交于
-
由 chentianyu03 提交于
* modify matmul Op to complex template types * remove complex64/128 head file
-
由 Leo Chen 提交于
* refine ~npuOpRunner * implement destructor and forbid copy * use reference to avoid copy * use const reference * relax adam precision * fix top_k
-
由 wuhuanzhou 提交于
* optimize OP's compilation time, test=develop * add more op and run ci test, test=develop * CUDA Kernel register in cc file, test=develop * fix macros, test=develop * fix undefined symbol error, test=develop * fix compilation error and undefined symbol, test=develop * fix compilation error on Windows, test=develop * fix compilation error on Windows, test=develop
-
由 Yuang Liu 提交于
-
由 Zhanlue Yang 提交于
Sigmoid: Out = Sigmoid(X) SigmoidGrad: DX = DOut*(1-Out)*Out [This Patch] Out DOut -> SigmoidGradGrad -> DOutNew DDX DDOut DDOut = (1-Out)*Out*DDX DOutNew = (1-2*Out)*DOut*DDX
-
由 jakpiase 提交于
* added op cast functionality for fp32/bf16 * added newline * added entries in static mode white list and unity build * fixed failing tests * changes after review * added formatting * upgraded tests file as reviewer suggested * changes after review * minor change
-
由 wuhuanzhou 提交于
-
- 25 5月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* modify complex template for elementwise ops * modify mul, div grad struct * add complex template for CudaShuffleDownSync CudaShuffleXorSync funcs and fix the bug when delete cuda<9000 * fix shuffle func args bug * fix shuffle func args bug * fix shuffle func args bug
-