- 24 2月, 2021 9 次提交
-
-
由 Thunderbrook 提交于
* push multi node * multi node * MultiThread * remove log * solve bug in 30829
-
由 liu zhengxi 提交于
* add get_cublas_handle() api * update format * add unittests * alter function name
-
由 Pei Yang 提交于
* add group norm plugin * fix compile problems * move concat axis check to trt op teller * add nbDims for scale and bias nv dims * add group norm unit test * fix unittest * add trt version restriction for group norm op teller * fix unittest
-
由 Chen Weihang 提交于
* add new custom op so * fix use new method error * fix test failed
-
由 tangwei12 提交于
* fix entry * fix distributed lookup table fuse case * fix entry bug at first time * move entry from paddle.fluid -> paddle.distributed * fix ut with paddle.enable_static() Co-authored-by: Nmalin10 <malin10@baidu.com>
-
由 Qi Li 提交于
-
由 yaoxuefeng 提交于
-
由 Zhou Wei 提交于
* fix some problem of Windows custom op * fix some problem of Windows custom op * fix some problem of Windows custom op
-
由 Zhou Wei 提交于
-
- 23 2月, 2021 9 次提交
-
-
由 alncat 提交于
* added support for fake_quantize_dequantize_abs_max op in quantization inference pass * remove const_cast to pass ci * remove compare operator to pass ci-coverage * added detailed error message for unregistered tensorrt_subgrah_pass
-
由 Jacek Czaja 提交于
-
由 Guanghua Yu 提交于
-
由 Qi Li 提交于
-
由 wangchaochaohu 提交于
* fix windows for optimization of elementwise_add Op
-
由 joanna.wozna.intel 提交于
* Unification of bfloat16 enablement process and refactor * Remove unnecessary function * Standardize the output name search
-
由 Zhong Hui 提交于
[BUG FIX] Fix softmax cross entropy overflow problem.
-
由 Zhou Wei 提交于
-
由 Qi Li 提交于
-
- 22 2月, 2021 8 次提交
-
-
由 Thunderbrook 提交于
* save multi table one path * format
-
由 Qi Li 提交于
-
由 Qi Li 提交于
-
由 Qi Li 提交于
-
由 Shang Zhizhou 提交于
* update trt int8 calibrator to IEntropyCalibratorV2 * add delele opt_cache for trt_split_converter_test
-
由 Zhou Wei 提交于
* [2.0.1]Support New Custom OP on windows * fix CI * fix code style * fix CI * fix CI * fix coverage * fix CI * fix CI
-
由 Qi Li 提交于
* [ROCM] update fluid imperative for rocm (part1), test=develop * [ROCM] update reducer.cc after merge, test=develop * update reducer cmake after merge, test=develop
-
由 JamesLim 提交于
-
- 20 2月, 2021 8 次提交
-
-
由 Chengmo 提交于
* remove pe special profiler * add profiler info
-
由 Chen Weihang 提交于
* add more dispatch marco * add more dispatch marco * add more tests * revert unneeded change * add timeout for test dispatch * add float and complex test * remove and marco
-
由 TTerror 提交于
add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056) * add squeeze_op/unsqueeze_op on kunlun; fix conv op and parallel executor on kunlun; optimize lookup_table op on kunlun * update squeeze/unsqueeze op
-
由 123malin 提交于
* test=develop, save/load, shrink Co-authored-by: NseiriosPlus <tangwei12@baidu.com>
-
由 Jiabin Yang 提交于
-
由 Wilber 提交于
* update paddle_fluid.so to paddle_inference.so
-
由 liym27 提交于
* [static setitem] support the index step > 1. tensor_a[::3] = value * [static setitem] support the index step < 0. Eg: tensor_a[::-3] = value * [static setitem] support the index is Tensor. eg: tensor_a[tensor_3:0:-1] = value * Add op version.
-
由 Qi Li 提交于
-
- 19 2月, 2021 6 次提交
-
-
由 Jacek Czaja 提交于
* - added Reshape grad bf16 * - Added reshape grad bf16 * - cosmetics in py
-
由 Wojciech Uss 提交于
* Modify relu native implementation * fix GPU performance
-
由 ShenLiang 提交于
-
由 Wilber 提交于
-
由 Wilber 提交于
-
由 Wilber 提交于
-