- 02 Mar, 2022 3 commits
-
-
Committed by Yuang Liu
[fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for distributed inference (#39992)
-
Committed by Wangzheee
-
Committed by JingZhuangzhuang
-
- 01 Mar, 2022 2 commits
-
-
Committed by joanna.wozna.intel
* Add mobilenetv3_large performance test * Disable the BF16 test if the device does not support BF16 computations * Change test timeout
-
Committed by wenbin
* remove * pass * more pass
-
- 28 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
-
- 25 Feb, 2022 2 commits
-
-
Committed by YUNSHEN XIE
* disable some distribute test case when in CPU test env * disable some test case when in CPU test env * fix
-
Committed by Chen Weihang
* support cudnn kernel moving * polish cmake rules * add unittest for coverage * remove orig kernel * remove softmax cudnn kernel * fix softmax test failed * fix npu func error * resolve conflict * rename gpu dnn kernels * fix name rule error * fix compile error * update fp16 namespace
-
- 24 Feb, 2022 2 commits
-
-
Committed by Chen Weihang
* rename pten to phi * fix infrt compile failed * resolve conflict
-
Committed by Wangzheee
* fix plugin: special slice for ernie
-
- 23 Feb, 2022 1 commit
-
-
Committed by Allen Guo
* update inference part * restore white space
-
- 22 Feb, 2022 3 commits
-
-
Committed by feng_shuai
* delete gather_ut skip_case * add trt version limit
-
Committed by xiongkun
* change Vector to std::vector and provide MixVector class as a helper wrapper class * solve the multi-gpu hang problem * remove the duplicate template instantialize * Copy vector to cpu * add CopyToCPU * xxx * final version: fix the problem of all reduce * remove mixvector dependence * fix * merge * fix code * fix by CI
-
Committed by Wangzheee
* fix pass and convert_op for preln_ernie and add preln_ernie'flag in pass
-
- 21 Feb, 2022 1 commit
-
-
Committed by chenjian
* fix RecordEvent interface * modify default level to 4 * update interface use * add const default trace level * update record event interface using * update record event interface using * update operator.cc * update part2 * update part1 * fix include profiler.h header in ps server * fix include profiler.h header in ps server * fix profiler.h header
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 19 Feb, 2022 1 commit
-
-
Committed by Aurelius84
* Unify paddle/pten::framework::ddim into pten::ddim * fix paddle namespace * compile sucessfully * fix npu src file * fix conflict * fix conflict * fix tensorrt compiler error * fix conflict * fix conflict * fix tesst file conflict * fix conflict * fix mlu file conflict * fix mlu file conflict * fix cinn header file conflict * fix conflict * fix conflict * fix conflict * fix conflict
-
- 18 Feb, 2022 2 commits
-
-
Committed by Feiyu Chan
* move blas related files * move lapack related files
-
Committed by zlsh80826
-
- 17 Feb, 2022 2 commits
-
-
Committed by Sylwester Fraczek
* fix and add more tests * remove unwanted changes * check only concat and elementwise * move check to a function * add todo comment * Revert "fix ptq fc attr name fuse_activation->activation_type" This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.
-
Committed by wenbin
* first commit * teller fix * bug fix * enable for pool2d only * fix global_pooling issue * pooling_type * fix test
-
- 16 Feb, 2022 2 commits
-
-
Committed by Wangzheee
[Paddle-Inference] support preln-ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op (#39570) * support preln_ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op * support preln_ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op
-
Committed by YuanRisheng
* remove reshape and elementwise_add registry * delete code * fix bugs when run ci ut * remove log * fix bugs when run unit test * fix bugs when run unit test * fix bugs when run cinn * fix bugs when run ci-mac-python3 * fix compile bugs * fix compile bugs * fix compile bugs * fix bugs when run kunlun * fix bugs when compile * update code according comment
-
- 15 Feb, 2022 5 commits
-
-
Committed by Wangzheee
[Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508) * support preln_ernie * support preln_ernie
-
Committed by feng_shuai
-
Committed by Leo Chen
* Replace GeLU plugin with TRT built-in layers for approximate GeLU * Add TensorRT built-in layer for nonapproximate GeLU
-
Committed by feng_shuai
-
Committed by Aurelius84
* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 14 Feb, 2022 1 commit
-
-
Committed by Sylwester Fraczek
* prevent squashing pair u8 dequantize -> s8 quantize * add relu op to check for uint8 * fix ptq fc attr name fuse_activation->activation_type * fix * add unit test * remove unused variable * test fix unsuccessful * fix test and logic * multiline comment * remove cout * Revert "fix ptq fc attr name fuse_activation->activation_type" This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017. * fix ptq fc attr name fuse_activation->activation_type
-
- 11 Feb, 2022 3 commits
-
-
Committed by Leo Chen
-
Committed by JingZhuangzhuang
-
Committed by Wangzheee
* support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved * support ernie quant model with interleaved
-
- 10 Feb, 2022 2 commits
-
-
Committed by chenyanlann
-
Committed by wenbin
* mkldnn conv fix * definetion
-
- 09 Feb, 2022 1 commit
-
-
Committed by Wangzheee
* rebuild matmul pass: trt and gpu_cpu * rebuild matmul pass: trt and gpu_cpu * rebuild matmul pass: trt and gpu_cpu * rebuild matmul pass: trt and gpu_cpu
-
- 06 Feb, 2022 1 commit
-
-
Committed by Wilber
-
- 28 Jan, 2022 1 commit
-
-
Committed by wenbin
* slice * shuffle pass enhancement
-
- 27 Jan, 2022 3 commits
-
-
Committed by Aganlengzi
* [Demo] custom kernel based on pten kernel * merge and npu custom work well * del comments * delete other code * fix CUDAContext * fix not found small_vector.h * support NPU * fix NPUContext * fix DeviceContext support * add UT * fix call * add UT * fix * fix for comments and ut * add MACRO control * fix multi input output * support env CUSTOM_DEVICE_ROOT * deal with special cases * fix for Windows * try coverage with test_custom_kernel_dot.py * fix test_custom_kernel_dot * fix test_custom_kernel_dot * fix merge * fix merge * fix CI * update * merge and fix * remove WITH_CUSTOM_KERNEL * fix merge * merge and fix * fix ut * fix ut for mac * add more UT * add more UT * fix
-
Committed by wenbin
* shuffle channel pass * add ut * timeout fix * makefile fix
-
Committed by 王明冬
-