- 06 2月, 2023 3 次提交
-
-
由 zyfncg 提交于
* remove extra input of conv2d * fix bug * fix unittest bug * adjust conv2d.pbtxt * fix cpu_quantize_pass_tester * revert use_addto of conv2d * fix runtime attribute * fix bug * recover force_fp32_output in conv2d * refine error info * fix bug
-
由 engineer1109 提交于
-
由 jiangcheng 提交于
-
- 03 2月, 2023 6 次提交
-
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
由 RedContritio 提交于
-
由 HappyHeavyRain 提交于
* generate some static graph ops * fix the bug of pow * add REGISTER_ACTIVATION_OP in operators.cmake * modify the file operators.cmake
-
由 Yuang Liu 提交于
-
由 Jiabin Yang 提交于
* optimize log * fix type error * fix type error2
-
由 Jiabin Yang 提交于
* refactor dir for prim * support blacklist for bwd comp * fix type error * remove additional file * fix git ignore * add more test * merge develop
-
- 02 2月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* fix bugs * fix ci bugs
-
- 01 2月, 2023 4 次提交
-
-
由 RedContritio 提交于
* add stride check for MaxPool * add unittests
-
由 Wang Bojun 提交于
* preln_residual 2 fused_bias_residual * skip layernorm fix and ut * code refine * code style refine * fix ut * fix output * add trt layer fall back info * refine op teller and ut * DropoutMaskOut output fix
-
由 zhoutianzi666 提交于
-
由 Aurelius84 提交于
* [PrimCinn]Fix some vars are wrongly gc in CINN+InterpreterCore * fix baseline unittest config * fix code style
-
- 31 1月, 2023 2 次提交
-
-
由 Charles-hit 提交于
* polish static grad op maker gen * fix some bugs * fix static code gen * solve conflict * modify composite grad maker name * integrate phi and fluid info in static code gen * rename some composite maker * modify static code gen format
-
由 TeFeng Chen 提交于
* support inplaced variable in cinn_launch * fix error hint when compiling * fix inplaced output variable of the subgraph * skip CinnCompiler check * using existed definition * fix namespace reference error * modify error message * update cinn tage * fix namespace * skip enforce check * fix unittest attribute throw
-
- 30 1月, 2023 1 次提交
-
-
由 zmxdream 提交于
* add set slot_num for psgpuwraper (#177) * add set slot_num_for_pull_feature for psgpuwarper * Add get_epoch_finish python interface (#182) * add get_epoch_finish interface * add return * delete return * add unzip op (#183) * fix miss key for error dataset (#186) * fix miss key for error dataset * fix miss key for error dataset Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * add excluded_train_pair and infer_node_type (#187) * support return of degree (#188) * fix task stuck in barrier (#189) Co-authored-by: Nyangjunchao <yangjunchao@baidu.com> * check node/feature format when loading (#190) * check node&feature format when loading * check node&feature format when loading (2£ (2) * degrade log (#191) * [PGLBOX]fix conflict * [PGLBOX]fix conflict * [PGLBOX]replace LodTensor with phi::DenseTensor * [PGLBOX]fix gpu_primitives.h include path * [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip example code * [PGLBOX]fix unzip ut * [PGLBOX]fix unzip ut * [PGLBOX]fix code style * [PGLBOX]fix code style * [PGLBOX]fix code style * fix code style * fix code style * fix unzip ut * fix unzip ut * fix unzip ut * fix unzip * fix code stype * add ut * add c++ ut & fix train_mode_ set * fix load into memory * fix c++ ut * fix c++ ut * fix c++ ut * fix c++ ut * fix code style * fix collective * fix unzip_op.cc * fix barrier * fix code style * fix barrier * fix barrier * fix code styple * fix unzip * add unzip.py * add unzip.py * fix unzip.py --------- Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com> Co-authored-by: NSiming Dai <908660116@qq.com> Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com> Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>
-
- 29 1月, 2023 1 次提交
-
-
由 ronnywang 提交于
[CustomDevice] registering feed_dense_tensor, feed_sparse_coo_tensor, feed_strings kernels for custom device (#50042) * [CustomDevice] registering feed_dense_tensor, feed_sparse_coo_tensor, feed_strings kernels for custom device * update * update * update
-
- 19 1月, 2023 1 次提交
-
-
由 Feiyu Chan 提交于
-
- 18 1月, 2023 2 次提交
- 17 1月, 2023 4 次提交
-
-
由 Jiabin Yang 提交于
* add test for composite with dy2st * add more log
-
由 YuanRisheng 提交于
* change feed_op to phi kernel * fix ci bugs * fix build bugs * fix ci bugs * fix compile bugs * fix ci bugs * perfect code * perfect comment code * fix install bugs * modify code according comment * remove visitor in feed_op * modify according comment * perfect code according comment * add infershape * fix py3 bugs * fix getexpected kernel type * fix getexpected kernel type * fix ci bugs * add registry for custom device * fix py3 bugs * fix floating point error * fix py3 test bugs
-
由 WangZhen 提交于
* Support call backward() without params in dy2st
-
由 Xiaoxu Chen 提交于
* support elementwise base func * fix compiling error and add test * support vjp for div using comp * remove additional change * fix dy2st error with magic num * fix dy magic num * another magic * another magic * another magic * add skip rename strategy * support add vjp * support add with new axis cal * support sub vjp * [prim] add multiply vjp rules * [prim] add multiply vjp rules * [prim] fix no infershape with composite in _append_backward_ops * [prim] add expand vjp rule * [prim] add exp vjp rule * uncomment infer shape for reshape/sum static prim api * [prim] fix tanh nullptr error * remove some print message * fix magic number in run_program relative tests @JiaBinYang * [prim] add expand,multiply,exp vjp rules * fix only support single direction reduce error * infer reduce dims using out dims Co-authored-by: NJiabinYang <360788950@qq.com>
-
- 16 1月, 2023 5 次提交
-
-
由 HappyHeavyRain 提交于
* support the 'data_transform' for generating static graph ops * reset 'pow' code * change the 'GetKernelTypeForVar'
-
由 Aurelius84 提交于
* [CINN]Switch cinn GIT_TAG from v0.2 into develop * fix branch name * specify commit * disable unittest * disable unittest
-
由 Jiabin Yang 提交于
This reverts commit 4d5265b8.
-
由 Charles-hit 提交于
* polish static grad op maker gen * fix some bugs * fix static code gen * solve conflict * modify composite grad maker name
-
由 xiaoguoguo626807 提交于
-
- 13 1月, 2023 5 次提交
-
-
由 Zhang Jun 提交于
* update trt engine to set in/out data type * update * Update engine.cc * Update engine.cc * update * set engine output type before freeze the network * update * update trt autoscan ut * update * update ut * fix equal bug, update ut * fix cast and equal ut * update cast ut using TRT < 8.4 * set datatype from scope * check output var is nullptr * Update op_converter.h * update tensorrt_engine_op_test ut * update
-
由 Jiabin Yang 提交于
* support elementwise base func * fix compiling error and add test * remove additional param * support vjp for div using comp * remove additional change * fix dy2st error with magic num * fix dy magic num * another magic * another magic * add more test * fix windows problem * another magic * fix windows compile * invoke ci * add skip rename strategy * support add vjp * fix test_tanh * support add with new axis cal * fix resnet and some test * add composite log * support sub vjp
-
由 jameszhang 提交于
* kunlun add support for c_concat and c_split * replace mutable_data() and ShareDataWith()
-
由 zyfncg 提交于
* generate static graph code of stack, unbind, unique_consecutive op * fix bug
-
由 Yuanle Liu 提交于
-
- 12 1月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* rename kernel * delete sig * modify code according comment * fix ci bugs
-
- 11 1月, 2023 2 次提交
-
-
由 kangguangli 提交于
-
由 duanyanhui 提交于
-
- 10 1月, 2023 2 次提交
-
-
由 Wen Sun 提交于
* refactor: gloo comm context migration * fix: headers & avoid mutable_data usage * fix: cmake gloo dep * style: rename funcs * refactor: move to new files * fix: gloo deps * refactor: simplify create device
-
由 Ryan 提交于
* try sequence_padding * fix cant use mutable_data * fix mistake fluid_sequence_scale.hh/CMakeLists.t include * fix namespace bug * fix framework::ToAbsOffset not found * fix codestyle
-