- 14 Mar, 2022 17 commits
-
-
Committed by Tomasz Socha
* Add elementwise add and activation fuse pass * Fix copy elision * More flexible pattern detector * More flexible fusion pass * Update lists for pass * Add support for Pow operator * Add support for more activation types * Style * Rename fusion pass * First version of tests * Dirty version of pass * Polished version * Update pbtxt * Style * Update names * Style * Use PADDLE_ENFORCE_EQ * Save error message to variable * WO for error checks * CR * Static style check * Add missing 'activation_scale' attribute * Add relu6 and sigmoid activations * Style * Fix fuse list formatting * Sync filenames for fuse pass files * Fix cmake after move * Fix registration * Fix pass name in tests * Add missing activations to checker * WIPS * Working mul op * Working sub * Working Add * Remove pten includes * Remove some forward declarations * Remove Includes * Fixes * Remove default kernels * Add check if post_ops attributes are available * Style * Code adjustment * Register default kernels * We have year 2022 not 2021... Co-authored-by: jakpiase <jakpia21@gmail.com> Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com> * Fast review fixes Co-authored-by: jakpiase <jakpia21@gmail.com> Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com> * Review Fix * Rename one_dnn -> onednn * Style after review * Fast and dirty fix for quantization * Update tests * Style * Fix mkldnn_quantizer config * Add Joanna's suggestion. * Check if operator is explicitly disabled on OneDNN * Try to use unregistered attributes * Style * Test new framework * FXI * FXII * Update test * Style Co-authored-by: jakpiase <jakpia21@gmail.com> Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
-
Committed by fwenguang
-
Committed by crystal
* optimize backward * optimize group_norm backward * Add vectorized code * move assignment code * merge function * move code * optimize code * Modify function name
-
Committed by Lijunhui
* bilinear_bw init * optimize code * optimize * optimize 2 * optimize functions * modify func name
-
Committed by Xiaoxu Chen
-
Committed by Jiabin Yang
* eager, test=develop * fix bug, test=develop * eager, test=develop * merge legacy to fluid * eager, test=develop * eager, test=develop * Refactor TensorAdd func by template and remove gradient_accumulation in eager * Remove needless target name * eager, test=develop * eager, test=develop * Use overload instead of template * Remove legacy code * Remove legacy code * selectedrows, test=develop * Remove DataType test * eager, test=develop * eager, test=develop * support gan, test=develop * Using Tensor directly instead of using EagerTensor * support gradient_accumulation * make test_imperative_lod_tensor_to_selected_rows longer * make test_imperative_lod_tensor_to_selected_rows longer * refine code * ptb, test=develop * Rename all EagerTensor to Tensor * Rename some EagerTensor to Tensor * rename EagerTensor to EagerVariable * eager, test=develop * eager, test=develop * eager, test=develop * eager, test=develop * add more test * eager, test=develop * Support copiable selected rows and merge develop * save load, eager, test=develop * save load, eager, test=develop * refine, test=develop * remove useless _set_value method * refine, test=develop * refine, test=develop * revert static_runner, test=develop * EagerTensor to Tensor, test=develop * refine, test=develop * refine, test=develop * clear grad, test=develop * merge, develop * merge, develop * merge, test=develop * merge, test=develop * Support quant and part of slice * support legacy static save * extend slim tests time * remove imperative on inference * remove imperative on inference * merge develop * fix typo * fix typo * split slice related code into 2 parts for imperative and eager * split slice from inference * split slice from inference * fix test_tensor_register_hook * support custom op in eager mode * fix inference deps error * split eager utils from custom operator * fix type match * fix typo Co-authored-by: Wang Huan <wanghuan29@baidu.com> Co-authored-by: Weilong Wu <veyron_wu@163.com> Co-authored-by: wanghuancoder <wanghuancoder@163.com>
-
Committed by Zhang Zheng
* Optimize performance of log_softmax * delete unity build * modify to phi * fix * fixfixfixfix * fix * fix * fix * fix * simplify * fix * fix enforce
-
Committed by 0x45f
-
Committed by Lijunhui
[KP] Add unittests for brelu,ceil,celu,elu,floor,hard_shrink,hard_sigmoid,log1p,logsigmoid,relu6,silu,soft_relu,softsign,swish (#40448) * solve unexecuted UT * add 24 activation op UT * append swish&thresholded_relu to kpfirst_list * rm thresholded_relu
-
Committed by crystal
* migrate matrix_rank to phi * migrate eigh and matrix_rank to phi * fix matrix_rank * optimize code * move matrix_rank to phi * add max functor * migrate matrix_rank to phi * optimize code
-
Committed by zhouweiwei2014
-
Committed by liutiexing
-
Committed by Zhanlue Yang
-
Committed by zmxdream
-
Committed by Zhong Hui
[multiprocessing] Add paddle.incubate.multiprocessing for sharing tensors between Python processes. (#37302) * Add support for paddle.multiprocessing * move multiprocessing to incubate.
-
Committed by From00
* Move Pool OPs to phi * Fix CI error * Fix conflicts
-
Committed by 0x45f
* refine partial_program * fix code for test_mnist.py train * support quantify UT * make __fake_vars and _double_grads to lazy * fix comments
-
- 13 Mar, 2022 2 commits
-
-
Committed by Chen Weihang
-
Committed by zyfncg
* refactor infermeta files * update
-
- 12 Mar, 2022 6 commits
-
-
Committed by Chen Weihang
* rename softmax kernel name * move softmax infershape * fix failed test
-
Committed by Chen Weihang
* move allclose kernel * remove allclose op kernel * fix coverage failed
-
Committed by zyfncg
* move roi_align kernel to phi * fix bug of roi_align xpu
-
Committed by Aganlengzi
* [custom kernel] fix static object de-initialize bug * fix text * fix text * refine log info
-
Committed by jiangcheng
* fix NetBuilder API Name bug in cinn_lib_test * update cinn version to newest
-
Committed by Chen Weihang
* fix eager benchmark test failed * fix test_tracer failed
-
- 11 Mar, 2022 15 commits
-
-
Committed by Tomasz Socha
-
Committed by lilong12
-
Committed by Chen Weihang
-
Committed by Zhanlue Yang
-
Committed by Sylwester Fraczek
-
Committed by From00
* Move psroi_pool OP to phi * Replace platform::TensorCopy with phi::Copy
-
Committed by Chen Weihang
* remove needless deps in unittests * add gpu macro * fix other unittests * fix kernel name error * fix test_prepare_op * fix failed dygraph unittests * fix gpu failed tests * fix cinn test failed * fix cinn test failed * fix dropout tests
-
Committed by Yuang Liu
-
Committed by zhouweiwei2014
-
Committed by zn
-
Committed by Aurelius84
* [Phi] Migrate tile_op into Phi * fix tile_sig * fix include headers * fix using
-
Committed by chentianyu03
* add reduce_sum grad kernel * add reduce_grad * modify reduce grad * update reduce grad functions * fix build error * add argument mapping * move cast input after grad * add dims.size=1 cpu reduce_sum grad compute method * update reduce grad GPU * remove raw reduce_sum_grad kernel * modify header files * add namespace funcs for reduce_grad_functions
-
Committed by Jeffrey Chen
-
Committed by Zhang Zheng
* first commit * fix * fix * fix * fix * fix * fix xpu and npu * fix
-
Committed by wuyefeilin
* mv erf op to phi * fix as review * fix as review * fix format
-