- 03 Oct, 2022 2 commits
-
-
Committed by jakpiase
* added transpose * CI fix * fix for transpose * fix after review
-
Committed by Jacek Czaja
* - some more MD changes * - lint * - compilation fixes * - compilation fixes * - lint * - fix
-
- 30 Sep, 2022 12 commits
-
-
Committed by Wen Sun
-
Committed by Ruibiao Chen
-
Committed by HongyuJia
-
Committed by HongyuJia
-
Committed by Allen Guo
* paddle-inference support custom-ops Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> * fix tolower Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai>
-
Committed by Chenxiao Niu
-
Committed by 光明和真理
-
Committed by ykkk2333
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun * migrate add_n kernel to phi, test=kunlun * fix bugs of tipc, test=kunlun
-
Committed by HongyuJia
* opt GetExpectedKernelType code of conv_transpose_op * fix if error
-
Committed by HongyuJia
-
Committed by HongyuJia
-
Committed by sneaxiy
* support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * add bfloat16 to selu_grad to pass CI * fix selu grad compilation error
-
- 29 Sep, 2022 7 commits
-
-
Committed by Xinger
-
Committed by zmxdream
* add afs OpenWriter * update
-
Committed by Leo Guo
Add index_select, index_select_grad, reduce_min kernel and their unittests for kunlun. Add registers of index_select, index_select_grad, reduce_min, sqrt, sqrt_grad to xpu2_op_list.test=kunlun. (#46557)
-
Committed by carryyu
* fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40.
-
Committed by yeliang2258
* remove calibration file path * remove useless code
-
Committed by 光明和真理
Co-authored-by: liupeiyu <liupeiyu@cambricon.com>
-
Committed by Weilong Wu
[Eager, Performance optimization] support mod / matmul ( % and @ operator) to sink to Cpp layer (#46565) * [Eager, Performance optimization] support mod ( % operator) to sink to Cpp layer * fix mod logic * support matmul math operator * rm LOG(warning), use VLOG(6) * fix conflicts mistake
-
- 28 Sep, 2022 12 commits
-
-
Committed by sneaxiy
-
Committed by Chen Weihang
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
Committed by Ruibiao Chen
* Convert GradMergeAllReduceOpHandle in GraphToBlock * Set FLAGS_CONVERT_GRAPH_TO_PROGRAM to False
-
Committed by Weilong Wu
[Eager, Performance optimization] support less_than & less_equal( < & <= operator) to sink to Cpp layer (#46542)
-
Committed by zmxdream
-
Committed by Leo Chen
-
Committed by Jacek Czaja
* added fc int8 tests * CI fix * added skipping UTs for GPUs * fixes for CI * added support for residual connections inside fc * fix for quant int8 bias * - lint Co-authored-by: jakpiase <jakpia21@gmail.com>
-
Committed by limingshu
-
Committed by Sławomir Siwek
* Relu6 * remove fluid handler * add individual kernel signature * coding style * replace bounded_relu with clip * whitespace * code style
-
Committed by kangguangli
* add gpu kernel for transfer layout * comment error throw * fix: flag setting in testcase; add condition check for raising error * fix typo * fix: add error type for PADDLE_THROW * remove kernel fallback in data_transfer.cc * remove useless variable definition
-
Committed by Weilong Wu
-
Committed by zyfncg
* fix clip_extra code in remove_training_info * revert rnn opmaker clear
-
- 27 Sep, 2022 7 commits
-
-
Committed by Chenxiao Niu
-
Committed by Weilong Wu
* [Eager] math op sink to Cpp level * fix ci errors * draft version * support + and - operator under cpp directly * add static test * polish code * promote types or unify right type to left * recover static test case * polish code and fix some ci errors * support complex and polish code * fix conflicts * fix windows ci errors * fix windows-inference-ci errors * polish and fix tests * fix test case * polish code * [Eager, Performance optimization] support multiply( * operator) to sink to Cpp layer * rm useless glog * [Eager, Performance optimization] support divide( / and // operator) to sink to Cpp layer * polish code * polish code and fix code-format * polish code * fix CI * polish code * update test * support div operator under cpp * fix scalar as input * Polish div logic, fix ci test * fix errors
-
Committed by levi131
* init dropout * small format fix * fix pr comments * add value test
-
Committed by ceci3
-
Committed by wenbin
* half2 * add epsilon
-
Committed by Wangzheee
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3)
-
Committed by Ruibiao Chen
-