- 27 1月, 2022 15 次提交
-
-
由 QingshuChen 提交于
* optimize kunlun/xpu softmax_with_cross_entropy add add unitest *test=kunlun * minor *test=kunlun * minor *test=kunlun * minor *test=kunlun * minor *test=kunlun
-
由 王明冬 提交于
-
由 zyfncg 提交于
* fix slice bug * fix syntax error
-
由 Yan Chunwei 提交于
-
由 Thunderbrook 提交于
* compile for afs api * with pslib
-
由 zhangkaihuo 提交于
* fix bug: 1. atten: set the default value of attn_dropout_rate to None 2. ffn: add activation parameter * for pure fp16 * Add a SparseCsrTensor * remove unused functional * remove const * remove SetMemoberTensor * remove non_zero_nums_, the number of non zero elements of each batch can be obtained from the crows * SparseCooTensor * add SetMember * merge upstream; add SetMember * merge upstream * merge upstream; add newline at end of file * add newline at end of file * remove newline at end of file * remove newline at end of file * stash * user pten::framework::make_ddim * user pten::framework::make_ddim * merge upstream; use the latest mutable_data * merge upstream; use the latest mutable_data * return mutable dense tensor
-
由 caozhou 提交于
* update dist param grad for pass * update unitest * update unitests * fix conflict
-
由 Wangzheee 提交于
* Paddle-Inference:fix_concat_slice * Paddle-Inference:fix_concat_slice * Paddle-Inference:fix_concat_slice * Paddle-Inference:fix_concat_slice * [Paddle-Inference]: fix concat slice * [Paddle-Inference]: fix concat slice * [Paddle-Inference]: fix concat slice
-
由 Yuang Liu 提交于
-
由 huangxu96 提交于
Support the cases that the indices shape size is larger than the arr shape size
-
由 Feiyu Chan 提交于
-
由 zhangbo9674 提交于
* add master weight for opt state_dict * check empty of master weight * strict gpu test * refine unittest
-
由 Aganlengzi 提交于
-
由 Zhanlue Yang 提交于
-
由 Zhanlue Yang 提交于
* Removed debug info * Added automatic code generation for final state Eager Dygraph * Modified backward yaml * Fixed CI Issues
-
- 26 1月, 2022 25 次提交
-
-
由 Leo Chen 提交于
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
-
由 hlygit66666 提交于
* add fuse_relu_depthwise_conv_pass unittest * fix atol and rtol * fix according to review * add FuseBatchNormAddActPass and unittest * Update test_dist_fuse_bn_add_act_pass.py * solve conflict
-
由 chentianyu03 提交于
* cast xpu kernel init * cast xpu kernel * replace with raw cast xpu kernel * fix cast kernel bug * add the missing break * modify namespace and header file
-
由 xiongkun 提交于
-
由 Weilong Wu 提交于
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Selected_Rows inherits from TensorBase * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again * Use paddle/pten/core/enforce and polish code * Support imperative selected_rows_to_lod_tensor * Polish code
-
由 qipengh 提交于
* [MLU]Add conv2d op * [MLU]fix comment * [MLU]adapt NCHW of conv2d op
-
由 Allen Guo 提交于
* sync misc changes * apply comments 01 * fix compile error * remove is_ipu_place check * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * sync changes * restore cmake * update ir cmake and setup.py * update inference_lib cmake * split PR Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
由 yaozhixin 提交于
-
由 Weilong Wu 提交于
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Selected_Rows inherits from TensorBase * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again * Use paddle/pten/core/enforce and polish code * Use pten::DataType instead of using proto_type * Move part of data_type to pten * Polish Code
-
由 YuanRisheng 提交于
* move kernel_primitives * use pten's errors
-
由 Wilber 提交于
* add eigen deps * update
-
由 Allen Guo 提交于
* sync misc changes * apply comments 01 * fix compile error * remove is_ipu_place check * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * sync changes * restore cmake * update ir cmake and setup.py * update inference_lib cmake * restore for split PR Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
由 Leo Chen 提交于
-
由 Weilong Wu 提交于
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Selected_Rows inherits from TensorBase * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again * Use paddle/pten/core/enforce and polish code
-
由 pangyoki 提交于
* add profile record for dygraph * add op type in record * fix little bug * solve conflict
-
由 yaozhixin 提交于
-
由 zyfncg 提交于
-
由 石晓伟 提交于
-
由 Li Min 提交于
* Optimize layer_norm fwd when cols is 1024.
-
由 yaozhixin 提交于
-
由 houj04 提交于
* add sigmoid cross entropy with logits to kl2. test=kunlun * add sigmoid cross entropy with logits to kl2. test=kunlun * follow comments. test=kunlun
-
由 baoachun 提交于
* support npu weight unified H2D copy * remove redundant variable
-
由 houj04 提交于
* fix gradient accumulator bug. test=kunlun * fix typo. test=kunlun * fix typo. test=kunlun * fix unit tests. test=kunlun * using TensorCopySync. test=kunlun * only fix for xpu place. test=kunlun
-
由 Yuang Liu 提交于
-
由 joeqiao12 提交于
-