- 25 3月, 2022 24 次提交
-
-
由 Aurelius84 提交于
* [Phi] Migrate strided_slice into Phi * [Phi] Migrate strided_slice into Phi * fix compilation problem
-
由 tianshuo78520a 提交于
-
由 Jiaqi Liu 提交于
-
由 zhangbo9674 提交于
* add cast_grad phi kernel * refie unittest * refien unittest * refine unittest * refine include header path * refien xpu cast unittest * refine code
-
由 Aurelius84 提交于
* [Phi] Migrate Adam and Adamw into Phi * fix compile error and unittest ok * fix compile error and unittest ok * fix undefined reference to fLI::FLAGS * test depend on operator * fix cmake * fix xpu compile * fix infrt * fix amp_type_traits * fix amp_type_traits * modify according reviewer * modify according reviewer * fix dtype float16 * fix typo * fix Cmake * fix code style
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Update ThreadDataRegistry Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 z8hanghuan 提交于
* support multi_dims for tril_triu, *test=kunlun * support multi_dims for tril_triu, *test=kunlun * support multi_dims for tril_triu, *test=kunlun * update xpu.cmake date, support multi_dims for tril_triu, *test=kunlun
-
由 FlyingQianMM 提交于
* add maximum limit for grid of reduce, elementwise and gather * add {} after if
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
* move part sum op kernel * remove deprecated names
-
由 Qi Li 提交于
-
由 zhouweiwei2014 提交于
-
由 Liyulingyue 提交于
* Update transforms.py * Update transforms.py * Update transforms.py * Update functional.py
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
由 TTerror 提交于
-
由 xiongkun 提交于
* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic * add python api or close the check_eager=True * fix the compatibility * matmul * disable unittests: test_elementwise_add_op test_scatter_nd_op test_gather_nd_op test_scatter_op test_index_sample_op test_elementwise_add_mkldnn_op * refine the logic of prepara_parameter logic * fix Tensor(gpu) 2 Scalar segment fault. * add multi-attribute. (test_unsqueeze_op); add python_sig_out for customizing op sig out * fix some bugs, support python_out_sig
-
由 王明冬 提交于
-
由 FlyingQianMM 提交于
-
由 0x45f 提交于
* Fix loop index for FillZeroForEmptyGradInputs * Call fill zero in run_program_grad
-
由 seemingwang 提交于
-
由 Aganlengzi 提交于
* [NPU] add merged_momentum * fix * fix device
-
由 Zhangjingyu06 提交于
-
由 zyfncg 提交于
* Scalar support marking data_type in yaml * fix code-gene bug
-
由 FlyingQianMM 提交于
* support get_item where the index is a bool scalar tensor * add unittests for supporting get_item where the index is a bool scalar tensor
-
- 24 3月, 2022 16 次提交
-
-
由 Chen Weihang 提交于
* add mean phi kernel * remove original mean kernel * add alias name
-
由 Chen Weihang 提交于
* move batch size like infershape * revert other op change * call infermeta in infershape * adjust batchsize like pos
-
由 zhiboniu 提交于
-
由 Leo Chen 提交于
-
由 jiangcheng 提交于
* fix build_cinn_pass internal var may be control var problem * add annotation and vlog by review advice
-
由 zyfncg 提交于
* support intermediate for saprse api * close intermediate in yaml * fix dygraph_api dep for eager
-
由 zhangbo9674 提交于
* approve amp for intermediate_dygraph * add amp_utils for intermediate_dygraph * add amp needcast check for mlu & npu * test unittest * add SetGradNode for set_stop_gradient && add checktensor for GradientHooks * refine code * refien unittest of imperative_amp for new dygraph * inplace api skip amp * add test_imperative_qat_amp for intermediate amp * refine code * refine test_amp ci strategy * refine unittest code * refine amp_utils code * refine amp getpromotetype for some special op * refine unittest code
-
由 Aurelius84 提交于
-
由 joanna.wozna.intel 提交于
* Correct MultipleQuantizeSquash * Correct logging
-
由 Ren Wei (任卫) 提交于
-
由 Roc 提交于
* # This is a combination of 10 commits. # The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * fix for win * update for test (timeout) * fix ut * update * fix ut for number count Co-authored-by: Nhlygit66666 <2570058140@qq.com>
-
由 lilong12 提交于
-
由 Guoxia Wang 提交于
-
由 Sing_chan 提交于
* make vcvars64 and cuda_version can be set in xly pipe * make third_party_path reused by ci and build pipe;test=windows_ci_inference;test=windows_op;test=windows_ci
-
由 tianshuo78520a 提交于
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Add EventsWaiter * update * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update * update Error MSG * update EventsWaiter * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-