- 25 3月, 2022 13 次提交
-
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Update ThreadDataRegistry Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 z8hanghuan 提交于
* support multi_dims for tril_triu, *test=kunlun * support multi_dims for tril_triu, *test=kunlun * support multi_dims for tril_triu, *test=kunlun * update xpu.cmake date, support multi_dims for tril_triu, *test=kunlun
-
由 FlyingQianMM 提交于
* add maximum limit for grid of reduce, elementwise and gather * add {} after if
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
* move part sum op kernel * remove deprecated names
-
由 zhouweiwei2014 提交于
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
由 FlyingQianMM 提交于
-
由 0x45f 提交于
* Fix loop index for FillZeroForEmptyGradInputs * Call fill zero in run_program_grad
-
由 seemingwang 提交于
-
由 Aganlengzi 提交于
* [NPU] add merged_momentum * fix * fix device
-
由 Zhangjingyu06 提交于
-
由 zyfncg 提交于
* Scalar support marking data_type in yaml * fix code-gene bug
-
- 24 3月, 2022 17 次提交
-
-
由 Chen Weihang 提交于
* add mean phi kernel * remove original mean kernel * add alias name
-
由 Chen Weihang 提交于
* move batch size like infershape * revert other op change * call infermeta in infershape * adjust batchsize like pos
-
由 zhiboniu 提交于
-
由 Leo Chen 提交于
-
由 jiangcheng 提交于
* fix build_cinn_pass internal var may be control var problem * add annotation and vlog by review advice
-
由 zyfncg 提交于
* support intermediate for saprse api * close intermediate in yaml * fix dygraph_api dep for eager
-
由 zhangbo9674 提交于
* approve amp for intermediate_dygraph * add amp_utils for intermediate_dygraph * add amp needcast check for mlu & npu * test unittest * add SetGradNode for set_stop_gradient && add checktensor for GradientHooks * refine code * refien unittest of imperative_amp for new dygraph * inplace api skip amp * add test_imperative_qat_amp for intermediate amp * refine code * refine test_amp ci strategy * refine unittest code * refine amp_utils code * refine amp getpromotetype for some special op * refine unittest code
-
由 joanna.wozna.intel 提交于
* Correct MultipleQuantizeSquash * Correct logging
-
由 Roc 提交于
* # This is a combination of 10 commits. # The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * fix for win * update for test (timeout) * fix ut * update * fix ut for number count Co-authored-by: Nhlygit66666 <2570058140@qq.com>
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Add EventsWaiter * update * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update * update Error MSG * update EventsWaiter * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 zhangkaihuo 提交于
-
由 caozhou 提交于
* migrate infershape * fix tril_triu infershape error * fix qr_op infershape * add parse qr mode func * move order
-
由 Zhanlue Yang 提交于
* [Refactor] refactored eager_gen.py PR #1 * [Refactor] refactored eager_gen.py PR #1 * Refactored version 2 * Added automatic code generation utils * Fixed merge issues
-
由 seemingwang 提交于
* extract sub-graph * graph-engine merging * fix * fix * fix heter-ps config * test performance * test performance * test performance * test * test * update bfs * change cmake
-
由 0x45f 提交于
* Refine eager run_program OP for dy2st UT * append run_program error string and refine run_program_grad * remove some comments * refine ConstructXGradTensors
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
由 niuliling123 提交于
-
- 23 3月, 2022 10 次提交
-
-
由 Leo Chen 提交于
-
由 jakpiase 提交于
* added missing BF16 activations * added softplus bf16 * minor change * disabled tests for GPU
-
由 furnace 提交于
* [NPU] add npu support for conv3d and conv3d_grad * [NPU] delete failed unittests due to Ascend not support * [NPU] delete debug codes * [NPU] optimize codes, notest * [NPU] remove const_cast * [NPU] optimize for remove const_cast * [NPU] fix written errors
-
由 Zhanlue Yang 提交于
-
由 zhaocaibei123 提交于
* fix benchmark and communicator config * fix bugs of the_one_ps * multi program and fix bug in optimizer * multi program in the_one_ps * public commcontext * ps optimizer multi programs * cvm & datanorm backend * fix dim * fix unittest * fix * the one ps merge * remove comm * add DownpourLiteWorker * all * fix * fix * device worker downpour lite * fix * fix bug in global shuffle * save inference model * fix & add log * fix * remove log * fix * fix save summary * fix * fix pscore * fix * fix * fix * fix * fix * remove logs * fix * fix * fix * fix * fix * add some comments * fix Co-authored-by: Nesythan <esythan@126.com>
-
由 Weilong Wu 提交于
* disable scatter case in test_inplace_eager_fluid * Update register_hook logic * Add register_hook test cases Co-authored-by: Npangyoki <pangyoki@126.com>
-
由 Jiabin Yang 提交于
* suppor sharding api * support multi api for sharding in eager * support multi api for sharding in eager * fix test * fix test coverage
-
由 liutiexing 提交于
* AddAwaitableTask for WorkQueue Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 zyfncg 提交于
* move deformable_conv_grad to phi * move infershape of deformable_conv to phi * adjust some code format * move deformable_conv_v1 to phi
-
由 YuanRisheng 提交于
* remove flatten infermeta * fix bugs when run inference ci * fix bugs when run inference ci * fix bugs when run ci * support infrt * inplace infershape code'
-