- 24 3月, 2022 5 次提交
-
-
由 zhiboniu 提交于
-
由 Roc 提交于
* # This is a combination of 10 commits. # The first commit's message is: add expert count op add ut for expert_count # This is the 2nd commit message: update UT only for cuda # This is the 3rd commit message: fix for rocm # This is the 4th commit message: update ut # This is the 5th commit message: add moe module # This is the 6th commit message: add expert count op add ut for expert_count # This is the 7th commit message: update UT only for cuda # This is the 8th commit message: update ut # This is the 9th commit message: add moe module # This is the 10th commit message: make expert count private * add assign pos op * fix upper num name * add api _assign pos * add ut for assign pos op * update date * fix for win * update for test (timeout) * fix ut * update * fix ut for number count Co-authored-by: Nhlygit66666 <2570058140@qq.com>
-
由 caozhou 提交于
* migrate infershape * fix tril_triu infershape error * fix qr_op infershape * add parse qr mode func * move order
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
由 niuliling123 提交于
-
- 23 3月, 2022 12 次提交
-
-
由 jakpiase 提交于
* added missing BF16 activations * added softplus bf16 * minor change * disabled tests for GPU
-
由 furnace 提交于
* [NPU] add npu support for conv3d and conv3d_grad * [NPU] delete failed unittests due to Ascend not support * [NPU] delete debug codes * [NPU] optimize codes, notest * [NPU] remove const_cast * [NPU] optimize for remove const_cast * [NPU] fix written errors
-
由 zhaocaibei123 提交于
* fix benchmark and communicator config * fix bugs of the_one_ps * multi program and fix bug in optimizer * multi program in the_one_ps * public commcontext * ps optimizer multi programs * cvm & datanorm backend * fix dim * fix unittest * fix * the one ps merge * remove comm * add DownpourLiteWorker * all * fix * fix * device worker downpour lite * fix * fix bug in global shuffle * save inference model * fix & add log * fix * remove log * fix * fix save summary * fix * fix pscore * fix * fix * fix * fix * fix * remove logs * fix * fix * fix * fix * fix * add some comments * fix Co-authored-by: Nesythan <esythan@126.com>
-
由 zyfncg 提交于
* move deformable_conv_grad to phi * move infershape of deformable_conv to phi * adjust some code format * move deformable_conv_v1 to phi
-
由 YuanRisheng 提交于
* remove flatten infermeta * fix bugs when run inference ci * fix bugs when run inference ci * fix bugs when run ci * support infrt * inplace infershape code'
-
由 whs 提交于
-
由 KP 提交于
* Add stft_op. * Add stft_grad_op. * Add stft_op unittest. * [DLTP-45176] Add complex compatibility in static mode for stft api. * [DLTP-45176] Add complex compatibility in static mode for stft api. * Add doc. * Update unitests of stft op. * Update spectral helper. * fix coding style.
-
由 niuliling123 提交于
-
由 jiangcheng 提交于
-
由 xiongkun 提交于
* transfer unsqueeze to phi * fix conflict * add squeeze * add infershape * fix xpu and npu error
-
由 YuanRisheng 提交于
* move activation * fix bugs when run ce
-
由 Chen Weihang 提交于
* add full_batch_size_like phi kernel * remove fill constant bs like * update year
-
- 22 3月, 2022 5 次提交
-
-
由 hong 提交于
* move embeding to phi; * update sig; test=develop * move reset impl to phi; test=develop * remove old register; test=develop * fix cpu bf16 bug; test=develop * fix lookup speed error * polish code * fix paddle throw type
-
由 Chen Weihang 提交于
* add reverse phi kernel * add reverse infermeta * remove original reverse op kernl & infershape
-
由 Siming Dai 提交于
* add out_size shape for graph_send_recv * fix bug in register kernel: no const int& support * add out_size in infermeta * change unittest * fix unittest * fix out_size default value * fix doc * delete arg mapping * add sig * move -1 to 0 * move -1 to 0
-
由 chentianyu03 提交于
* modify out and out_grad order in reduce_grad_kernel * delete unsed boolReduceKernel * fix conflict
-
由 crystal 提交于
* fix group_norm address misalignment * fix vectorize * fix code * fix vectorize length * optimize code
-
- 21 3月, 2022 6 次提交
-
-
由 chentianyu03 提交于
* add pad3d kernel into phi * add pad3d infermeta * fix build error * remove raw pad3d infershape function
-
由 zhangyikun02 提交于
-
由 From00 提交于
* Move conv-transpose OPs to phi * Fix CI errors * Fix CI errors
-
由 From00 提交于
-
由 Allen Guo 提交于
* sync changes * copy sOpNamescope * fix UTs * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * fix code-format * fix compile error * add comments for feed_op Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
由 zn 提交于
-
- 19 3月, 2022 2 次提交
-
-
由 zyfncg 提交于
-
由 hong 提交于
* add infer meta; test=develop * add histogram infer meta; test=develop * fix unitest bug; test=develop * format; test=develop * format; test=develop * bn not use new infer meta; test=develop * add infer meta; test=develop * fixbug; test=develop * fix bug; * recover unitest; test=develop
-
- 18 3月, 2022 8 次提交
-
-
由 shentanyue 提交于
* add gelu * fix gelu * add log_softmax * add prelu kernel and prelu/gelu/logsoftmax infershape * fix * fix * fix * fix * fix ci * log_softmax rewrite * fix * fix * fix conflict * fix compile error * fix comment * fix * ci_fix Co-authored-by: NYan Li <liyan665@gmail.com>
-
由 Zhang Zheng 提交于
* first commit * fix compile error * support std::vector<std::srting> * fix * fix op support on GPU by chenweihang * pass test * infershape * add set_dtype * fix order * fix * unify the impl of dt and sr * fix
-
由 furnace 提交于
[NPU] fix fp16 (PART I)
-
由 zyfncg 提交于
* move infershape of roi_pool to phi * polish code
-
由 xiongkun 提交于
* tranfer kthvalue from fluid to phi * transfer infershape
-
由 chentianyu03 提交于
* move reduce_mean_grad kernel into phi * move reduce_max/min_grad into phi * remove raw max/min grad kernel * fix bug * fix max/min grad error * move all reduce_grad kernel into one file * add prod grad kernel * add infermeta for prod kernel
-
由 furnace 提交于
[NPU] fix fp16 (PART II)
-
由 Zhang Zheng 提交于
* Optimize perf of softmax_with_cross_entropy_bwd * fix * fix
-
- 17 3月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* move assign kernel init commit * change vec<tensor> to vec<tensor*> * support tensor array * support api declare * fix test_list failed * fix npu and xpu failed * fix infrt failed * remove assign array size in operator * move assign sr header into sr dir * add infermeta for assign * test op success * fix test_list failed * fix kunlun failed * add set host allocator in tests * support tensor array in arg ctx * open set layout in share_meta * fix meta tensor layout error * fix test failed
-