- 05 Aug, 2022 13 commits
-
-
Committed by Zhen Wang
-
Committed by Sławomir Siwek
* remove v2_transpose_reshape
* matmul_transpose_reshape
* reshape_transpose_matmul
* restore ut
* adjust old ut
* restore parallel UT rules
* feedback from review
-
Committed by OccupyMars2025
-
Committed by joanna.wozna.intel
-
Committed by Qi Li
-
Committed by YuanRisheng
-
Committed by duanboqiang
* migrate kernel
* fix sig order
* remove header files
* remove header
* remove header
* modify logits grad
-
Committed by carryyu
* add fused_multi_transformer post_layer_norm
* add test post_layer_norm
-
Committed by Zhang Jun
* update trt workspace size param
* update
* update
* update
* use int64_t
* use int64_t
* update
* update
-
Committed by zhangxiaoci
-
Committed by Feiyu Chan
* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
* make stft_op use fft from phi/kernels/funcs, clean code
-
Committed by Ruibiao Chen
-
Committed by Allen Guo
* restore to gcc8.2
* test=document_fix
-
- 04 Aug, 2022 26 commits
-
-
Committed by Sławomir Siwek
* Add unit tests
* matmul_v2 + activation
* matmuls + elementwise_add
* matmul_v2 postops
* transform matmul to v2
* opcompat
* fix fusing matmul with multiple outs
* add shape constraints
* remove unused vars
* change pass order
* - Unit tests to be debugged
  - fix
  - refactor
  - diagnostic
  - more diagnostic
  - fix
  - Fix number two
  - fix
  - fix
  - fix
  - alpha added
  - more fixes
  - compilation fix
  - removed diagnostic code
  - cosmetic fixes
* lint
* add alpha constraint
* merge matmul refactor
* trigger CI
* - fix
* - another fix
* code style
* add support for matmul+elementwise_add+activation
* code style
* fix bfloat16 bugs
* change append_binary to append_sum

Co-authored-by: Jacek Czaja <jacek.czaja@intel.com>
-
Committed by WangZhen
-
Committed by Nyakku Shigure
* add name attribute to code-block, test=document_fix
* remove redundant labels, test=document_fix
* remove redundant labels (from upstream), test=document_fix
* more COPY-FROM (try multiple code example), test=document_fix
* empty commit, try to trigger PR-CI-build
* fix some `Examples:` format issues
* fix some ci errors
-
Committed by zhoutianzi666
* add rnn
-
Committed by jakpiase
-
Committed by limingshu
* first commit
* add fp16 ctest files for compare op
* add cpu register of float16 for compare ops
-
Committed by ccrrong
-
Committed by dongfangshenzhu
* add merged_momentum *test=kunlun
* add merged_momentum *test=kunlun
* add fp16 to merged_momentum, *test=kunlun
-
Committed by Weilong Wu
* [Eager] fix slice's input mistake
* add tests for slice
-
Committed by zhiboniu
* phi_fill_diagonal_tensor
* delete extra lines
* update
* add legacy api test
* rename sig
-
Committed by zhiboniu
* phi_generate_proposals_v2
* remove old kernels
* optest add eager_check
* del lod
* update
* update
* update test_detection with_lod
* update nms_util
* remove old nms_util.h
-
Committed by xiaoting
* fix conflicts
* mv unused file
* revert backward.h
* revert lu_unpack kernel
* rm .cu file
* Update lu_unpack_kernel.cc
* format phi yaml
-
Committed by Wilber
* convert support multi block.
* update
-
Committed by Aganlengzi
-
Committed by WangZhen
* Move Function classes to a sub dir
* Format code
-
Committed by kuizhiqing
-
Committed by kuizhiqing
-
Committed by zyfncg
-
Committed by Leo Chen
-
Committed by jiangcheng
-
Committed by houj04
* [XPU] fleet dist_model support xpu. test=kunlun
* [XPU] fleet dist_model support xpu. test=kunlun
* move unittest file location. test=kunlun
-
Committed by 王明冬
-
Committed by Chen Weihang
-
Committed by sneaxiy
-
Committed by Aurelius84
-
Committed by Yuang Liu
Set the lr var's dtype to fp32 when creating an fp16 lr var in the optimizer, unless the user intends to use global fp16. (#44840)
-
- 03 Aug, 2022 1 commit
-
-
Committed by wuyefeilin
-