- 01 Aug, 2022 2 commits
-
-
Committed by danleifeng
Co-authored-by: seemingwang <zsasuke@qq.com>
Co-authored-by: DesmonDay <908660116@qq.com>
Co-authored-by: seemingwang <seemingwang@users.noreply.github.com>
Co-authored-by: Thunderbrook <a754913769@163.com>
Co-authored-by: xuewujiao <105861147+xuewujiao@users.noreply.github.com>
Co-authored-by: root <root@yq01-sys-hic-k8s-v100-box-a225-0693.yq01.baidu.com>
Co-authored-by: Thunderbrook <52529258+Thunderbrook@users.noreply.github.com>
Co-authored-by: root <root@yq01-inf-hic-k8s-a100-ab2-0009.yq01.baidu.com>
Co-authored-by: huwei02 <53012141+huwei02@users.noreply.github.com>
Co-authored-by: yaoxuefeng <yaoxuefeng@baidu.com>
Co-authored-by: lxsbupt <luoxsbupt@163.com>
Co-authored-by: miaoli06 <106585574+miaoli06@users.noreply.github.com>
Co-authored-by: root <root@yq01-inf-hic-k8s-a100-ab2-0008.yq01.baidu.com>
Co-authored-by: chao9527 <33347532+chao9527@users.noreply.github.com>
Co-authored-by: qingshui <qshuihu@gmail.com>
Co-authored-by: yangjunchao <yangjunchao@baidu.com>
-
Committed by Wangzheee
* add varlen_token_prune plugin, pass, convert
-
- 29 Jul, 2022 4 commits
-
-
Committed by Leo Chen
* remove cudaDeviceContext * remove more template * fix rocm compile
-
Committed by JZ-LIANG
* fixed bug for pass & engine * fixed bug for benchmark GPT-3 * add tuner & profiler * add algorithms & config
-
Committed by Leo Chen
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-
Committed by houj04
-
- 27 Jul, 2022 1 commit
-
-
Committed by pangyoki
* fix RemoveNode in fuse_elewise_add_act_pass * fix * change pointer to shared_ptr * fix * fix * fix format * fix * fix graph_safe_remove_nodes
-
- 26 Jul, 2022 5 commits
-
-
Committed by Zhen Wang
* Add a feed op before each input parameter var. * Fix some issues about the unit test build_cinn_pass_test.
-
Committed by Ruibiao Chen
-
Committed by ziyoujiyi
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud comm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error
-
Committed by Ruibiao Chen
* Set more attrs in ReplaceScaleLossGradOp * Fix typos * Fix CI errors * Add UT
-
Committed by Ruibiao Chen
-
- 25 Jul, 2022 1 commit
-
-
Committed by lyq
-
- 21 Jul, 2022 2 commits
-
-
Committed by zhaocaibei123
* add slot attr for push sparse op * add pybind * remove fleet * add unittest * fix
-
Committed by xiaoxiaohehe001
* convfusionfp16 * convfusionfp16 * convfusionfp16
-
- 20 Jul, 2022 7 commits
-
-
Committed by zmxdream
* Update ps_gpu_wrapper.h * Update ps_gpu_wrapper.h * Update ps_gpu_wrapper.cc
-
Committed by danleifeng
* add adam/sharedadam optimizer for gpups; edit optimizer struct; test=develop
-
Committed by houj04
* device_guard support xpu. test=kunlun * sum op of xpu support LoDTensorArray. add test for while op of xpu. test=kunlun.
-
Committed by zmxdream
* fix FleetWrapper initialize
-
Committed by Ruibiao Chen
* Add dependency for read op in standalone executor * Fix CI errors * Add UT * add_dependency -> dependency_utils * Fix CI errors
-
Committed by tianshuo78520a
-
- 19 Jul, 2022 2 commits
-
-
Committed by huzhiqiang
-
Committed by Ruibiao Chen
* Rename BOOST_GET macros * Fix conflicts
-
- 18 Jul, 2022 1 commit
-
-
Committed by 王明冬
-
- 16 Jul, 2022 1 commit
-
-
Committed by Leo Chen
-
- 15 Jul, 2022 1 commit
-
-
Committed by Ruibiao Chen
-
- 14 Jul, 2022 4 commits
-
-
Committed by wanghuancoder
* Compilation optimization
-
Committed by YuanRisheng
* adapt mkldnn kernel in PHI * fix ci compile bugs * fix compile bugs * fix compile bugs * fix compile bugs * fix compile bugs * delete comment * fix compile bugs in windows-inference * delete code for coverage * modify code by review * modify code by review * add todo * fix compile bugs * fix compile bugs * fix compile bugs * fix unittest bugs
-
Committed by Ruibiao Chen
-
Committed by WJJ1995
-
- 13 Jul, 2022 3 commits
- 12 Jul, 2022 4 commits
-
-
Committed by joanna.wozna.intel
-
Committed by 王明冬
-
Committed by Sławomir Siwek
* add method for post ops * format code * gpd * format style * add matmul+act test * implement matmul+activation * whitespaces * code style * python code format * Increase UT timeout * code format * update style * generalize activation fuse passes * change order * Unify activation GPD * Revert changes with op_act * remove softmax mkldnn attrs * set common name for act attributes * whitespace * append postops by helper function * ut style * revert changes related to quantization * Reduce redundancy * reduce number of parameters * trigger CI * validate attribute * trim unit test
-
Committed by xiaoxiaohehe001
-
- 11 Jul, 2022 2 commits
-
-
Committed by 王明冬
-
Committed by Allen Guo
* add authors Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai> * squash cpp changes 1/N * clean code Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Zhaorui Chen <zhaoruic@graphcore.ai>
-